Location: Remote with a 6 hour overlap with EST Remote | Full-time Compensation: $75K - $100K We are hiring on behalf of our client who is seeking a Web Scraping Specialist to join a specialized technical team focused on…
Location: Remote with a 6 hour overlap with EST Remote | Full-time Compensation: $75K - $100K We are hiring on behalf of our client who is seeking a Web Scraping Specialist to join a specialized technical team focused on…
Location: Remote with a 6 hour overlap with EST Remote | Full-time Compensation: $75K - $100K We are hiring on behalf of our client who is seeking a Web Scraping Specialist to join a specialized technical team focused on…
Location: Remote - Asia timezone Remote | Full-time Compensation: $48K - $72K We are hiring on behalf of our client who operates cutting-edge digital arenas where artificial intelligence agents compete publicly alongside…
Skills: KOL Management, Community Growth, Bilingual Chinese and English, Performance Marketing, Deal Negotiation
Location: Remote - Asia timezone Remote | Full-time Compensation: $48K - $72K We are hiring on behalf of our client who operates cutting-edge digital arenas where artificial intelligence agents compete publicly alongside…
Skills: KOL Management, Community Growth, Bilingual Chinese and English, Performance Marketing, Deal Negotiation
Location: Remote - Asia timezone Remote | Full-time Compensation: $48K - $72K We are hiring on behalf of our client who operates cutting-edge digital arenas where artificial intelligence agents compete publicly alongside…
Skills: KOL Management, Community Growth, Bilingual Fluency (Chinese/English), Deal Negotiation, Performance Marketing
Hong Kong, Hong Kong Island, Hong Kong S.A.R. · Remote Solely
$48k–$72k/yr
Mid level
Location: Remote - Asia timezone Remote | Full-time Compensation: $48K - $72K We are hiring on behalf of our client who operates cutting-edge digital arenas where artificial intelligence agents compete publicly alongside…
Skills: KOL Management, Community Growth, Bilingual Fluency (Chinese/English), Deal Negotiation, Performance Marketing
Location: Remote - Asia timezone Remote | Full-time Compensation: $48K - $72K We are hiring on behalf of our client who operates cutting-edge digital arenas where artificial intelligence agents compete publicly alongside…
Skills: KOL Management, Community Growth, Bilingual Chinese and English, Performance Marketing, Deal Negotiation
Location: New York, United States (Hybrid) - Must be able to work in NYC on a hybrid basis Hybrid | Full-time Compensation: $150K - $175K Our client is a well-funded financial technology firm backed by leading institutio…
Skills: Requirements Elicitation, Functional Specification Writing, User Acceptance Testing (UAT), Gap Analysis, Requirements Traceability
Location: Toronto (Office) On-site | Full-time Compensation: $140K - $260K Our client is a high-growth, open-source infrastructure provider dedicated to the advancement of AI agents and web-based applications. By develop…
Skills: Applied AI, Full-Stack Development, API Design, Technical Writing, Framework Development
Location: Shanghai · Onsite Onsite | Full-time Compensation: $100K–$200K + equity We are hiring on behalf of our client, an innovative, collaborative financial technology platform who is seeking an exceptional AI Enginee…
Skills: AI, Investment Research, Data Analytics, Trend Monitoring, Strategy Development
Location: Shanghai · Onsite On-Site | Full-time Compensation: ~$100K + equity We are hiring on behalf of our client, an innovative fin-tech organization who is seeking a high-caliber Product Manager to join its team onsi…
Skills: Product Management, AI Agent Orchestration, Fintech, Trading Systems, UGC Platforms
Location: New York, United States (Hybrid) Hybrid | Full-time Compensation: $180K - $320K Our client is the largest decentralized perpetual exchange on Arbitrum, having processed over $20 billion in cumulative trading vo…
Location: New York, United States (Hybrid) Hybrid | Full-time Compensation: $180K - $320K We are hiring on behalf of our client, a leading firm operating at the intersection of advanced financial markets and digital asse…
Amsterdam, North Holland, Netherlands · Remote Solely
$180k–$225k/yr
Senior+
Location: Remote - Based in EST to CET Remote | Full-time Compensation: $180K - $225K Our client is building the foundational product layer for the next major wave of onchain participants: autonomous, always-on, and ince…
Skills: Product Strategy, AI Agent Development, Web3, Blockchain, Mechanism Design
Location: Remote - Based in EST to CET Remote | Full-time Compensation: $180K - $225K Our client is building the foundational product layer for the next major wave of onchain participants: autonomous, always-on, and ince…
Skills: Product Strategy, AI Agent Development, Web3, Blockchain, Mechanism Design
Location: Remote - Based in EST to CET Remote | Full-time Compensation: $180K - $225K Our client is building the foundational product layer for the next major wave of onchain participants: autonomous, always-on, and ince…
Skills: Product Strategy, AI Agent Development, Web3, Blockchain, Mechanism Design
Philadelphia, Pennsylvania, United States · Remote Solely
$180k–$225k/yr
Senior+
Location: Remote - Based in EST to CET Remote | Full-time Compensation: $180K - $225K Our client is building the foundational product layer for the next major wave of onchain participants: autonomous, always-on, and ince…
Skills: Product Strategy, AI Agent Development, Web3, Blockchain, Mechanism Design
Washington, District of Columbia, United States · Remote Solely
$180k–$225k/yr
Senior+
Location: Remote - Based in EST to CET Remote | Full-time Compensation: $180K - $225K Our client is building the foundational product layer for the next major wave of onchain participants: autonomous, always-on, and ince…
Skills: Product Strategy, AI Agent Development, Web3, Blockchain, Mechanism Design
Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.
$75k–$100k/yr
Full-time
Comprehensive Benefits Package, Equity Package
Posted 1d ago
~40 hrs/week
Remote in New Jersey, United States
Responsibilities
Develop and optimize high-performance code to extract massive amounts of web data for AI model training. Manage complex data pipelines, including cleaning, formatting, and storing data in NoSQL databases.
Requirements
Requires advanced proficiency in Python or JavaScript and experience with scraping frameworks like Scrapy or Selenium. Must have expertise in distributed architectures, cloud infrastructure, and handling dynamic web content.
Full job description
Location: Remote with a 6 hour overlap with EST
Remote | Full-time
Compensation: $75K - $100K
We are hiring on behalf of our client who is seeking a Web Scraping Specialist to join a specialized technical team focused on building the infrastructure that delivers massive amounts of web data for the training of advanced AI models. This organization operates a massive distributed crawler and manages complex pipelines for ingesting, segmenting, and annotating billions of data points, including videos, transcripts, and audio files.
The successful candidate will lead efforts to gather and analyze data, optimize scraping processes, and support the scaling of high-quality public web data accessibility. This role is ideal for a lean, technical builder who thrives in a fast-paced environment without bureaucratic red tape.
Key Responsibilities:
Code Development: Write, test, and refine high-performance code to extract data from various online sources, ensuring maximum reliability and efficiency.
Data Retrieval: Manage complex data retrieval tasks, including handling pagination and dynamic content loaded via AJAX.
Data Quality: Clean and format extracted data to ensure it meets rigorous quality standards for downstream analysis and processing.
Database Management: Store and manage scraped data in appropriate databases, optimizing for both access speed and long-term data integrity.
Monitoring and Maintenance: Regularly monitor scraping processes and infrastructure to identify and resolve issues, ensuring a continuous and stable data flow.
Extraction Expertise: Demonstrated ability to extract data from complex websites with minimal supervision, supported by a portfolio of past projects.
Technical Proficiency: Advanced skills in Python or JavaScript, specifically with libraries and frameworks such as BeautifulSoup, Scrapy, or Selenium.
Advanced Programming: Strong knowledge of asynchronous programming, multithreading, and distributed scraping architectures.
Web Fundamentals: In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).
Data Storage: Experience with NoSQL databases (e.g., MongoDB, Cassandra), including the ability to design efficient storage solutions.
Cloud Infrastructure: Experience deploying and managing large-scale scraping jobs using cloud services such as AWS, Google Cloud, or Azure.
Preferred Skills: Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis; active participation in relevant open-source projects.
Competitive Compensation: A highly competitive salary ranging from $75,000 to $100,000, complemented by a comprehensive benefits and equity package.
Impactful Work: The opportunity to work at the forefront of AI development and web-scale knowledge graph creation.
High-Output Culture: A professional environment that prioritizes low ego, technical autonomy, and rapid execution.
Remote Flexibility: This is a remote position requiring a 6-hour overlap with the core team's schedule.
Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.
Commitment to Equality and Accessibility:
At MLabs, we are committed to offer equal opportunities to all candidates. We ensure no discrimination, accessible job adverts, and providing information in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process or you would like to see the job-advert in an accessible format please let us know at the earliest opportunity by emailing [email protected].
MLabs Ltd collects and processes the personal information you provide such as your contact details, work history, resume, and other relevant data for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd’s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by contacting [email protected].
Related keywords
Web ScrapingAI Training DataDistributed CrawlerAJAXPaginationNoSQLMongoDBCassandraAWSGoogle CloudAzurePythonJavaScriptBeautifulSoupScrapySelenium
We are a Haskell, Rust, Blockchain and AI consultancy.
Industry
IT Services and IT Consulting
Company size
51-200 employees
Founded
2018
Headquarters
London
LinkedIn followers
59,338
MLabs Consulting helps to setup project specification, implementation, management and maintenance of technical projects for AI, Fintech, Information Technology and more. We specialise in functional programming, compilers, AI, DevOps and full-stack development.
We are a Haskell, Rust, Blockchain and AI consultancy.
Industry
IT Services and IT Consulting
Company size
51-200 employees
Founded
2018
Headquarters
London
LinkedIn followers
59,338
MLabs Consulting helps to setup project specification, implementation, management and maintenance of technical projects for AI, Fintech, Information Technology and more. We specialise in functional programming, compilers, AI, DevOps and full-stack development.