Clera home
·Dashboard

Jobs at Reflection (Now Hiring) — 10 open

Reflection logoReflection

Member of Technical Staff - Web Crawl Engineer

San Francisco, California, United States · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Web Crawling, Distributed Systems, URL Frontier Management, Content Extraction, HTML Parsing

Reflection logoReflection

Strategic Finance Manager

San Francisco, California, United States · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Financial Modeling, Deal Economics, Revenue Modeling, Commercial Finance, Cost Modeling

Reflection logoReflection

Engagement Manager

San Francisco, California, United States · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Engagement Management, Technical Program Management, AI/ML Concepts, Software Architecture, Stakeholder Management

Reflection logoReflection

Workplace Experience Manager (NY)

New York, New York, United States · On-site

Mid level$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Office Management, Facilities Management, Vendor Management, Onboarding, Event Planning

Reflection logoReflection

Director, HR Business Partner

San Francisco, California, United States · On-site

Senior+$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Strategic HR Advisory, Organizational Design, Workforce Planning, Performance Management, Employee Relations

Reflection logoReflection

HR Generalist

London, England, United Kingdom · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: UK Employment Law, Employee Relations, People Operations, HR Business Partnering, Performance Management

Reflection logoReflection

Accounting Manager

San Francisco, California, United States · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Monthly Close, Bookkeeping, Accounts Payable, Payroll Management, Tax Compliance

Reflection logoReflection

Forward Deployed Engineer - AI Engineer

Seoul, South Korea · On-site

Mid level$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Python, Typescript, Docker, Kubernetes, CI/CD

Reflection logoReflection

Forward Deployed Engineer, Lead - AI Engineer

Seoul, South Korea · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Python, Typescript, Docker, Kubernetes, CI/CD

Reflection logoReflection

Engagement Manager

Seoul, South Korea · On-site

Senior$2.1B raised

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI research…

Skills: Engagement Management, Technical Program Management, AI/ML Concepts, Stakeholder Management, Korean Fluency

Reflection logo

Member of Technical Staff - Web Crawl Engineer

Reflection

San Francisco, California, United States • On-site

Apply
Senior

Tired of cold applications?

Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.

  • Full-time
  • Salary, Equity, Medical Insurance, Dental Insurance, Vision Insurance, Life Insurance
  • Posted 9d ago
  • ~40 hrs/week

Responsibilities

Build and operate large-scale web crawling infrastructure to discover and acquire high-value content for AI model training. Design systems for URL discovery, content extraction, and distributed crawl orchestration while ensuring reliability and scalability.

Requirements

Requires extensive experience building internet-scale data collection systems and working with distributed frameworks like Ray or Spark. Candidates must be proficient in content extraction, browser automation, and managing petabyte-scale datasets.

Full job description

Our Mission

Reflection’s mission is to build open superintelligence and make it accessible to all.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.

About the Role

The web is one of the most important sources of information for frontier AI systems. The quality, coverage, freshness, and diversity of web data directly influence model capabilities.

As a member of the Data Team, your mission is to build and operate large-scale web crawling systems that continuously discover, acquire, and process content from across the internet. You will own the infrastructure that powers web-scale data collection, from URL discovery and scheduling to distributed crawling, content extraction, and dataset delivery.

You will work directly with world-class researchers to understand which parts of the web matter most for model performance and build systems that efficiently acquire high-value content at scale.

This role is ideal for engineers who love building distributed systems, optimizing large-scale crawlers, and solving the unique technical challenges of collecting data from the modern web.

What You’ll Do

Working closely with our pre-training, infrastructure, and data quality teams, you will:

  • Build and operate web-scale crawling infrastructure capable of continuously collecting data across billions of URLs

  • Design and optimize URL discovery, prioritization, scheduling, and crawl orchestration systems

  • Develop distributed crawlers that efficiently acquire content while respecting site constraints and operational requirements

  • Build systems for content extraction, rendering, parsing, and normalization across diverse web formats

  • Improve crawl coverage, freshness, efficiency, and quality through measurement and experimentation

  • Design infrastructure for large-scale recrawling, change detection, and incremental updates

  • Develop specialized crawlers for high-value domains, dynamic websites, and difficult-to-access content sources

  • Analyze crawl performance and web coverage to identify gaps, inefficiencies, and opportunities for improvement

  • Build observability, monitoring, and reliability systems for large-scale crawl operations

  • Debug production issues and continuously improve the performance, scalability, and resilience of crawling infrastructure

About You

  • Passionate about web-scale systems and the challenges of collecting information from the internet

  • Curious about how web data influences model capabilities and willing to iterate based on downstream results

  • Comfortable balancing crawl quality, coverage, freshness, and operational efficiency

  • Enjoy working at the intersection of distributed systems, data infrastructure, and AI

  • Able to collaborate closely with researchers, infrastructure engineers, and data quality teams

Skills and Qualifications

  • Experience building large-scale web crawling, search indexing, content acquisition, or internet-scale data collection systems

  • Strong understanding of crawling architectures, URL frontier management, scheduling, and distributed crawl coordination

  • Experience with large-scale distributed systems using technologies such as Ray, Spark, Beam, Flink, or similar frameworks

  • Familiarity with content extraction, HTML parsing, browser automation, rendering systems, and modern web technologies

  • Experience operating systems that process petabyte-scale datasets

  • Strong systems engineering skills, including reliability, observability, performance optimization, and debugging

  • Experience designing experiments and using data to improve crawl quality, coverage, and efficiency

  • Excellent communication skills and the ability to reason clearly about system tradeoffs and operational constraints

Nice to Have

  • Experience building search engines, web indexes, or internet-scale crawling platforms

  • Familiarity with anti-bot systems, dynamic web content, browser automation, and large-scale extraction pipelines

  • Understanding of how web data is used in training and evaluating large language models

  • Experience with distributed storage systems, content deduplication, and web-scale dataset management

What We Offer:

We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

  • Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.

  • Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.

  • Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.

  • Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.

  • Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Related keywords

Web CrawlSuperintelligenceOpen Weight ModelsDistributed SystemsURL DiscoveryContent ExtractionRaySparkBeamFlinkHTML ParsingBrowser AutomationPetabyte-scaleSearch EnginesWeb IndexesAnti-bot Systems

About Reflection

LinkedInVisit site

AI lab building frontier open models.

Industry
Software Development
Company size
51-200 employees
Headquarters
New York, NY
LinkedIn followers
23,435
Total funding
$2.1B

Reflection is an AI lab building frontier open weight models. Our team previously built frontier LLMs at labs like DeepMind, OpenAI, and Anthropic. We believe AI should be built in the open, with transparent research and collaborative development. That means giving enterprises, governments, and sovereign entities true ownership and control of AI that performs at the highest level. Our mission: frontier open intelligence, accessible to all.

Offices: 124 E 14th St, New York, NY 10003, US · 144 2nd St, San Francisco, California 94105, US · 221 Pentonville Road, London, England WC1X 9DJ, GB

Foundational AISoftwareDeveloper ToolsDatabaseArtificial Intelligence (AI)Computer Vision
View all jobs at Reflection

About Reflection

LinkedInVisit site

AI lab building frontier open models.

Industry
Software Development
Company size
51-200 employees
Headquarters
New York, NY
LinkedIn followers
23,435
Total funding
$2.1B

Reflection is an AI lab building frontier open weight models. Our team previously built frontier LLMs at labs like DeepMind, OpenAI, and Anthropic. We believe AI should be built in the open, with transparent research and collaborative development. That means giving enterprises, governments, and sovereign entities true ownership and control of AI that performs at the highest level. Our mission: frontier open intelligence, accessible to all.

Offices: 124 E 14th St, New York, NY 10003, US · 144 2nd St, San Francisco, California 94105, US · 221 Pentonville Road, London, England WC1X 9DJ, GB

Foundational AISoftwareDeveloper ToolsDatabaseArtificial Intelligence (AI)Computer Vision
View all jobs at Reflection

Similar companies hiring

Amazon (4947)Prolific (3401)AgileEngine (1668)Bosch (1656)Speechify (1456)Google (969)Booz Allen Hamilton (779)Microsoft (722)Transport AI (669)SAP (579)Salesforce (515)Meta (456)
Clera home

Your AI-talent agent. Connecting talents with dream jobs.

Earn $5,000

Tools

  • Salary Calculator
  • Resume Review
  • Startup Map

Explore

  • Jobs
  • Discover Jobs
  • Companies
  • Acquihire
  • Referral

Company

  • Manifesto
  • Engineering
  • We are hiring!
  • FAQs
  • Blog
  • Press

Tools

  • Salary Calculator
  • Resume Review
  • Startup Map

Explore

  • Jobs
  • Discover Jobs
  • Companies
  • Acquihire
  • Referral

Company

  • Manifesto
  • Engineering
  • We are hiring!
  • FAQs
  • Blog
  • Press

© 2026 Clera Labs, Inc.

PrivacyTermsBug Bounty