ML Infrastructure Engineer

San Francisco · On-site$200k – $300k + EquityVisa Sponsorship Available

About this role

We are looking for an ML Infrastructure Engineer with 3+ years of experience to own and scale the training and inference stack at a fast-growing AI document processing platform. You'll be a strong generalist who understands the mechanics of how ML models work – from serving and monitoring to building robust data pipelines – and can improve inference performance, reliability, and cost efficiency. This is a high-impact IC role where you'll work closely with ML researchers to ensure models are deployed quickly and reliably, and that infrastructure is never a bottleneck for the products being served. The ideal candidate is AI-native from the get-go, comfortable with 1-to-3 node training and single-to-double node serving, and thrives in a fast-paced startup environment.

What you will be doing

Building and maintaining model serving infrastructure – improving inference speed, monitoring, and reliability to ensure it's never a bottleneck for customers

Setting up and improving training infrastructure for models ranging from 300M to 30B parameters across 1-to-3 node environments

Developing observability, logging, and monitoring systems across the ML stack

Building internal data pipelines and tooling to help ML researchers move faster from experiment to production

Architecting infrastructure to arbitrate inference between multiple cloud providers while optimizing for accuracy, latency, and cost

Reducto offers an API-style platform that converts complex documents into inputs for large language models, serving hundreds of customers from startups to Fortune 10 and processing tens of millions of pages monthly from its San Francisco headquarters.

IndustryAI/ML

Team Size11-50

WorkspaceOn-site

StageSeries B

Founded2023

Location

San Francisco, CA, USA

Investors

Andreessen Horowitz ·Benchmark Capital ·BoxGroup ·First Round Capital ·Liquid 2 Ventures ·SV Angel ·Y Combinator

Websitereducto.ai

LinkedInLinkedIn

About the Team

Team Distribution

Engineering54%
Sales21%
Operations8%
Other Specialized6%
Leadership4%

Where the Team Studied

1.Massachusetts Institute of Technology
2.University of California, Berkeley
3.Stanford University
4.Carnegie Mellon University
5.University of Michigan

Team Worked At

Microsoft
Google
Amazon
Oracle
IBM

Culture & values

We work in person in our San Francisco office.

We care a lot about the product we're building.

Our work is centered around what customers need.

We don't do recurring meetings.

Everyone is involved with everything—from the product roadmap to working with the end user.

Funding History

Cumulative Funding

Life at Reducto

Know someone who'd be great for this?

ML Infrastructure Engineer

About this role

What you will be doing

About the Team

Team Distribution

Where the Team Studied

Team Worked At

Culture & values

Funding History

Life at Reducto

Tools

Explore

Company

Tools

Explore

Company

About the Team

Team Distribution

Where the Team Studied

Team Worked At

Funding History

Life at Reducto

ML Infrastructure Engineer

About this role

What you will be doing

Company at a glance

About the Team

Team Distribution

Where the Team Studied

Team Worked At

Culture & values

Funding History

Life at Reducto

Tools

Explore

Company

About the Team

Team Distribution

Where the Team Studied

Team Worked At

Funding History

Life at Reducto