<h2><strong>About the Role</strong></h2>
<p data-start="342" data-end="724">As a Research Engineer at Metis, you’ll work on building the next generation of autonomous post-training systems that leverage our Mantis platform. You’ll operate at the intersection of cutting-edge ML research and scalable engineering, designing, implementing, and deploying algorithms that improve how AI agents learn from feedback, synthetic data, and real-world interactions.</p>
<p data-start="342" data-end="724">You’ll move seamlessly between papers and production, leading large-scale experiments, creating optimized training pipelines, and helping shape the future of post-training autonomy. You’ll have significant ownership, high compute budgets, and the mandate to push the state of the art in applied reinforcement learning and preference optimization.</p>
<h2><strong>What You'll Do</strong></h2>
<ul>
<li data-stringify-indent="0" data-stringify-border="0">Research and help build an autonomous post-training agent leveraging the Mantis platform</li>
<li data-stringify-indent="0" data-stringify-border="0">Design and execute large-scale experiments on synthetic data generation and algorithmic architecture</li>
<li data-stringify-indent="0" data-stringify-border="0">Develop and refine methods for reinforcement learning, reward modeling, and human feedback integration</li>
<li data-stringify-indent="0" data-stringify-border="0">Collaborate cross-functionally with Core and Platform Engineering to deploy and evaluate models in production settings</li>
<li data-stringify-indent="0" data-stringify-border="0">Publish or contribute to leading-edge research in the post-training domain</li>
<li data-stringify-indent="0" data-stringify-border="0">Use tooling and compute efficiently to iterate on experimental pipelines and accelerate research velocity</li>
</ul>
<h2><strong>Requirements</strong></h2>
<ul>
<li data-stringify-indent="0" data-stringify-border="0">Deep experience in machine learning, preferably reinforcement learning, post-training, or alignment research</li>
<li data-stringify-indent="0" data-stringify-border="0">Demonstrated research contributions, ideally published papers (e.g., ICML, NeurIPS) or public implementations</li>
<li data-stringify-indent="0" data-stringify-border="0">Strong proficiency in Python and ML frameworks (PyTorch, JAX, or TensorFlow)</li>
<li data-stringify-indent="0" data-stringify-border="0">Comfort with distributed training, high-throughput data pipelines, and large-scale experiment management</li>
<li data-stringify-indent="0" data-stringify-border="0">Ability to reason independently, formulate hypotheses, and run experiments from idea → insight → product impact</li>
</ul>
<h2><strong>Compensation &amp; Benefits</strong></h2>
<ul>
<li>Base: $200,000–$1,000,000</li>
<li>Significant Equity</li>
<li>Full medical, dental, and vision</li>
<li>Wellness &amp; L&amp;D stipend</li>
<li>Equinox membership</li>
<li>Breakfast, lunch, and dinner provided (unlimited DoorDash)</li>
<li>$25,000 housing stipend</li>
</ul>
<h2><strong>About Metis</strong></h2>
<p>Metis helps enterprises and labs build the most reliable AI agents by leveraging post-training. Our platform enables the creation, improvement, and deployment of the most capable frontier agents designed for rigorous, real-world workflows.</p>
<div class="p-rich_text_section"><strong data-stringify-type="bold">Momentum</strong></div>
<ul class="p-rich_text_list p-rich_text_list__bullet p-rich_text_list--nested" data-stringify-type="unordered-list" data-list-tree="true" data-indent="0" data-border="0">
<li data-stringify-indent="0" data-stringify-border="0">0 → six-figure monthly revenue in the last six weeks</li>
<li data-stringify-indent="0" data-stringify-border="0">Working with several Fortune 500 enterprises &amp; frontier AI labs</li>
<li data-stringify-indent="0" data-stringify-border="0">Growing 150%+ MoM</li>
</ul>
<div class="p-rich_text_section"><strong data-stringify-type="bold">Backed by</strong></div>
<div class="p-rich_text_section">Y Combinator, CRV, and executives from OpenAI, Google, Mercor, NVIDIA, and others.</div>