About this role
<div class="textLayer">Emergent builds autonomous coding agents that replace traditional software development by generating, testing, and deploying production applications directly from plain-language intent. Our systems run in production at global scale and are used to build millions of real applications.</div>
<div class="textLayer"><br>Since public launch, Emergent has reached <strong>$50M ARR in 7 months</strong>. <strong>5M+ users across 190+ countries</strong> have built <strong>6M+ applications</strong> on Emergent. We’ve raised <strong>$100M</strong>, backed by <strong>Khosla Ventures, SoftBank, Google, Lightspeed, Prosus, Together, and Y Combinator</strong>.</div>
<div class="textLayer"><br>We’re solving the hard part of AI-driven software creation: correctness, reliability, security, and scale in real production systems. The team is built by repeat founders, Olympiad medalists, IIT & IIM alumni, and leaders from Google, Amazon, and Dropbox.</div>
<div class="textLayer"><br><strong>We’re hiring builders who want ownership, speed, and impact at global scale.</strong></div>
<p class="pf0"><strong><span class="cf1">The Role:</span></strong></p>
<p class="pf0"><span class="cf1">We’re building AI coding agents that can plan, build, test, and ship real software. Your role is </span><span class="cf1">to turn LLM model and system capabilities into measurable, shippable improvements in </span><span class="cf1">agent performance. You will own experimentation and shipping decisions - what we try, </span><span class="cf1">how we measure better, what goes live, and what gets rolled back.</span></p>
<p class="pf0"><strong><span class="cf1">What You’ll Do :-</span></strong></p>
<ul>
<li class="pf1"><span class="cf1">Develop deep intuition for LLM and agent behavior, identifying failure modes and </span><span class="cf1">regressions.</span></li>
<li class="pf1"><span class="cf1">Define and run high-leverage experiments to improve agent quality, reliability, and code </span><span class="cf1">outcomes.</span></li>
<li class="pf1"><span class="cf1">Ship improvements with clear metrics, evaluation gates, staged rollouts, and rollback </span><span class="cf1">criteria.</span></li>
<li class="pf1"><span class="cf1">Define evaluation frameworks and work with engineering teams to measure agent </span><span class="cf1">quality at scale.</span></li>
<li class="pf1"><span class="cf1">Drive initiatives around context engineering, memory systems, and frontier agent </span><span class="cf1">capabilities.</span></li>
<li class="pf1"><span class="cf1">Think like the agent — continuously making it smarter, more reliable, and more useful. </span></li>
</ul>
<p class="pf0"><strong><span class="cf1">Who You Are</span><span class="cf0"> :-</span></strong></p>
<ul>
<li class="pf1"><span class="cf1">5+ years of software engineering experience with end-to-end ownership.</span></li>
<li class="pf1"><span class="cf0">Strong technical depth paired with sharp product intuition.</span></li>
<li class="pf1"><span class="cf0">Comfortable working with metrics, experimentation, SQL, and/or Python.</span></li>
<li class="pf1"><span class="cf0">Able to thrive in ambiguity and move fast with rigor.</span></li>
<li class="pf1"><span class="cf0">Hands-on and up-to-date with emerging AI research and industry trends</span></li>
</ul>
<p> </p>