
HeyMilo helps recruiters deploy multimodal AI agents that interview and evaluate candidates at scale. We're a fast-moving team backed by prominent investors growing at an unprecedented rate.
You'll own architectural decisions, ship end-to-end, and lead technical direction across our platform. This is a high-ownership role, not a ticket queue.
Requires flexibility to work evenings (IST) for overlap with our New York/Toronto teams.
What you'll do
Lead system design and architecture decisions
Build and scale real-time AI infrastructure
Ship full-stack features across backend and frontend
Drive reliability, performance, and observability
Mentor engineers and set technical standards
What we're looking for
5β7+ years of software engineering experience
Strong distributed systems and AWS experience
Experience building agentic AI systems; prompt engineering, RAG pipelines, LLM orchestration
Familiarity with vector databases and retrieval-augmented generation patterns
You use AI tools (Cursor, Devin) to write code daily
You ship fast and own outcomes
Comfortable with Python, FastAPI, React, Next.js
Experience with MongoDB, Redis, and queue-based systems
Role is not for you if
You want narrow scope and predictable roadmaps
You avoid production responsibility
You're not comfortable with evening hours for cross-timezone work
Why you won't regret it
Real-time AI systems in production
Small team, high ownership, no bureaucracy
Awesome culture with a global team across New York πΊπΈ, Toronto π¨π¦, and Colombo π±π°.
Competitive salary with benefits/allowances
Take the next step in your career journey
Get matched with similar opportunities at top startups
This role is hosted on HeyMilo's careers site.
Join our talent pool first to get notified about similar roles that match your profile.