Clera home
·Dashboard

Jobs at Novita AI (Now Hiring) — 1 open

Novita AI logoNovita AI

Forward Deployed Engineer

San Mateo, California, United States · On-site

Mid level

About Novita AI Novita AI is an AI & Agent Cloud platform for builders. We provide a unified platform for Model APIs, GPU Cloud, and Agent Sandbox infrastructure, helping developers and enterprises build, deploy, and sca…

Skills: Python, Cloud-native Applications, Docker, Kubernetes, Linux

Novita AI logo

Forward Deployed Engineer

Novita AI

San Mateo, California, United States • On-site

Apply
Mid level

Tired of cold applications?

Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.

  • Full-time
  • Medical Insurance, Dental Insurance, Vision Insurance, 401(k) Plan, Free Meals
  • Posted 3d ago
  • ~40 hrs/week

Responsibilities

The Forward Deployed Engineer partners with customers to deploy AI workloads and integrate model APIs, GPU infrastructure, and sandbox environments. They act as a technical advisor, troubleshooting production issues and translating customer feedback into product requirements.

Requirements

Candidates must have strong software engineering fundamentals, proficiency in Python, and experience with cloud-native tools like Docker and Kubernetes. Experience with AI/ML infrastructure, GPUs, and distributed serving frameworks is highly preferred.

Full job description

About Novita AI

Novita AI is an AI & Agent Cloud platform for builders.

We provide a unified platform for Model APIs, GPU Cloud, and Agent Sandbox infrastructure, helping developers and enterprises build, deploy, and scale production AI systems without managing complex infrastructure. Today, thousands of teams rely on Novita to power AI agents, coding tools, multimodal applications, and large-scale inference workloads.

The Role

As a Forward Deployed Engineer (FDE), you’ll work at the intersection of engineering, customer success, and product. You’ll partner closely with customers to understand their AI workloads, deploy solutions, troubleshoot production issues, and influence the evolution of our platform.

You will act as an extension of our customers’ engineering teams, helping them integrate model APIs, GPU infrastructure, and sandbox environments into real-world applications.

This role is ideal for someone who enjoys wearing multiple hats—software engineer, solutions architect, product thinker, and trusted technical advisor.

What You’ll Do

  • Work directly with customers to understand their AI products, technical requirements, and business goals.

  • Deploy and integrate Novita AI products, including:

    • Model APIs and inference endpoints

    • Dedicated model hosting

    • GPU cloud infrastructure

    • Agent sandbox environments

  • Debug production issues across APIs, networking, containers, GPUs, and distributed systems.

  • Build customer-specific solutions, integrations, demos, and prototypes.

  • Travel to customer sites when necessary and collaborate closely with their engineering teams.

  • Translate customer feedback into product requirements and work with internal engineering teams to improve our platform.

  • Create reusable tooling, automation, and reference implementations that benefit future customers.

  • Support proof-of-concepts (PoCs) and help customers successfully move into production.

What We’re Looking For

Requirements

  • Strong software engineering fundamentals.

  • Proficiency in Python and at least one additional programming language.

  • Experience building or deploying cloud-native applications.

  • Familiarity with Docker, Kubernetes, Linux, and networking concepts.

  • Strong debugging and problem-solving skills in production environments.

  • Excellent communication skills and ability to work directly with customers.

  • Ability to operate with ambiguity and move quickly in a startup environment.

Nice to Have

  • Experience with AI/ML infrastructure, LLMs, or inference systems.

  • Familiarity with vLLM, SGLang, Ray, or distributed serving frameworks.

  • Experience working with GPUs, CUDA, or large-scale compute systems.

  • Background in solutions engineering, developer relations, or customer-facing engineering roles.

  • Experience building AI agents, coding agents, or sandboxed execution environments.

  • Prior startup experience or experience as an early engineer.

  • Professional working proficiency in Mandarin Chinese.

What Success Looks Like

In your first six months, you will:

  • Help multiple customers successfully launch AI workloads into production.

  • Become a trusted technical advisor for strategic accounts.

  • Build reusable solutions that accelerate future customer deployments.

  • Influence our product roadmap through direct customer insights.

  • Bridge the gap between customer requirements and internal engineering execution.

Why Join Novita AI

  • Work on cutting-edge AI infrastructure used by leading AI companies and developers.

  • Solve challenging problems across inference, GPUs, agent systems, and distributed computing.

  • Have direct impact on customers and product direction.

  • Move fast in a highly technical and entrepreneurial environment.

  • Collaborate with a team deeply passionate about the future of open AI infrastructure.

  • Competitive pay package, 100% employer-covered premium medical, dental, and vision insurance, 401(k) plan, free meals in the office

Related keywords

AIAgent CloudModel APIsGPU CloudAgent SandboxInferencePythonDockerKubernetesLinuxvLLMSGLangRayCUDADistributed ComputingCloud-native

About Novita AI

LinkedInVisit site

AI & Agent Cloud for Developers

Industry
Technology, Information and Internet
Company size
11-50 employees
Founded
2024
Headquarters
San Francisco, California
LinkedIn followers
1,825

Novita AI is an AI & Agent Cloud platform built to make advanced AI infrastructure accessible, reliable, and cost-effective for developers and fast-growing teams. We provide high-performance Model APIs, GPU Cloud, and Agent Sandbox environments, enabling teams to build, deploy, and scale AI applications without managing complex infrastructure. From open-source LLMs and multimodal models to production-grade GPU workloads, Novita handles the heavy lifting so developers can focus on building. Our platform is trusted by startups, research teams, and AI-native companies running real production workloads—processing massive token volumes and GPU hours every day across global regions. We emphasize performance, transparency, and flexibility, with straightforward pricing and infrastructure designed for real-world usage. At Novita, we believe AI infrastructure should be open, efficient, and developer-first. Our mission is to lower the barrier to building powerful AI systems—whether you’re shipping your first prototype or operating at scale.

Offices: 156 2nd, San Francisco, California, US

Information TechnologyArtificial Intelligence (AI)Cloud Infrastructure
View all jobs at Novita AI

About Novita AI

LinkedInVisit site

AI & Agent Cloud for Developers

Industry
Technology, Information and Internet
Company size
11-50 employees
Founded
2024
Headquarters
San Francisco, California
LinkedIn followers
1,825

Novita AI is an AI & Agent Cloud platform built to make advanced AI infrastructure accessible, reliable, and cost-effective for developers and fast-growing teams. We provide high-performance Model APIs, GPU Cloud, and Agent Sandbox environments, enabling teams to build, deploy, and scale AI applications without managing complex infrastructure. From open-source LLMs and multimodal models to production-grade GPU workloads, Novita handles the heavy lifting so developers can focus on building. Our platform is trusted by startups, research teams, and AI-native companies running real production workloads—processing massive token volumes and GPU hours every day across global regions. We emphasize performance, transparency, and flexibility, with straightforward pricing and infrastructure designed for real-world usage. At Novita, we believe AI infrastructure should be open, efficient, and developer-first. Our mission is to lower the barrier to building powerful AI systems—whether you’re shipping your first prototype or operating at scale.

Offices: 156 2nd, San Francisco, California, US

Information TechnologyArtificial Intelligence (AI)Cloud Infrastructure
View all jobs at Novita AI

Similar companies hiring

Carvana (2261)Mindrift (1398)Delivery Hero (616)Tieto (535)Toloka Annotators (515)Peraton (434)Celestica (352)Cox Business (335)SFS (321)Nscale (223)AUTO1 Group (190)WashU IT (186)
Clera home

Your AI-talent agent. Connecting talents with dream jobs.

Earn $5,000

Tools

  • Salary Calculator
  • Resume Review
  • Startup Map

Explore

  • Jobs
  • Discover Jobs
  • Companies
  • Acquihire
  • Referral

Company

  • Manifesto
  • Engineering
  • We are hiring!
  • FAQs
  • Blog
  • Press

Tools

  • Salary Calculator
  • Resume Review
  • Startup Map

Explore

  • Jobs
  • Discover Jobs
  • Companies
  • Acquihire
  • Referral

Company

  • Manifesto
  • Engineering
  • We are hiring!
  • FAQs
  • Blog
  • Press

© 2026 Clera Labs, Inc.

PrivacyTermsBug Bounty