Steampunk logo
LLMOps Engineer
OTHERMcLean$115k - $145k

Summary

Location

McLean

Salary

$115k - $145k

Type

OTHER

Claim this Company

Are you the employer? Manage your company page directly.

Explore Jobs

About this role

Overview

We are looking for an experienced LLMOps Engineer to design, implement, and maintain production-grade large-language-model (LLM) pipelines, deployment architectures, and monitoring systems across enterprise environments. The Senior LLMOps Engineer will play a critical role in operationalizing generative AI capabilities, ensuring that LLM-based applications are scalable, secure, reliable, and compliant with emerging AI risk and governance frameworks. This role spans the spectrum of model deployment, orchestration, evaluation, and optimization. 

Contributions

 

  • Architect and maintain scalable LLM and RAG pipelines, including model hosting, inference optimization, retrieval layers, and context management frameworks. 
  • Lead the design and implementation of secure GenAI infrastructure across cloud environments, ensuring reliability, performance, and cost efficiency. 
  • Build and manage automated evaluation systems that assess LLM output quality, safety, latency, and adherence to AI governance requirements. 
  • Develop CI/CD workflows tailored for LLM- and GenAI-based applications, including dataset versioning, model lineage, and automated testing of prompt and model behaviors. 
  • Collaborate with AI Product Engineers and Data Scientists to productionize LLM-based prototypes into enterprise-grade, maintainable systems. 
  • Integrate vector databases, model gateways, content filters, and guardrail frameworks into end-to-end LLM solutions. 
  • Implement observability and monitoring solutions that track performance metrics, hallucination rates, cost profiles, and user interaction patterns. 
  • Lead troubleshooting and root-cause analysis for issues related to LLM deployment, inference performance, or pipeline reliability. 
  • Stay current with emerging LLM architectures, inference optimizations, fine-tuning techniques, and relevant MLSecOps patterns. 
  • Ensure compliance with data privacy, ethical AI, and AI-governance frameworks throughout pipeline design and operations. 
  • Mentor junior engineers and contribute to Steampunk’s AI engineering best practices, tooling, and reusable infrastructure patterns. 
  • You will contribute to the growth of our AI & Data Exploitation Practice! 

Qualifications

 

  • Ability to hold a position of public trust with the U.S. government. 
  • Master's Degree (related program) and 7 years of relevant experience; OR
    • Bachelor's Degree (related program) and 10 years of relevant experience; OR
    • No degree and 16 years of relevant experience
  • Possesses at least one professional certification relevant to the technical service provided. Maintain a certification relevant to the product being deployed and/or maintained.
  • 5+ years of experience in software engineering, data engineering, MLOps, or cloud engineering, with 2+ years focusing specifically on LLM or GenAI operations. 
  • Strong experience deploying models using frameworks such as Hugging Face Transformers, vLLM, TensorRT-LLM, or similar. 
  • Proficiency in Python and operational tooling such as FastAPI, PyTorch, LangChain, LlamaIndex, and vector databases (FAISS, Milvus, Pinecone, or similar). 
  • Advanced knowledge of cloud platforms (AWS, Azure, GCP) including model hosting, distributed compute, and secure networking patterns. 
  • Hands-on experience building CI/CD pipelines, automated testing frameworks, and environment provisioning for AI/ML workloads. 
  • Experience with Docker, Kubernetes, and infrastructure-as-code (Terraform, CloudFormation). 
  • Familiarity with MLSecOps, AI governance, model hardening, prompt injection defenses, and content safety monitoring. 
  • Strong understanding of logging, observability, and performance profiling for high-throughput LLM inference systems. 
  • Excellent written and verbal communication skills, with the ability to explain trade-offs and architectural decisions to technical and non-technical stakeholders. 
  • Demonstrated ability to balance long-term platform thinking with hands-on operations and rapid problem solving. 
  • Experience working in agile teams and using modern project management tools. 

About steampunk

Steampunk relies on several factors to determine salary, including but not limited to geographic location, contractual requirements, education, knowledge, skills, competencies, and experience. The projected compensation range for this position is $115,000 to $145,000.  The estimate displayed represents a typical annual salary range for this position. Annual salary is just one aspect of Steampunk’s total compensation package for employees. Learn more about additional Steampunk benefits here. 

 

Identity Statement

As part of the application process, you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud.

 

Steampunk is a Change Agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health and DoD sectors.  Through our Human-Centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges.  As an employee owned company, we focus on investing in our employees to enable them to do the greatest work of their careers – and rewarding them for outstanding contributions to our growth. If you want to learn more about our story, visit http://www.steampunk.com.

 

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E-Verify program. 

Other facts

Tech stack
LLMOps,LLM Pipelines,GenAI Infrastructure,Model Deployment,RAG Pipelines,CI/CD Workflows,Vector Databases,Observability,MLSecOps,Python,Kubernetes,Terraform,Cloud Platforms,Inference Optimization,AI Governance,Prompt Engineering

About Steampunk

When Steampunk Digital Consulting was established, we committed ourselves to the concept of providing our clients with Unfair Ideas, and we’ve been re-establishing ourselves every day since to deliver the Unfair Ideas that give our clients an unfair advantage. Because we believe that’s what this business requires: a fresh perspective, a new approach, and an interdependent group of people dedicated to taking on whatever new challenge or opportunity the day brings. That’s why we bring a holistic approach to marketing, gathering together the best of all disciplines under one roof. We understand that today’s modern companies need a creative partner that works the way they do: fast, open, and collaboratively. Which, done correctly, is itself a little unfair.

We truly believe in the transformative power of illustration and design and their ability to simplify communications, elevate experiences, engage and inspire people everywhere. Good design and good relationships come from collaboration.

Team size: 1 employee
LinkedIn: Visit
Industry: Advertising Services
Founding Year: 2017

What you'll do

  • The engineer will architect and maintain scalable LLM and RAG pipelines, including model hosting, inference optimization, and context management frameworks across secure cloud environments. Key duties involve building automated evaluation systems, developing specialized CI/CD workflows, and integrating vector databases and guardrail frameworks into end-to-end solutions.

Join Clera's Talent Pool

Get matched with similar opportunities at top startups

This role is hosted on Steampunk's careers site.
Join our talent pool first to get notified about similar roles that match your profile.

Frequently Asked Questions

What does Steampunk pay for a LLMOps Engineer?

Steampunk offers a competitive compensation package for the LLMOps Engineer role. The salary range is USD 115k - 145k per year. Apply through Clera to learn more about the full compensation details.

What does a LLMOps Engineer do at Steampunk?

As a LLMOps Engineer at Steampunk, you will: the engineer will architect and maintain scalable LLM and RAG pipelines, including model hosting, inference optimization, and context management frameworks across secure cloud environments. Key duties involve building automated evaluation systems, developing specialized CI/CD workflows, and integrating vector databases and guardrail frameworks into end-to-end solutions..

Why join Steampunk as a LLMOps Engineer?

Steampunk is a leading Advertising Services company. The LLMOps Engineer role offers competitive compensation.

Is the LLMOps Engineer position at Steampunk remote?

The LLMOps Engineer position at Steampunk is based in McLean, Virginia, United States. Contact the company through Clera for specific work arrangement details.

How do I apply for the LLMOps Engineer position at Steampunk?

You can apply for the LLMOps Engineer position at Steampunk directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Steampunk on their website.