
We are looking for members of technical staff specializing in ML. We're particularly interested in self-motivated researchers and engineers who want to contribute meaningfully to training powerful models, whether that means working on low-level GPU optimizations or new optimization theory.
Relevant Skills (not all are necessary):
Work on large language models at an industry or academic lab (e.g. OpenAI, Google, Mistral, Z.ai, Qwen, DeepSeek, Ai2).
Experience with pretraining language models and large-scale AI infrastructure, which can include any of the following:
Contributions to community initiatives such as NanoGPT Speedrun, Marin, etc.
Understanding of different types of model parallelism (e.g. data, tensor, pipeline, expert).
Software/hardware co-design to maximize training throughput.
Experience with monitoring and maintaining large-scale training runs.
Academic work (e.g. papers on optimization, data, etc.).
Experience with post-training language models, which can include any of the following:
Work on reinforcement learning for language models (environments, infrastructure, training).
Academic or personal work on instruction data curation, tool use, or other post-training-related tasks.
Experience with inference/systems optimization, which can include any of the following:
Contributions to vLLM, SGLang, Dynamo, MegaKernels, etc.
Strong systems-level understanding of these frameworks and how to optimize for batching, KV cache pressure, long context, etc.
Experience with low-level kernel design + DSLs, for example:
CUDA, C++, CuTe, Triton, PTX, TileLang, etc.