Jobs at EnCharge AI (Now Hiring) — 1 open

Research Engineer, AI Models

India · Remote OK

Senior$163M raised

Research Engineer, Applied AI Location: India (or Remote-friendly with travel) About EnCharge AI: EnCharge AI is building the next generation AI platform. Our novel in-memory-computing architecture delivers a 10x step-fu…

Skills: Python, PyTorch, Transformers, Diffusion Models, Fine-tuning

Research Engineer, AI Models

EnCharge AI

India • Remote OK

Apply

Senior

Tired of cold applications?

Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.

Full-time
Posted 1d ago
~40 hrs/week
Remote in United States

Responsibilities

Research and implement state-of-the-art techniques to accelerate AI inference and optimize model quality on custom silicon. Build fine-tuning pipelines and benchmarking frameworks to characterize tradeoffs between latency, throughput, and power consumption.

Requirements

Requires 5+ years of experience in ML research or systems with strong proficiency in Python and PyTorch. Candidates must have hands-on experience fine-tuning large generative models and implementing techniques from research papers.

Full job description

Research Engineer, Applied AI

Location: India (or Remote-friendly with travel)

About EnCharge AI:

EnCharge AI is building the next generation AI platform. Our novel in-memory-computing architecture delivers a 10x step-function improvement in compute energy efficiency and performance for AI inference workloads. As the demands of artificial intelligence move beyond today's models, we believe fundamental underlying infrastructure must evolve. We are an experienced team of AI researchers, silicon & systems engineers, and architects backed by leading investors, poised to become the essential platform for the next wave of AI innovation.

The Opportunity:

Modern AI workloads—from large language models to diffusion-based generators to multimodal systems—represent some of the most compute-intensive frontiers in AI, and some of the most promising applications for our hardware’s energy efficiency advantages. We’re building a vertically integrated AI stack that will showcase the transformative potential of our silicon while delivering real value to customers today.

We are seeking a Research Engineer to push the boundaries of AI model capability, quality, and efficiency. You’ll build fine-tuning and post training pipelines, develop rigorous benchmarking frameworks, and work at the intersection of ML research and hardware-aware optimization—ensuring our models run beautifully on our silicon.

This is a role for someone who thrives at the boundary between research and engineering. You’ll read papers, implement techniques, and ship production-quality code—all in service of making AI inference faster, cheaper, and better.

Key Responsibilities:

Algorithmic Acceleration: Research and implement state-of-the-art techniques to accelerate AI inference—quantization, sparsity, distillation, speculative decoding, caching strategies, and architectural modifications. Systematically characterize tradeoffs between model quality, latency, throughput, and power consumption to find optimal operating points across different use cases.
Hardware Co-Design: Partner closely with hardware, compiler, and quantization teams to ensure algorithmic improvements translate to real gains on our silicon. Identify optimizations aligned with our architecture's strengths—maximizing throughput while minimizing power. Shape the feedback loop between model development and hardware.
Evaluation: Build profiling tools and comprehensive benchmarking frameworks to understand compute bottlenecks, measure model quality across standard and domain-specific evals, and track efficiency metrics.
Applied Research: Build robust fine-tuning workflows for modern AI models, enabling rapid experimentation with LoRA, adapters, and full fine-tuning. Stay current with the rapidly evolving landscape—evaluate new architectures, implement promising techniques, and contribute insights that inform technical and go-to-market strategy.

Qualifications:

5+ years of experience in ML research, applied ML, or ML systems
Strong fundamentals in Python and PyTorch
Hands-on experience with transformers, diffusion models, state space models etc.
Experience fine-tuning large models and building training/evaluation pipelines
Deep understanding of transformers, attention mechanisms, & optimization techniques
Comfort reading and implementing techniques from research papers

Nice to Have:

Experience with efficient inference techniques (KV cache optimization, attention variants, MoE routing, flow matching)
Background in hardware-aware ML optimization or quantization
Familiarity with profiling tools (PyTorch Profiler, Nsight, custom instrumentation)
Publications in generative modeling, efficient inference, or ML systems
Contributions to open-source ML projects

Related keywords

AI ModelsIn-Memory ComputingAI InferenceQuantizationSparsityDistillationSpeculative DecodingHardware Co-DesignPyTorch ProfilerNsightLoRAAdaptersTransformersDiffusion ModelsMultimodal SystemsMLOps

About EnCharge AI

LinkedIn Visit site

Where the future of AI compute is being defined and built, to unlock new levels of machine intelligence.

Industry: Embedded Software Products
Company size: 11-50 employees
Founded: 2022
Headquarters: Santa Clara, California
LinkedIn followers: 24,911
Total funding: $163M

EnCharge AI is a leader in advanced hardware and software systems for AI computing. EnCharge’s robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today’s best-in-class solutions, at a fraction of the cost. The high-performance architecture is coupled with seamless software and will enable the immense potential of AI to be accessible in power -, energy-, and space-constrained applications. EnCharge AI launched in 2022 and is led by veteran technologists with backgrounds in semiconductor design and AI systems.

Offices: 4500 Great American Parkway, Suite 230, Santa Clara, California 95054, US

SemiconductorArtificial IntelligenceSoftwareEmbedded SoftwareArtificial Intelligence (AI)Machine LearningAI Infrastructure

View all jobs at EnCharge AI

About EnCharge AI

LinkedIn Visit site

Where the future of AI compute is being defined and built, to unlock new levels of machine intelligence.

Industry: Embedded Software Products
Company size: 11-50 employees
Founded: 2022
Headquarters: Santa Clara, California
LinkedIn followers: 24,911
Total funding: $163M

Offices: 4500 Great American Parkway, Suite 230, Santa Clara, California 95054, US

SemiconductorArtificial IntelligenceSoftwareEmbedded SoftwareArtificial Intelligence (AI)Machine LearningAI Infrastructure

View all jobs at EnCharge AI

Similar companies hiring

GSAS Micro Systems India (14)ClinicMind (8)PlayOn|Younify (7)R2 (6)Sodales Solutions (6)Finexio (4)LiveLike (4)knokcare Brazil (3)8am (3)Obvio (2)Kanopi (2)DSP Concepts (1)

·Dashboard

Jobs at EnCharge AI (Now Hiring) — 1 open

EnCharge AI

Research Engineer, AI Models

India · Remote OK

Senior$163M raised

Skills: Python, PyTorch, Transformers, Diffusion Models, Fine-tuning

Research Engineer, AI Models

EnCharge AI

India • Remote OK

Apply

Senior

Tired of cold applications?

Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.

Full-time
Posted 1d ago
~40 hrs/week
Remote in United States

Responsibilities

Requirements

Full job description

Research Engineer, Applied AI

Location: India (or Remote-friendly with travel)

About EnCharge AI:

The Opportunity:

Key Responsibilities:

Algorithmic Acceleration: Research and implement state-of-the-art techniques to accelerate AI inference—quantization, sparsity, distillation, speculative decoding, caching strategies, and architectural modifications. Systematically characterize tradeoffs between model quality, latency, throughput, and power consumption to find optimal operating points across different use cases.
Hardware Co-Design: Partner closely with hardware, compiler, and quantization teams to ensure algorithmic improvements translate to real gains on our silicon. Identify optimizations aligned with our architecture's strengths—maximizing throughput while minimizing power. Shape the feedback loop between model development and hardware.
Evaluation: Build profiling tools and comprehensive benchmarking frameworks to understand compute bottlenecks, measure model quality across standard and domain-specific evals, and track efficiency metrics.
Applied Research: Build robust fine-tuning workflows for modern AI models, enabling rapid experimentation with LoRA, adapters, and full fine-tuning. Stay current with the rapidly evolving landscape—evaluate new architectures, implement promising techniques, and contribute insights that inform technical and go-to-market strategy.

Qualifications:

5+ years of experience in ML research, applied ML, or ML systems
Strong fundamentals in Python and PyTorch
Hands-on experience with transformers, diffusion models, state space models etc.
Experience fine-tuning large models and building training/evaluation pipelines
Deep understanding of transformers, attention mechanisms, & optimization techniques
Comfort reading and implementing techniques from research papers

Nice to Have:

Experience with efficient inference techniques (KV cache optimization, attention variants, MoE routing, flow matching)
Background in hardware-aware ML optimization or quantization
Familiarity with profiling tools (PyTorch Profiler, Nsight, custom instrumentation)
Publications in generative modeling, efficient inference, or ML systems
Contributions to open-source ML projects

Related keywords

AI ModelsIn-Memory ComputingAI InferenceQuantizationSparsityDistillationSpeculative DecodingHardware Co-DesignPyTorch ProfilerNsightLoRAAdaptersTransformersDiffusion ModelsMultimodal SystemsMLOps

About EnCharge AI

LinkedIn Visit site

Where the future of AI compute is being defined and built, to unlock new levels of machine intelligence.

Industry: Embedded Software Products
Company size: 11-50 employees
Founded: 2022
Headquarters: Santa Clara, California
LinkedIn followers: 24,911
Total funding: $163M

Offices: 4500 Great American Parkway, Suite 230, Santa Clara, California 95054, US

SemiconductorArtificial IntelligenceSoftwareEmbedded SoftwareArtificial Intelligence (AI)Machine LearningAI Infrastructure

View all jobs at EnCharge AI

About EnCharge AI

LinkedIn Visit site

Where the future of AI compute is being defined and built, to unlock new levels of machine intelligence.

Industry: Embedded Software Products
Company size: 11-50 employees
Founded: 2022
Headquarters: Santa Clara, California
LinkedIn followers: 24,911
Total funding: $163M

Offices: 4500 Great American Parkway, Suite 230, Santa Clara, California 95054, US

SemiconductorArtificial IntelligenceSoftwareEmbedded SoftwareArtificial Intelligence (AI)Machine LearningAI Infrastructure

View all jobs at EnCharge AI

Similar companies hiring

GSAS Micro Systems India (14)ClinicMind (8)PlayOn|Younify (7)R2 (6)Sodales Solutions (6)Finexio (4)LiveLike (4)knokcare Brazil (3)8am (3)Obvio (2)Kanopi (2)DSP Concepts (1)

Jobs at EnCharge AI (Now Hiring) — 1 open

Research Engineer, AI Models

Research Engineer, AI Models

Tired of cold applications?

Responsibilities

Requirements

Full job description

Related keywords

About EnCharge AI

About EnCharge AI

Similar companies hiring

Tools

Explore

Company

Tools

Explore

Company

Jobs at EnCharge AI (Now Hiring) — 1 open

Research Engineer, AI Models

Research Engineer, AI Models

Tired of cold applications?

Responsibilities

Requirements

Full job description

Related keywords

About EnCharge AI

About EnCharge AI

Similar companies hiring

Tools

Explore

Company

Tools

Explore

Company