Bjak logo
Principal Machine Learning Engineer
full-timeYuexiu District

Summary

Location

Yuexiu District

Type

full-time

Explore Jobs

About this role

About the Role

A1 is building a proactive AI system that understands context across conversations, plans actions, and carries work forward over time.

You will be responsible for turning research direction into working, production-grade ML systems. This role owns the execution layer of A1’s intelligence – training pipelines, inference systems, evaluation tooling, and deployment.


Focus

  • Build and own end-to-end ML pipelines spanning data, training, evaluation, inference, and deployment.

  • Fine-tune and adapt models using state-of-the-art methods such as LoRA, QLoRA, SFT, DPO, and distillation.

  • Architect and operate scalable inference systems, balancing latency, cost, and reliability.

  • Design and maintain data systems for high-quality synthetic and real-world training data.

  • Implement evaluation pipelines covering performance, robustness, safety, and bias, in partnership with research leadership.

  • Own production deployment, including GPU optimization, memory efficiency, latency reduction, and scaling policies.

  • Collaborate closely with application engineering to integrate ML systems cleanly into backend, mobile, and desktop products.

  • Make pragmatic trade-offs and ship improvements quickly, learning from real usage.

  • Work under real production constraints: latency, cost, reliability, and safety


Requirements

  • Strong background in deep learning and transformer-based architectures.

  • Hands-on experience training, fine-tuning, or deploying large-scale ML models in production.

  • Proficiency with at least one modern ML framework (e.g. PyTorch, JAX), and ability to learn others quickly.

  • Experience with distributed training and inference frameworks (e.g. DeepSpeed, FSDP, Megatron, ZeRO, Ray).

  • Strong software engineering fundamentals – you write robust, maintainable, production-grade systems.

  • Experience with GPU optimization, including memory efficiency, quantization, and mixed precision.

  • Comfort owning ambiguous, zero-to-one ML systems end-to-end.

  • A bias toward shipping, learning fast, and improving systems through iteration.


Ideal Experience

  • Experience with LLM inference frameworks such as vLLM, TensorRT-LLM, or FasterTransformer.

  • Contributions to open-source ML or systems libraries.

  • Background in scientific computing, compilers, or GPU kernels.

  • Experience with RLHF pipelines (PPO, DPO, ORPO).

  • Experience training or deploying multimodal or diffusion models.

  • Experience with large-scale data processing (Apache Arrow, Spark, Ray).


How We Work

Our organization is very flat and our team is small, highly motivated, and focused on engineering and product excellence. All members are expected to be hands-on and to contribute directly to the company’s mission.


Interview process

If there appears to be a fit, we'll reach to schedule 3, but no more than 4 interviews.

Applications are evaluated by our technical team members. Interviews will be conducted via virtual meetings and/or onsite.

We value transparency and efficiency, so expect a prompt decision. If you've demonstrated the exceptional skills and mindset we're looking for, we'll extend an offer to join us. This isn't just a job offer; it's an invitation to be part of a team that's bringing AI to have practical benefits to billions globally.

Other facts

Tech stack
Deep Learning,Transformer-Based Architectures,ML Frameworks,Distributed Training,Inference Frameworks,GPU Optimization,Software Engineering,Production-Grade Systems,Evaluation Pipelines,Synthetic Data,Real-World Data,Model Fine-Tuning,Scalable Inference Systems,Latency Reduction,Cost Management,Robustness,Safety

About Bjak

Our mission is to develop technology based solutions to improve financial inclusion.

We develop new & innovative platforms & services globally. For example, we are the first platform to simplify and digitise comprehensive life and medical insurance, supported by AI agent. BJAK is the largest insurance platform in Southeast Asia.

If you enjoy building cutting edge platform-ecosystems that gives equal access to financial services to everyone at scale, join us.

Team size: 201-500 employees
LinkedIn: Visit
Industry: Software Development
Founding Year: 2019

What you'll do

  • You will be responsible for turning research direction into working, production-grade ML systems, owning the execution layer of A1’s intelligence. This includes building end-to-end ML pipelines, fine-tuning models, and collaborating with application engineering to integrate ML systems into products.

Ready to join Bjak?

Take the next step in your career journey

Frequently Asked Questions

What does a Principal Machine Learning Engineer do at Bjak?

As a Principal Machine Learning Engineer at Bjak, you will: you will be responsible for turning research direction into working, production-grade ML systems, owning the execution layer of A1’s intelligence. This includes building end-to-end ML pipelines, fine-tuning models, and collaborating with application engineering to integrate ML systems into products..

Why join Bjak as a Principal Machine Learning Engineer?

Bjak is a leading Software Development company.

Is the Principal Machine Learning Engineer position at Bjak remote?

The Principal Machine Learning Engineer position at Bjak is based in Yuexiu District, Guangdong Province, China. Contact the company through Clera for specific work arrangement details.

How do I apply for the Principal Machine Learning Engineer position at Bjak?

You can apply for the Principal Machine Learning Engineer position at Bjak directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Bjak on their website.