CommonAI CIC is a non-profit membership organisation, founded on a belief in collaborative engineering for the safe and responsible development of foundational AI technologies. A place where AI startups, enterprises larg…
CommonAI CIC is a non-profit membership organisation, founded on a belief in collaborative engineering for the safe and responsible development of foundational AI technologies. A place where AI startups, enterprises larg…
Skills: Rust, Systems Programming, LLMs, AI Agents, Software Architecture
We are looking to hire ambitious entrepreneurs to start and scale their own startups. We are serial entrepreneurs, for example Paul Müller (founder Adjust, €1.2B exit) and Petter Made (founder SumUp, €8B) who are eager t…
Applied Scientist, Silicon and Systems Group Edge AI
Cambridge, England, United Kingdom · On-site
Mid level$35B raised
Amazon Devices is an inventive research and development company that designs and engineer high-profile devices like Echo, Fire Tablets, Fire TV, and other consumer devices. We are looking for exceptional scientists to jo…
Skills: Multimodal Language Models, Machine Learning, Python, C++, Java
Senior Embedded Software Engineer Cambridge, UK | Full-time | Permanent | Hybrid Salary: £70,000 to £90,000 DOE The salary range for this role is broad, as we are able to consider varying levels of experience. Any offer …
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £60,000 to £88,000 (DOE) + Bonus + Benefits The salary range for this role is broad, as we are able to consider varying levels of experience. Any offer made will car…
Skills: Functional Verification, SystemVerilog, UVM, OVM, SVA
Cambridge, UK | Full-time or Part-time | Permanent | Hybrid Salary: £62,000 to £75,000, DOE We will also consider part-time applications for this role. Please indicate your preferred working schedule in your cover letter…
Skills: Quantum Computing, Quantum Error Correction, Experimental Design, Data Analysis, Technical Communication
About the job Help prove the ML software stack that future AI systems will depend on. Graphcore is expanding the software systems behind AI compute at datacenter scale. This role focuses on validating a complex machine l…
Skills: Software Design, Python, CI/CD, Automated Testing, Linux
About the job Validate the ML stack that turns accelerator hardware into trusted AI performance. This role sits where modern ML models meet Graphcore’s software and hardware stack. You will test, benchmark and validate c…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £90,000 to £115,000 DOE + Bonus + Benefits The salary range for this role is broad, as we are able to consider varying levels of experience. Any offer made will care…
About the job Validate the ML stack that turns accelerator hardware into trusted AI performance. This role sits where modern ML models meet Graphcore’s software and hardware stack. You will test, benchmark and validate c…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £60,000 - £70,000 DOE We will also consider part-time applications for this role. Please indicate your preferred working schedule in your cover letter. About us Rive…
Cambridge, UK | Full-time | Permanent Salary: £72,000 - £90,000 DOE We will also consider part-time applications for this role. Please indicate your preferred working schedule in your cover letter. About us Riverlane’s m…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £70,000 to £90,000, DOE We will also consider part-time applications for this role. Please indicate your preferred working schedule in your cover letter. We are able…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £72,000 to £140,000, DOE + Bonus + Benefits The salary range for this role is broad, as we are able to consider varying levels of experience. Any offer made will car…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £72,000 to £90,000 DOE We will also consider part-time applications for this role. Please indicate your preferred working schedule in your cover letter. About us Riv…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £72,000 to £90,000 DOE We will also consider part-time applications for this role. Please indicate your preferred working schedule in your cover letter. About us Riv…
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £70,000 to £90,000 DOE + Bonus + Benefits The salary range for this role is broad, as we are able to consider varying levels of experience. Any offer made will caref…
Skills: Digital Design, FPGA, RTL, Low-latency Systems, High Throughput Systems
Cambridge, UK | Full-time | Permanent | Hybrid Salary: £60,000 to £85,000, DOE The salary range for this role is broad, as we are able to consider varying levels of experience. Any offer made will carefully take into acc…
Senior FPGA Engineer – Systems Team Cambridge, UK | Full-time | Permanent | Hybrid Salary: £68,000 to £82,000, DOE The salary range for this role is broad, as we are able to consider varying levels of experience. Any off…
Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.
Full-time
Competitive Salary Package, Pension, Professional Development Opportunities, Networking Opportunities, Vibrant Office Environment
Posted 1d ago
~40 hrs/week
Responsibilities
Deploy, optimize, and extend vLLM to support novel hardware architectures and high-assurance environments. Collaborate with the open-source community to upstream core changes and improve inference performance.
Requirements
Requires a Senior Software Engineer with deep expertise in Python and low-level programming, along with a history of contributions to high-performance ML projects. Must have a strong understanding of LLM inference mechanics and hardware accelerator optimization.
Full job description
CommonAI CIC is a non-profit membership organisation, founded on a belief in collaborative engineering for the safe and responsible development of foundational AI technologies. A place where AI startups, enterprises large and small, public sector bodies and academia can share resources and knowledge, to codevelop and grow businesses, fast.
We are led by experienced founders, investors and engineers who believe that collaborative engineering drives faster AI innovation and are backed by a mix of UK Government and private funding in order to design, build and deploy innovative AI systems.
The Opportunity
We are seeking a Senior Software Engineer with a passion for open-source AI infrastructure to work on deploying, extending and optimising vLLM (and potentially other inference serving engines) to support our projects. You will play a crucial, high-impact role across both our key programmes, the Scaling Inference Lab (https://scalinginferece.org and the High Assurance programme.
As an AI-first company, we strongly believe in collaborative engineering powered by the tools we are helping to build. This role places a major emphasis on using LLMs, AI coding assistants, and autonomous agents for software development.
What You'll Do
Deploy, instrument and monitor open weight models served using vLLM.
Implement new features within vLLM to support novel hardware architectures as part of the Scaling Inference Lab.
Work with the Panopticon team to identify opportunities to extend vLLM to enhance accuracy, explainability, and accountability when using it to serve models in regulated environments.
Actively collaborate with the open-source vLLM community to propose, review, and upstream core changes.
Troubleshoot, profile, and optimise inference performance, focusing on latency, throughput, and hardware utilisation.
Proven experience working as a Senior Software Engineer with deep expertise in both Python and low-level programming (e.g., C/C++, Rust, assembly or CUDA).
A history of direct contributions to vLLM or similar high-performance open-source ML/AI projects (e.g., PyTorch, Hugging Face TGI, TensorRT-LLM, Ray).
Strong understanding of LLM inference mechanics (e.g., KV caching, continuous batching, memory management, model quantisation).
Experience interacting with, and upstreaming code to, active open-source communities.
Hands-on experience working on performance optimisation for hardware accelerators (GPUs, TPUs, CPU vector units or other accelerators).
A strong enthusiasm for using LLMs, coding assistants, and agents as core tools in your own software development process.
We also value:
Experience working on software systems that operate in highly regulated or high-assurance environments (e.g., financial services).
An understanding of the latest AI safety research and active involvement in that community.
Deep knowledge of modern MLOps practices, CI/CD, and large-scale deployments.
A collaborative and supportive work environment
The opportunity to have a high impact in a growing organisation
Competitive salary package and pension
Professional development opportunities
Networking opportunities with influential people from across the tech sector, financial services, and academia
A vibrant office environment located a few minutes' walk away from Cambridge train station
CommonAI CIC is an equal opportunity employer and is committed to creating an inclusive and diverse workplace.
Related keywords
vLLMLLMInferenceOpen SourcePythonC++RustCUDAPyTorchHugging Face TGITensorRT-LLMRayKV CachingContinuous BatchingModel QuantisationMLOps
How many Software jobs are open in Cambridge, United Kingdom right now?
There are currently 109 open software positions in Cambridge, United Kingdom listed on Clera. New openings are added daily as companies post roles.
Which companies are hiring for Software roles in Cambridge, United Kingdom?
Companies currently hiring include Riverlane, Darktrace, Graphcore, AVEVA, Microsoft, among others. Browse the listings above to see every active employer.
Are there remote or hybrid Software jobs in Cambridge, United Kingdom?
Yes — 84 of the 109 open software positions offer remote or hybrid work (11 remote, 73 hybrid).
How do I apply for Software jobs in Cambridge, United Kingdom?
Each listing links directly to the employer's application page. Apply early — fresh listings get the most recruiter attention in the first two weeks.