NeuReality is looking for a sharp, hands-on Product Manager to help define NR-NEXUS, our next-generation AI inference platform. This role is ideal for experienced product manager who is fast-learning, technical, and ambi…
Skills: Product Management, Software Engineering, SaaS, AI Infrastructure, PRD Writing
NeuReality is seeking a Lead System Architect to join our system architecture team and help define NR-NEXUS, our next-generation AI inference platform. Responsibilities Lead the software architecture and technical roadma…
Skills: Software Architecture, Kubernetes, Cloud-Native Architecture, GenAI/LLM Infrastructure, Distributed Systems
NeuReality is seeking a Lead System Architect to join our system architecture team and help define NR-NEXUS, our next-generation AI inference platform. Responsibilities Lead the software architecture and technical roadma…
Skills: Software Architecture, Kubernetes, Cloud-Native Architecture, GenAI/LLM Infrastructure, Distributed Systems
Senior SW Engineer – AI Infrastructure & Optimization
Israel · On-site
Senior$60M raised
We are looking for a Senior Software Engineer to help build and optimize large-scale, high-performance GenAI infrastructure and inference systems on Kubernetes. As AI workloads increasingly move toward Kubernetes-native …
Skills: Go, Python, Kubernetes, GenAI Infrastructure, Distributed Systems
We are looking for a Senior Software Engineer to help build and optimize large-scale, high-performance GenAI infrastructure and inference systems on Kubernetes. As AI workloads increasingly move toward Kubernetes-native …
Skills: Go, Python, Kubernetes, GenAI Infrastructure, Distributed Systems
Senior SW Engineer – AI Infrastructure & Optimization We are looking for a Senior Software Engineer to help build and optimize large-scale, high-performance GenAI infrastructure and inference systems on Kubernetes. As AI…
Skills: Go, Python, Kubernetes, GenAI Infrastructure, Distributed Systems
Sign up with Clera and we'll reach out the moment a role actually fits you — no more spraying applications into the void.
Full-time
Posted 4d ago
~40 hrs/week
Responsibilities
Define the capabilities, workflows, and use cases for the NR-NEXUS AI inference platform. Collaborate with R&D, Marketing, and customers to translate market needs into technical requirements and PoCs.
Requirements
Requires over 5 years of experience as a Technical Product Manager with a strong software engineering background and experience in SaaS or AI products. Proficiency with AI programming tools and knowledge of LLMs, MLOps, and Kubernetes is highly advantageous.
Full job description
NeuReality is looking for a sharp, hands-on Product Manager to help define NR-NEXUS, our next-generation AI inference platform.
This role is ideal for experienced product manager who is fast-learning, technical, and ambitious. You’ll work closely with Product, R&D, Marketing and customer’s engineering team to turn market needs, customer feedback, and AI infrastructure trends into clear product requirements, workflows, demos and PoCs.
What You’ll Do
Help define NR-NEXUS product capabilities, workflows, and use cases
Write clear PRDs, user stories, feature briefs, and competitive notes
Research AI infrastructure, SaaS platforms, model serving, and inference trends
Work with engineering to translate technical capabilities into product value
Support customer and partner discovery with sharp product thinking
Requirements
5+ years of experience as a Technical Product Manager
Strong software engineering background
Experience with SaaS and/or AI products- Must
High ownership, strong curiosity, and real hunger to grow
Strong writing, analytical thinking, and ability to learn fast
Comfortable with AI programming tools such as Cursor, Copilot, ChatGPT, Claude Code, or similar - advantage
Big Advantage
Experience with AI infrastructure, LLMs, inference, APIs, developer platforms, MLOps, observability, Kubernetes, Hugging Face, vLLM, or similar technologies.
Transforming AI Infrastructure into a Unified Inference Platform.
Industry
Semiconductor Manufacturing
Company size
51-200 employees
Founded
2019
Headquarters
Tel Aviv, Tel-Aviv District
LinkedIn followers
7,825
Total funding
$60M
AI infrastructure has a hidden problem: the network and orchestration layer.
As models scale to trillions of parameters and inference demand explodes, two bottlenecks emerge: how data moves between GPUs and how workloads are managed across them.
The industry added more GPUs, scaled clusters, optimized models. But utilization still hovers around 50-70%. The compute is there, idle, burning watts.
The bottleneck isn't the silicon. It's how data moves and how work gets distributed.
Traditional networking was built for general-purpose workloads, not AI's east-west traffic and microsecond-sensitive synchronization. Traditional orchestration treats GPUs as generic compute, blind to the demands of prefill, decode, and model synchronization.
Every GPU cycle wasted waiting is money and energy lost.
We asked: What if the network wasn't just faster, but intelligent? What if orchestration understood AI workloads natively?
NR-NEXUS is an inference operating system for large-scale inference. Hardware-agnostic, it unifies fragmented open-source frameworks into a single production platform, running across hyperscale clouds, GPU clusters, and emerging XPUs.
NR2 AI-SuperNIC eliminates data-movement bottlenecks limiting GPU utilization. It executes the networking data path in hardware with no CPUs in the critical path, integrates in-network compute to offload communication operations, and supports open Ethernet-based networking.
Together, they transform distributed GPU and XPU clusters into high-throughput token factories.
The result: GPUs at near-100% utilization. Inference scales without adding racks. Energy consumption drops.
This isn't incremental optimization. It's rethinking the data path and control plane so AI infrastructure matches AI ambition.
For our customers: maximum performance from existing hardware. Lower cost, lower power, lower latency, higher throughput.
NeuReality is headquartered in Tel Aviv with offices across North America and Europe.
Offices: 10 Kremenetski, Tel Aviv, Tel-Aviv District 6789910, IL · 14 Tarshish Street, Caesarea, 3079559, IL · 2880 Zanker Rd, 203, San Jose, California 95134, US · Kamienna 21, Krowodrza, Małopolskie 31-403, PL
Machine LearningArtificial IntelligenceSemiconductorsAI InferenceData CentersAI InfrastructureGenerative AILarge Language ModelsAI DeploymentsAI Systems Engineering
Transforming AI Infrastructure into a Unified Inference Platform.
Industry
Semiconductor Manufacturing
Company size
51-200 employees
Founded
2019
Headquarters
Tel Aviv, Tel-Aviv District
LinkedIn followers
7,825
Total funding
$60M
AI infrastructure has a hidden problem: the network and orchestration layer.
As models scale to trillions of parameters and inference demand explodes, two bottlenecks emerge: how data moves between GPUs and how workloads are managed across them.
The industry added more GPUs, scaled clusters, optimized models. But utilization still hovers around 50-70%. The compute is there, idle, burning watts.
The bottleneck isn't the silicon. It's how data moves and how work gets distributed.
Traditional networking was built for general-purpose workloads, not AI's east-west traffic and microsecond-sensitive synchronization. Traditional orchestration treats GPUs as generic compute, blind to the demands of prefill, decode, and model synchronization.
Every GPU cycle wasted waiting is money and energy lost.
We asked: What if the network wasn't just faster, but intelligent? What if orchestration understood AI workloads natively?
NR-NEXUS is an inference operating system for large-scale inference. Hardware-agnostic, it unifies fragmented open-source frameworks into a single production platform, running across hyperscale clouds, GPU clusters, and emerging XPUs.
NR2 AI-SuperNIC eliminates data-movement bottlenecks limiting GPU utilization. It executes the networking data path in hardware with no CPUs in the critical path, integrates in-network compute to offload communication operations, and supports open Ethernet-based networking.
Together, they transform distributed GPU and XPU clusters into high-throughput token factories.
The result: GPUs at near-100% utilization. Inference scales without adding racks. Energy consumption drops.
This isn't incremental optimization. It's rethinking the data path and control plane so AI infrastructure matches AI ambition.
For our customers: maximum performance from existing hardware. Lower cost, lower power, lower latency, higher throughput.
NeuReality is headquartered in Tel Aviv with offices across North America and Europe.
Offices: 10 Kremenetski, Tel Aviv, Tel-Aviv District 6789910, IL · 14 Tarshish Street, Caesarea, 3079559, IL · 2880 Zanker Rd, 203, San Jose, California 95134, US · Kamienna 21, Krowodrza, Małopolskie 31-403, PL
Machine LearningArtificial IntelligenceSemiconductorsAI InferenceData CentersAI InfrastructureGenerative AILarge Language ModelsAI DeploymentsAI Systems Engineering