What we're looking for
We need a builder with 2-6 years of hands-on experience in audio, speech, or voice AI who has direct experience with speech/voice models (like ASR/TTS) or building production voice agents. You should be comfortable with ambiguity and moving quickly in an early-stage startup environment and have a track record of shipping production-grade systems. Bonus points if you have an MS/PhD in a relevant field or are an ex-founder.
What you'll do
Build and ship speech/voice AI processing capabilities and production-grade audio workflows.
Take ownership of improving the quality and performance of our large data processing pipeline end-to-end.
Work closely with the CTO and Ops team to research and determine the next generation of audio data needed by AI labs.
Design and implement evaluation and quality measurement for speech/voice performance in production.
Lead the development of new audio datasets to help the company get ahead of customer requirements.
Work across the stack to productionize voice systems, from data pipelines and model inference to product integration and monitoring.
What you'll do
Build and ship speech/voice AI processing capabilities and production-grade audio workflows.
Take ownership of improving the quality and performance of our large data processing pipeline end-to-end.
Work closely with the CTO and Ops team to research and determine the next generation of audio data needed by AI labs.
Design and implement evaluation and quality measurement for speech/voice performance in production.
Lead the development of new audio datasets to help the company get ahead of customer requirements.
Work across the stack to productionize voice systems, from data pipelines and model inference to product integration and monitoring.
About Besimple AI
Good data is the key to fine-tuning and evaluation of language models, and most models hallucinate without human-in-the-loop. That's why we built Besimple AI for AI teams of all sizes and budgets. We build data flywheel for AI companies so you can continuously improve and monitor your models with human-in-the-loop. We got you covered across text, image, audio and more.
Ready to join Besimple AI?
Take the next step in your career journey
Frequently Asked Questions
What does Besimple AI pay for a Founding Engineer, Audio AI?
Besimple AI offers a competitive compensation package for the Founding Engineer, Audio AI role. The salary range is USD 180k - 260k per year, plus Up to 1.3% equity. Apply through Clera to learn more about the full compensation details.
What does a Founding Engineer, Audio AI do at Besimple AI?
As a Founding Engineer, Audio AI at Besimple AI, you will: build and ship speech/voice AI processing capabilities and production-grade audio workflows.; take ownership of improving the quality and performance of our large data processing pipeline end-to-end.; work closely with the CTO and Ops team to research and determine the next generation of audio data needed by AI labs.; and more.
Is the Founding Engineer, Audio AI position at Besimple AI remote?
The Founding Engineer, Audio AI position at Besimple AI offers a hybrid work arrangement, with office presence required in San Mateo, United States. This combines the flexibility of remote work with in-person collaboration.
How do I apply for the Founding Engineer, Audio AI position at Besimple AI?
You can apply for the Founding Engineer, Audio AI position at Besimple AI directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process.