Clera - Your AI talent agent
LoginStart
Start
Stuut logo
Stuut

Member of Technical Staff - Audio and Voice

full-time•San Francisco

Summary

Location

San Francisco

Type

full-time

Experience

5-10 years

Company links

WebsiteLinkedInLinkedIn

About this role

Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster for companies that have historically relied on manual processes that are labor intensive and costly. Our platform is gaining traction with finance teams across industrials, chemicals, and manufacturing sectors from Fortune 10 brands to scaling midmarkets. We're backed by top-tier investors including a16z, Khosla, Activant, 1984 Ventures and Page One.

The Role
We’re hiring a Member of Technical Staff – Audio and Voice Systems to design, build, and deploy AI-powered voice and audio systems that solve real-world financial operations challenges. You’ll take state-of-the-art research in speech, audio, and multimodal AI and translate it into production-grade, real-time voice experiences that deliver measurable customer impact.

From intelligent voice agents that interact with customers to audio-driven workflow automation and speech-based data extraction, you’ll create scalable, reliable AI systems that integrate seamlessly into Stuut’s platform. Your work will directly shape how finance teams and their customers interact with Stuut through voice.

This is a hands-on role for an engineer who thrives at the intersection of audio AI, real-time systems, and practical business impact—turning cutting-edge models into trusted, delightful voice experiences for enterprise finance workflows.

What You’ll Do

  • Build & Deploy Voice AI Systems: design and ship production-ready audio and voice-based AI features, including real-time voice agents and speech-driven workflows.

  • Craft High-Quality Voice UX: use modern speech-to-text, text-to-speech, and conversational AI platforms to create natural, responsive, and emotionally aware voice experiences tailored to financial use cases.

  • Adapt & Fine-Tune Audio and Multimodal Models: fine-tune and optimize speech, audio, and LLM-based models for accuracy, latency, and reliability in real-world environments.

  • Engineer Real-Time, Scalable AI Pipelines: build end-to-end AI/ML pipelines spanning audio ingestion, streaming inference, orchestration, and monitoring with enterprise-grade availability and performance.

  • Establish Evaluation & Monitoring Frameworks (LLMOps): design rigorous evaluation systems to measure quality, latency, accuracy, drift, and business outcomes for voice and text-based AI systems.

  • Automate Financial Workflows via Voice: develop AI-powered voice automations that reduce manual effort in collections, reconciliation, and customer communication.

  • Collaborate Cross-Functionally: partner with Product, Engineering, Design, and customers to translate business needs into effective, user-centered voice AI solutions.

  • Measure & Communicate Impact: define success metrics and continuously improve AI systems based on real-world usage and customer feedback.

You Might Be a Fit If You…

  • Have 5+ years of software engineering experience, with 2+ years focused on applied AI/ML, speech, or audio systems in production.

  • Have built and shipped voice, audio, or conversational AI systems used by real customers.

  • Have experience with speech-to-text, text-to-speech, audio processing, or multimodal models.

  • Have integrated and fine-tuned LLMs for conversational or agent-based systems.

  • Understand LLMOps / MLOps best practices, including deployment pipelines, monitoring, evaluation, and A/B testing.

  • Are fluent in Python and experienced with PyTorch, TensorFlow, Transformers, or audio ML frameworks.

  • Have built real-time or low-latency systems and understand the tradeoffs involved.

  • Can translate business and UX requirements into robust, scalable AI solutions.

  • Have experience integrating AI systems into existing enterprise or SaaS platforms.

  • Enjoy working on ambiguous problems where product definition, UX, and engineering meet.

Compensation

  • Top-of-market salary and equity package

  • Benefits (for U.S.-based full-time employees)

  • Medical, dental & vision insurance coverage for you

  • 401(k) & Match

  • Equity

  • Flexible PTO

  • Parental Leave

What you'll do

  • Design, build, and deploy AI-powered voice and audio systems for financial operations. Collaborate with cross-functional teams to create scalable and reliable AI solutions that enhance customer interactions.

About Stuut

Stuut is the AI platform that does accounts receivable work instead of just assisting with it. We collect 40% more revenue for businesses by autonomously handling their entire AR process—from customer outreach to payment collection—with seamless integration that's live in days, not months. We're backed by some of the best investors: a16z, Khosla, Activant, 1984, Carya, and Page One. Also, we are hiring!

Ready to join Stuut?

Take the next step in your career journey

Frequently Asked Questions

What does a Member of Technical Staff - Audio and Voice do at Stuut?

Toggle
As a Member of Technical Staff - Audio and Voice at Stuut, you will: design, build, and deploy AI-powered voice and audio systems for financial operations. Collaborate with cross-functional teams to create scalable and reliable AI solutions that enhance customer interactions..

Is the Member of Technical Staff - Audio and Voice position at Stuut remote?

Toggle
The Member of Technical Staff - Audio and Voice position at Stuut is based in San Francisco, United States. Contact the company through Clera for specific work arrangement details.

How do I apply for the Member of Technical Staff - Audio and Voice position at Stuut?

Toggle
You can apply for the Member of Technical Staff - Audio and Voice position at Stuutdirectly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process.
Clera - Your AI talent agent
© 2026 Clera Labs, Inc.TermsPrivacyHelp

Join Clera's Talent Pool

Get matched with similar opportunities at top startups

This role is hosted on Stuut's careers site.
Join our talent pool first to get notified about similar roles that match your profile.