
This role offers the rare opportunity to shape foundational technology in a space where the boundaries are still being defined.
You'll be building the first human foundation model that operates across text, speech, facial expression, and body language in real time. This unified model understands fine-grained human signals, from a quirked eyebrow to a subtle change in voice, and infers meaning in context.
You will generate lifelike, responsive avatars whose expressions, gestures, and tone evolve frame-by-frame to deliver genuine responses.