Job Description

Overview

Looking to push the boundaries of generative AI for real-time interaction? You'll be joining a well-funded startup working on multimodal AI, where voice, vision, and language come together. They're building generative models for natural conversational experiences that need to perform in real time.

Your mission is to build and optimise diffusion or flow-matching models that power their speech and audio generation. This means developing production-ready architectures that can generate controllable, high-quality output at scale. You'll own the full research-to-production pipeline, from architecture design and training through deployment and optimisation. Your work will directly impact how millions of AI characters sound and interact.

Responsibilities

  • Design and train large-scale diffusion or flow-matching models
  • Develop novel architectures and training techniques to improve controllability and quality
  • Build evalu...
