Job Description

We are expanding our team and are looking for a skilled, goal-oriented MLE (TTS) to join us.

Requirements

  • 3+ years of hands-on experience with Text-to-Speech (TTS) / speech synthesis.
  • Proficiency in Python and deep learning frameworks (especially PyTorch).
  • Strong understanding of speech synthesis processing techniques.
  • Experience with fast attention-based models (e.g., FastPitch, FastSpeech 2) and modern variational and flow-based approaches (e.g., VITS, Glow-TTS).
  • Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis.
  • Knowledge of text normalization techniques, including finite-state transducers (FSTs) and neural-network-based normalization.
  • Familiarity with TTS evaluation techniques, including MOS and A/B testing.
  • Familiarity with vocoder models (e.g., Vocos, HiFi-GAN, Mimi).
  • Knowledge of signal processing, statistical modeling, and language structure.

Responsibi...

Apply for this Position

Ready to join Aiphoria? Submit your application.