Job Description
We are now expanding our team and are looking for skilled, goal-oriented MLE (TTS) to join our teams.
Requirements
- 3+ years of hands-on experience with Text-to-Speech (TTS) / speech synthesis
- Proficiency in Python and deep learning frameworks (especially, PyTorch).
- Strong understanding of speech synthesis processing techniques.
- Experience with Fast Attention-Based Models: (FastPitch, FastSpeech 2) and modern variative approaches: (e.g., VITS, Glow-TTS).
- Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis.
- Knowledge of normalization techniques, FSTs, NN for normalization.
- Familiarity with TTS evaluation techniques, including MOS and A/B testing.
- Familiarity with vocoder models (e.g. Vocos, HiFi-GAN, mimi).
- Knowledge of signal processing, statistical modeling, and language structure.
Responsibi...
Apply for this Position
Ready to join Aiphoria? Click the button below to submit your application.
Submit Application