Job Description

A Moving Experience.

·8+ years of hands-on experience in TTS system development with deep expertise in both frontend and backend components

·Proficiency in C/C++ and Python, with mastery of ML frameworks (PyTorch, TensorFlow, etc)

·Strong background in NLP techniques and/or speech signal processing

·Experience with linguistic tools (e.g., Festival) and phonetic knowledge

·Familiarity with transformer-based language models for prosody prediction

·Deep understanding of autoregressive / non-autoregressive acoustic models and neural vocoders

·Experience optimizing models via quantization, pruning, or knowledge distillation

·Knowledge of speech codecs (e.g., Opus, MELP) and real-time streaming protocols

·Production experience with ONNX Runtime, TensorRT, or TorchScript, etc

·Experience with zero-shot/one-shot/few-shot voice cloning or emotional TTS systems

·Skilled GPU/TPU cluster and grid user

<...

Apply for this Position

Ready to join Cerence Inc.? Click the button below to submit your application.

Submit Application