Job Description
The Speech Data Engineer is a key specialist bridging the data market with our technological needs. You will be responsible for identifying unique data sources, evaluating their quality, and building strong relationships with data providers.
Requirements
- Hands-on experience with speech data processing and labeling tools, such as VAD, Pyannote, whisper, and other segmentation or diarization frameworks.
- Familiarity with quality assessment metrics, including SNR (Signal-to-Noise Ratio) and other acoustic analysis indicators.
- Collect, process, and curate speech datasets, including audio recordings, transcripts, and metadata for multilingual ASR and TTS applications.
- Work closely with internal ASR/TTS development teams to align dataset specifications with model training needs.
- Label and validate audio data, ensuring transcription accuracy, speaker diversity, and consistent metadata standards.
Responsibilities
Apply for this Position
Ready to join AIPHORIA? Click the button below to submit your application.
Submit Application