Job Description
Speech AI Engineer – Real-Time Interpretation & Simultaneous
Role: Speech AI Engineer – Real-Time Interpretation & Simultaneous
Location: Remote
Team: AI & Innovation
Reports to: VP of Artificial Intelligence
About BIG Language Solutions
BIG Language Solutions is a global Language Service Provider (LSP) delivering world-class translation and interpretation services for clients across industries. We combine human linguistic expertise with cutting-edge AI to make multilingual communication faster, more accurate, and more accessible. Our innovation spans both written and spoken language solutions—helping organizations break barriers in real time and at scale.
Job Summary
We are looking for a Speech AI Engineer with strong expertise in real-time speech recognition, machine translation, and low-latency audio processing to build and optimize AI systems for simultaneous interpretation. The role focuses on designing scalable, high-accuracy speech-to-speech and speech-to-text solutions used in live multilingual environments.
Key Responsibilities
Speech & Language AI Development
• Design, develop, and optimize real-time ASR (Automatic Speech Recognition) systems with low latency
• Build and improve simultaneous machine translation (SMT) pipelines for live interpretation
• Develop speech-to-text, text-to-text, and speech-to-speech workflows
• Optimize models for streaming audio and real-time inference
Model Training & Optimization
• Train, fine-tune, and evaluate deep learning models for speech and language tasks
• Improve accuracy across accents, dialects, and noisy environments
• Apply techniques such as model compression, quantization, and distillation for real-time performance
System & Infrastructure
• Work with real-time audio pipelines, streaming protocols, and latency-sensitive systems
• Deploy models using cloud and edge environments (AWS, GCP, Azure, on-prem)
• Collaborate on scalable microservices and APIs for live interpretation platforms
Research & Innovation
• Stay updated with the latest research in speech AI, multilingual NLP, and real-time translation
• Experiment with LLMs, foundation models, and multimodal architectures
• Translate research findings into production-ready solutions
Collaboration
• Work closely with product, linguistics, and engineering teams
• Support integration with client platforms and real-world use cases
• Assist in evaluating language quality and real-time performance metrics
Required Qualifications
Technical Skills
• Strong experience in Speech AI / NLP / Machine Learning
• Hands-on expertise with ASR, TTS, and Machine Translation systems
• Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow)
• Experience with streaming inference and low-latency systems
• Familiarity with audio processing (signal processing, codecs, noise handling)
AI & ML Knowledge
• Solid understanding of transformers, seq2seq models, CTC, RNN-T, attention mechanisms
• Experience with multilingual and cross-lingual models
• Knowledge of evaluation metrics for speech and translation quality
Preferred Qualifications
• Experience with simultaneous interpretation or live captioning systems
• Exposure to Whisper, wav2vec, NeMo, Kaldi, Marian, Fairseq, or similar frameworks
• Experience working with LLMs and speech-enabled agents
• Background in linguistics or multilingual AI
• Experience optimizing AI systems for production and scale
Soft Skills
• Strong problem-solving and analytical mindset
• Ability to work in fast-paced, cross-functional teams
• Clear communication of technical concepts to non-technical stakeholders
Nice to Have
• Experience with real-time conferencing or telephony platforms
• Knowledge of compliance and data privacy for speech data
• Familiarity with human-in-the-loop interpretation workflows
Think global. Think BIG.
Visit us:
Linkedin:
Role: Speech AI Engineer – Real-Time Interpretation & Simultaneous
Location: Remote
Team: AI & Innovation
Reports to: VP of Artificial Intelligence
About BIG Language Solutions
BIG Language Solutions is a global Language Service Provider (LSP) delivering world-class translation and interpretation services for clients across industries. We combine human linguistic expertise with cutting-edge AI to make multilingual communication faster, more accurate, and more accessible. Our innovation spans both written and spoken language solutions—helping organizations break barriers in real time and at scale.
Job Summary
We are looking for a Speech AI Engineer with strong expertise in real-time speech recognition, machine translation, and low-latency audio processing to build and optimize AI systems for simultaneous interpretation. The role focuses on designing scalable, high-accuracy speech-to-speech and speech-to-text solutions used in live multilingual environments.
Key Responsibilities
Speech & Language AI Development
• Design, develop, and optimize real-time ASR (Automatic Speech Recognition) systems with low latency
• Build and improve simultaneous machine translation (SMT) pipelines for live interpretation
• Develop speech-to-text, text-to-text, and speech-to-speech workflows
• Optimize models for streaming audio and real-time inference
Model Training & Optimization
• Train, fine-tune, and evaluate deep learning models for speech and language tasks
• Improve accuracy across accents, dialects, and noisy environments
• Apply techniques such as model compression, quantization, and distillation for real-time performance
System & Infrastructure
• Work with real-time audio pipelines, streaming protocols, and latency-sensitive systems
• Deploy models using cloud and edge environments (AWS, GCP, Azure, on-prem)
• Collaborate on scalable microservices and APIs for live interpretation platforms
Research & Innovation
• Stay updated with the latest research in speech AI, multilingual NLP, and real-time translation
• Experiment with LLMs, foundation models, and multimodal architectures
• Translate research findings into production-ready solutions
Collaboration
• Work closely with product, linguistics, and engineering teams
• Support integration with client platforms and real-world use cases
• Assist in evaluating language quality and real-time performance metrics
Required Qualifications
Technical Skills
• Strong experience in Speech AI / NLP / Machine Learning
• Hands-on expertise with ASR, TTS, and Machine Translation systems
• Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow)
• Experience with streaming inference and low-latency systems
• Familiarity with audio processing (signal processing, codecs, noise handling)
AI & ML Knowledge
• Solid understanding of transformers, seq2seq models, CTC, RNN-T, attention mechanisms
• Experience with multilingual and cross-lingual models
• Knowledge of evaluation metrics for speech and translation quality
Preferred Qualifications
• Experience with simultaneous interpretation or live captioning systems
• Exposure to Whisper, wav2vec, NeMo, Kaldi, Marian, Fairseq, or similar frameworks
• Experience working with LLMs and speech-enabled agents
• Background in linguistics or multilingual AI
• Experience optimizing AI systems for production and scale
Soft Skills
• Strong problem-solving and analytical mindset
• Ability to work in fast-paced, cross-functional teams
• Clear communication of technical concepts to non-technical stakeholders
Nice to Have
• Experience with real-time conferencing or telephony platforms
• Knowledge of compliance and data privacy for speech data
• Familiarity with human-in-the-loop interpretation workflows
Think global. Think BIG.
Visit us:
Linkedin:
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application