Job Description
Hiring: Senior Data Scientist — Generative AI (4–10 Years)
Location: India (Remote) | Full-time
We’re looking for a hands-on Data Scientist who is deeply experienced with transformer architectures, multimodal AI, and model optimization: someone who loves rapid experimentation, thrives in a startup-style environment, and can take complete ownership from prototype → production.
What You’ll Work On
Build and fine-tune advanced transformer-based models (LLMs, Whisper, diffusion, LoRA, RLHF/SFT).
Work across modalities — audio, video, text — with depth in at least one.
Push the boundaries in lip-sync accuracy, character/scene consistency, audio realism, and video quality.
Design automated evaluation frameworks for image, audio, and video quality scoring.
Experiment with audio–video synchronization, background score blending, and text-to-video alignment.
Stay on top of emerging architectures, tools, and scaling strategies.
Own R&D initiatives end-to-end with strong execution and problem-solving.
What You Bring
4–10 years of experience in applied ML/AI.
Strong fundamentals in transformers, training dynamics, and optimization.
Hands-on experience with Python and PyTorch/TensorFlow.
Deep expertise in one modality (audio/video/text) with real end-to-end project work.
Experience deploying ML inference using FastAPI or similar.
Ability to design automated evaluation metrics for generative output.
High adaptability and willingness to run rapid experiments.
What You Get
Best-in-class salary — we hire the best, and pay accordingly.
Work with a world-class AI & product team.
Learn directly from leaders who have built global-scale tech products.
Exposure to cutting-edge initiatives across sports, media, and entertainment.
A fast-growing environment where your impact is visible and immediate.