Gen AI - Engineer

📍 Mumbai, Maharashtra, India
Full-time Engineering Services Posted January 23, 2026
Apply Now Similar Jobs
Job Description

About Us:   
We are looking for a Gen AI - Engineer to join our growing team to help design, build, fine-tune, and deploy cutting - edge generative AI models and agentic systems. You will work on the full lifecycle of foundational model development - involving both large and small language models (LLMs and SLMs) - to create scalable AI solutions that address diverse needs across different business domains. This role is ideal for proactive individuals with a strong foundation in machine learning and an experimental mindset, who are passionate about driving transformative advancements in generative AI from research to real-world production impact. 

Key Responsibilities:   
Develop and train foundational generative AI models across modalities such as text-to-text, text-to-speech, automatic speech recognition, and vision language. 
Fine-tune and adapt models for specific tasks and domains. 
Build and maintain pipelines for data curation, preprocessing, training, evaluation, and continuous improvement of models. 
Implement debugging, CI/CD, and observability to ensure reliability and efficiency across the development lifecycle. 
Develop retrieval-augmented generation (RAG) pipelines and optimize prompt engineering strategies. 
Optimize training and inference performance through quantization, distributed training/inference, GPU/TPU acceleration. 
Monitor, benchmark, and improve model performance with a focus on accuracy, efficiency, and reducing hallucinations. 
Collaborate with cross-functional teams to build robust AI stacks and integrate them seamlessly into production pipelines for deployment. 
Document technical processes, AI model architectures, and experimental results, while maintaining well-structured, version-controlled code repositories. 
Stay current with advancements in transformer architectures, open-source releases, and AI tooling. 


Minimum Qualifications and Experience:   
Bachelor’s or Master’s in Computer Science, AI/ML, Data Science or any related field with 2 to 5 years of industry experience in applied machine learning or AI development. 

Required Expertise:    
Proficiency in Python programming with solid foundation in computer science fundamentals such as data structures and algorithms. 
Strong problem-solving skills and demonstrated ability to lead projects. 
Hands-on experience with a few of the tools listed below:  
One or more model libraries and ML frameworks such as TensorFlow, PyTorch, HF Transformers, NeMo, etc. 
AI application libraries and orchestration frameworks such as DSPy, Langgraph, Langchain, Llamaindex, etc. 
GPU/TPU based training and inference using libraries such as vLLM. 
Distributed training tools such as SLURM, Ray, Pytorch DDP, NCCL, etc. 
Version control, observability systems, and MLOps tools such as Git, DVC, W&B, MLFlow, KubeFlow, etc. 
Data analysis and curation tools such as Dask, Milvus, Apache Spark, Numpy, etc. 
Chunking, embeddings, vector databases (e.g., Pinecone, Weaviate, Milvus), and retrieval-augmented generation (RAG). 
Model context protocol (MCP), Agent to Agent (A2A), and Agent Communication Protocol (ACP). 
Team player with excellent interpersonal skills and ability to collaborate effectively with remote team members. 
Go-getter attitude and ability to flourish in a fast-paced, startup environment. 
Prior experience of building and deploying LLMs or SLMs, experience with multimodal models, and track record of contributions to open-source AI/ML projects would be a big plus. 
Apply for this Position

Ready to join ? Click the button below to submit your application.
Submit Application
Job Details

Location
Mumbai, Maharashtra, India
Job Type
Full-time