Job Description

About Us:
We are looking for a Gen AI - Engineer to join our growing team to help design, build, fine-tune, and deploy cutting - edge generative AI models and agentic systems. You will work on the full lifecycle of foundational model development - involving both large and small language models (LLMs and SLMs) - to create scalable AI solutions that address diverse needs across different business domains. This role is ideal for proactive individuals with a strong foundation in machine learning and an experimental mindset, who are passionate about driving transformative advancements in generative AI from research to real-world production impact.
Key Responsibilities:
- Develop and train foundational generative AI models across modalities such as text-to-text, text-to-speech, automatic speech recognition, and vision language.
- Fine-tune and adapt models for specific tasks and domains.
- Build and maintain pipelines for data curation, preprocessing, training, evaluation, and continuous improvement of models.
- Implement debugging, CI/CD, and observability to ensure reliability and efficiency across the development lifecycle.
- Develop retrieval-augmented generation (RAG) pipelines and optimize prompt engineering strategies.
- Optimize training and inference performance through quantization, distributed training/inference, GPU/TPU acceleration.
- Monitor, benchmark, and improve model performance with a focus on accuracy, efficiency, and reducing hallucinations.
- Collaborate with cross-functional teams to build robust AI stacks and integrate them seamlessly into production pipelines for deployment.
- Document technical processes, AI model architectures, and experimental results, while maintaining well-structured, version-controlled code repositories.
- Stay current with advancements in transformer architectures, open-source releases, and AI tooling.
Minimum Qualifications and Experience:
- Bachelor’s or Master’s in Computer Science, AI/ML, Data Science or any related field with 2 to 5 years of industry experience in applied machine learning or AI development.
Required Expertise:
- Proficiency in Python programming with solid foundation in computer science fundamentals such as data structures and algorithms.
- Strong problem-solving skills and demonstrated ability to lead projects.
- Hands-on experience with a few of the tools listed below:
- One or more model libraries and ML frameworks such as Tensor Flow, Py Torch, HF Transformers, Ne Mo, etc.
- AI application libraries and orchestration frameworks such as DSPy, Langgraph, Langchain, Llamaindex, etc.
- GPU/TPU based training and inference using libraries such as v LLM.
- Distributed training tools such as SLURM, Ray, Pytorch DDP, NCCL, etc.
- Version control, observability systems, and MLOps tools such as Git, DVC, W&B, MLFlow, Kube Flow, etc.
- Data analysis and curation tools such as Dask, Milvus, Apache Spark, Numpy, etc.
- Chunking, embeddings, vector databases (e.g., Pinecone, Weaviate, Milvus), and retrieval-augmented generation (RAG).
- Model context protocol (MCP), Agent to Agent (A2 A), and Agent Communication Protocol (ACP).
- Team player with excellent interpersonal skills and ability to collaborate effectively with remote team members.
- Go-getter attitude and ability to flourish in a fast-paced, startup environment.
- Prior experience of building and deploying LLMs or SLMs, experience with multimodal models, and track record of contributions to open-source AI/ML projects would be a big plus.

Apply for this Position

Ready to join ? Click the button below to submit your application.

Submit Application