MLOps (Python, Azure)

📍 India

Full-time | Computer Occupations | Posted January 27, 2026

Job Description

***** Immediate Joiner *****

Job title: Senior AI Engineer — LLM / RAG / MLOps (Python, Azure)
Location: Hybrid (office + remote), Vijayawada
Employment type: Full-time
Experience: 4–5 years
Job Code: 1113

About us
Arvya Tech is a service-first SaaS startup building AI-powered products that deliver enterprise-grade ML features to customers. We need a pragmatic Senior AI Engineer who can deliver production systems end-to-end: architect, implement, deploy, monitor, and scale ML/LLM features with strong ownership.

What you’ll own
Design, build and operate production LLM-based features using Retrieval-Augmented Generation (RAG) and vector retrieval.
Deploy and maintain model-serving infrastructure on Azure (Azure ML, AKS), ensuring high availability and low latency under realistic traffic.
Implement robust MLOps pipelines: model packaging, registry, CI/CD, automated canary/rollback strategies, and retraining triggers.
Instrument models and data with observability: model performance metrics, drift detection, alerts and SLOs.
Optimize inference throughput and cost: batching, caching, quantization or distillation where appropriate.
Lead and mentor other engineers; manage multiple projects and deliverables concurrently.
Collaborate with product, security, and data teams to ensure data governance and safe model behavior.

Must-have qualifications
4–5 years of professional experience building production software; demonstrable ownership of shipping ML/AI features.
Expert-level Python (async, profiling, packaging, typing).
Hands-on experience with RAG/LLM systems: vector stores, embedding pipelines, retrieval, prompt engineering, and hallucination mitigation.
Strong MLOps / production deployment experience: CI/CD, model registries, monitoring, and incident response.
Azure platform experience: Azure ML and AKS, or equivalent experience on another public cloud with managed Kubernetes.
Experience with containerization (Docker), Kubernetes, and infra-as-code (Terraform/ARM).
Experience designing for scale, traffic management, and latency-sensitive systems.
Excellent communication skills and experience mentoring engineers.

Nice-to-have
Experience with Hugging Face, LangChain or other LLM orchestration libraries.
Familiarity with GPU inference optimizations and multi-GPU setups.
Experience with SaaS multi-tenant design, usage billing telemetry and feature flags.

What we offer
Competitive compensation.
Hybrid working model.
Opportunity to shape a product used by enterprise customers and to lead technical decisions.

For Quick Response:
Send your resume and a short note about a production ML/LLM project you led (link to repo or architecture diagram encouraged).
Email: [email protected]
Subject line format: 1113 – Senior AI Engineer
