Job Description
***** Immediate Joiner *****
Job title: Senior AI Engineer — LLM / RAG / MLOps (Python, Azure)
Location: Hybrid (office + remote) — [Vijayawada]
Employment type: Full-time
Experience: 4–5 years
Job Code: 1113
About us
Arvya Tech is a service-first SaaS startup building AI-powered products that deliver enterprise-grade ML features to customers. We need a pragmatic Senior AI Engineer who can deliver production systems end-to-end: architect, implement, deploy, monitor, and scale ML/LLM features with strong ownership.
What you’ll own
- Design, build and operate production LLM-based features using Retrieval-Augmented Generation (RAG) and vector retrieval.
- Deploy and maintain model-serving infra on Azure (Azure ML, AKS), ensure high availability and low latency under realistic traffic.
- Implement robust MLOps pipelines: model packaging, registry, CI/CD, automated canary/rollback strategies, and retraining triggers.
- Instrument models and data with observability: model performance metrics, drift detection, alerts and SLOs.
- Optimize inference throughput and cost: batching, caching, quantization or distillation where appropriate.
- Lead and mentor other engineers;
manage multiple projects and deliverables concurrently. - Collaborate with product, security, and data teams to ensure data governance and safe model behavior.
Must-have qualifications
- 4–5 years of professional experience building production software;
demonstrable ownership of shipping ML/AI features. - Expert-level Python (async, profiling, packaging, typing).
- Hands-on experience with RAG/LLM systems: vector stores, embedding pipelines, retrieval, prompt engineering, and hallucination mitigation.
- Strong MLOps / production deployment experience: CI/CD, model registries, monitoring, and incident response.
- Azure platform experience: Azure ML, AKS or equivalent experience on public cloud with managed Kubernetes.
- Experience with containerization (Docker), Kubernetes, and infra-as-code (Terraform/ARM).
- Experience designing for scale, traffic management, and latency-sensitive systems.
- Excellent communication skills and experience mentoring engineers.
Nice-to-have
- Experience with Hugging Face, LangChain or other LLM orchestration libraries.
- Familiarity with GPU inference optimizations and multi-GPU setups.
- Experience with SaaS multi-tenant design, usage billing telemetry and feature flags.
What we offer
- Competitive compensation.
- Hybrid working model.
- Opportunity to shape a product used by enterprise customers and to lead technical decisions.
For Quick Response:
Send your resume and a short note about a production ML/LLM project you led (link to repo or architecture diagram encouraged).
Email: [email protected]
Subject line format: 1113 – Senior AI Engineer
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application