Job Description

***** Immediate Joiner *****


Job title: Senior AI Engineer — LLM / RAG / MLOps (Python, Azure)

Location: Hybrid (office + remote) — [Vijayawada]

Employment type: Full-time

Experience: 4–5 years

Job Code: 1113


About us

Arvya Tech is a service-first SaaS startup building AI-powered products that deliver enterprise-grade ML features to customers. We need a pragmatic Senior AI Engineer who can deliver production systems end-to-end: architect, implement, deploy, monitor, and scale ML/LLM features with strong ownership.


What you’ll own

  • Design, build and operate production LLM-based features using Retrieval-Augmented Generation (RAG) and vector retrieval.
  • Deploy and maintain model-serving infra on Azure (Azure ML, AKS), ensure high availability and low latency under realistic traffic.
  • Implement robust MLOps pipelines: model packaging, registry, CI/CD, automated canary/rollback strategies, and retraining triggers.
  • Instrument models and data with observability: model performance metrics, drift detection, alerts and SLOs.
  • Optimize inference throughput and cost: batching, caching, quantization or distillation where appropriate.
  • Lead and mentor other engineers;
    manage multiple projects and deliverables concurrently.
  • Collaborate with product, security, and data teams to ensure data governance and safe model behavior.




Must-have qualifications

  • 4–5 years of professional experience building production software;
    demonstrable ownership of shipping ML/AI features.
  • Expert-level Python (async, profiling, packaging, typing).
  • Hands-on experience with RAG/LLM systems: vector stores, embedding pipelines, retrieval, prompt engineering, and hallucination mitigation.
  • Strong MLOps / production deployment experience: CI/CD, model registries, monitoring, and incident response.
  • Azure platform experience: Azure ML, AKS or equivalent experience on public cloud with managed Kubernetes.
  • Experience with containerization (Docker), Kubernetes, and infra-as-code (Terraform/ARM).
  • Experience designing for scale, traffic management, and latency-sensitive systems.
  • Excellent communication skills and experience mentoring engineers.


Nice-to-have

  • Experience with Hugging Face, LangChain or other LLM orchestration libraries.
  • Familiarity with GPU inference optimizations and multi-GPU setups.
  • Experience with SaaS multi-tenant design, usage billing telemetry and feature flags.

What we offer

  • Competitive compensation.
  • Hybrid working model.
  • Opportunity to shape a product used by enterprise customers and to lead technical decisions.


For Quick Response:

Send your resume and a short note about a production ML/LLM project you led (link to repo or architecture diagram encouraged).

Email: [email protected]

Subject line format: 1113 – Senior AI Engineer

Apply for this Position

Ready to join ? Click the button below to submit your application.

Submit Application