Job Description

Dev Ops + ML Engineer
Experience – 7+ Years
Location – Chennai / Bangalore
Work Mode - Hybrid
Key Responsibilities:
Maintain and support machine learning applications running on Windows and Linux servers in on-premises environments.
Manage and troubleshoot Kubernetes clusters hosting ML workloads.
Collaborate with data scientists and engineers to deploy machine learning models reliably and efficiently.
Implement and maintain monitoring and alerting solutions using Data Dog to ensure system health and performance.
Debug and resolve issues in production environments using Python and monitoring tools.
Automate operational tasks to improve system reliability and scalability.
Ensure best practices in security, performance, and availability for ML applications.
Document system architecture, deployment processes, and troubleshooting guides.
Required Qualifications:
Proven experience working with Windows and Linux operating systems in production enviro...

Apply for this Position

Ready to join Mindsprint? Click the button below to submit your application.

Submit Application