Job Description
Overview
Are you an HPC Infrastructure Reliability Engineer seeking an interesting new challenge? If your answer is yes, it's your lucky day, so keep reading; this could be just what you're looking for.
Responsibilities
- Manage and optimize high-performance physical infrastructure (servers, GPUs, and advanced networking)
- Ensure availability, performance, and reliability of HPC and AI environments
- Drive infrastructure automation (IaC) and enable zero-touch provisioning
- Oversee the full hardware lifecycle (capacity planning, deployment, and decommissioning)
- Work with tools such as HPE OneView, Lenovo XClarity, and ServiceNow CMDB
- Collaborate with R&D, science, and engineering teams to design optimal infrastructure solutions
- Optimize resource utilization (CPU/GPU) and improve overall infrastructure efficiency
Qualifications
- 5–7+ years of experience in Data Center Engineering, Bar...
Apply for this Position
Ready to join Tata Consultancy Services? Submit your application for this position.