Job Description

Overview

Are you a HPC Infrastructure Reliability Engineer seeking a new interesting challenge? If your answer is yes, it’s your lucky day so keep reading, it can be just what you're looking for.

Responsibilities

  • Manage and optimize high-performance physical infrastructure (servers, GPUs, and advanced networking)
  • Ensure availability, performance, and reliability of HPC and AI environments
  • Drive infrastructure automation (IaC) and enable zero-touch provisioning
  • Oversee the full hardware lifecycle (capacity planning, deployment, and decommissioning)
  • Work with tools such as HPE OneView, Lenovo XClarity, and ServiceNow CMDB
  • Collaborate with R&D;, science, and engineering teams to design optimal infrastructure solutions
  • Optimize resource utilization (CPU/GPU) and improve overall infrastructure efficiency

Qualifications

  • 5–7+ years of experience in Data Center Engineering, Bar...

Apply for this Position

Ready to join Tata Consultancy Services? Click the button below to submit your application.

Submit Application