Job Description

Elevate system reliability as a Senior Site Reliability Engineer (SRE) focused on production applications. Analyze performance and implement optimizations to ensure exceptional system uptime.
We are looking for a highly experienced SRE to oversee reliability, optimization, and incident response. Key responsibilities include implementing best practices for performance and scalability, troubleshooting issues, and leading incident resolutions. You will also engage in capacity planning and operational automation to enhance efficiency.
Key Responsibilities:
• Implement best practices for high availability
• Set up monitoring and troubleshoot incidents
• Analyze resource usage for optimization
• Automate operational workflows and integrations
• Lead incident resolution and performance monitoring
Requirements:
• Proficient in Dynatrace and ELK Stack
• Experience with monitoring tools, AI Ops
• Advanced skills in Python, PowerShell, Shell Scripting
• Knowled...

Apply for this Position

Ready to join TATA Consultancy Services? Click the button below to submit your application.

Submit Application