Job Description

Description GSPANN is hiring an SRE Engineer to build and operate highly reliable, observable, and scalable systems. The role focuses on monitoring, automation, Kubernetes platforms, and incident management across hybrid environments.

Role and Responsibilities

  • Build, deploy, and operate reliability, monitoring, and observability solutions across application, infrastructure, cloud, and on-premises environments.
  • Design, configure, and maintain dashboards and alerting using tools such as AppDynamics, Grafana, Sumo Logic, Datadog, Splunk, Dynatrace, or equivalent platforms.
  • Automate deployments, monitoring, remediation, and operational workflows using Python, Bash, PowerShell, Terraform, and Ansible.
  • Develop self-healing mechanisms to reduce operational toil and improve overall system resilience.
  • Troubleshoot complex production issues, lead incident response during Priority 1 and Priority 2 (P1/P2) events, perform root cause ana...
  • Apply for this Position

    Ready to join GSPANN? Click the button below to submit your application.

    Submit Application