Job Description

Key Responsibilities:

  • Enable SRE support and monitoring for HPE Networking SASE products to ensure applications meet performance and availability requirements.
  • Design and implement strategies to detect, troubleshoot, and resolve issues using monitoring tools such as Prometheus, Grafana, or Datadog.
  • Maintain high availability and performance of cloud-based applications and services.
  • Collaborate with development teams to enhance application performance, reliability, and scalability in production.
  • Analyze monitoring data (logs, metrics, traces) to identify trends, optimize performance, and prevent incidents.
  • Manage Kubernetes clusters, containerized applications, and deployment pipelines (Docker, Helm).
  • Lead and participate in incident management, root cause analysis, and preventive actions to reduce recurrence.
  • Deploy and maintain applications in public cloud environments (AWS, Azure, GCP).
  • Sup...
