Job Description
Key Responsibilities:
- Enable SRE support and monitoring for HPE Networking SASE products to ensure applications meet performance and availability requirements.
- Design and implement strategies to detect, troubleshoot, and resolve issues using monitoring tools such as Prometheus, Grafana, or Datadog.
- Maintain high availability and performance of cloud-based applications and services.
- Collaborate with development teams to enhance application performance, reliability, and scalability in production.
- Analyze monitoring data (logs, metrics, traces) to identify trends, optimize performance, and prevent incidents.
- Manage Kubernetes clusters, containerized applications, and deployment pipelines (Docker, Helm).
- Lead and participate in incident management, root cause analysis, and preventive actions to reduce recurrence.
- Deploy and maintain applications in public cloud environments (AWS, Azure, GCP).
- Sup...
Apply for this Position
Ready to join Hewlett Packard Enterprise? Click the button below to submit your application.
Submit Application