Job Description

Job Description:

We are seeking an experienced Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our production systems. The role focuses on DevOps practices, automation, and cloud infrastructure to support highly available and resilient applications.

Key Responsibilities:

  • Design, build, and maintain CI/CD pipelines for reliable and frequent deployments
  • Manage containerized environments using Docker and Kubernetes
  • Monitor system health, performance, and availability using modern monitoring and alerting tools
  • Ensure high availability, scalability, and fault tolerance of cloud infrastructure
  • Automate operational tasks and improve system reliability through SRE best practices
  • Collaborate with development teams to improve deployment, observability, and incident response

Key Skills:

  • Strong experience in DevOps an...

Apply for this Position

Ready to join Zenith Services Inc.? Click the button below to submit your application.

Submit Application