Job Description
We are seeking a highly skilled Site Reliability Engineer (SRE) with deep expertise in Kubernetes and cloud platforms (AWS, Azure, or GCP).
The SRE will be responsible for designing, deploying, automating, and supporting highly available, scalable, and secure containerized applications in cloud-native environments. You will work closely with development, operations, and security teams to ensure the reliability, performance, and efficiency of our production systems.
Key Responsibilities
- Design, deploy, and manage Kubernetes clusters, whether on-premises or cloud-managed (such as EKS, AKS, or GKE), to support scalable microservices architectures.
- Automate infrastructure provisioning and application deployment using Infrastructure as Code (IaC) tools such as Terraform, Helm, or CloudFormation.
- Monitor, troubleshoot, and optimize system performance using observability tools.
- Implement and manage CI/CD pipelines to ensure rapid, repe...
Apply for this Position
Ready to join iXceed Solutions? Click the button below to submit your application.