Job Description
We are seeking a talented and experienced **Senior Site Reliability Engineer (SRE)** to join our dynamic team.
**Responsibilities**
- Design and maintain Kubernetes resource manifests, deploying them into clusters on platforms like AKS or GKE
- Create and manage continuous deployment pipelines using tools like Helm and ArgoCD
- Optimize observability by implementing monitoring, logging, and tracing solutions
- Maintain and manage CI/CD processes within Azure DevOps or similar environments
- Develop and implement solutions on cloud platforms, leveraging expertise in at least one provider (e.g., Microsoft Azure, GCP, AWS)
**Requirements**
- At least 3 years of programming experience, preferably in Go
- Hands-on experience with at least one scripting language (e.g., Bash or Python)
- Proficiency with Kubernetes, with at least 3 years of practical expertise
- Fundamental knowledge of observability tools, with a focus on Prometheus or similar monitoring...
Apply for this Position
Ready to join EPAM Systems, Inc.? Click the button below to submit your application.
Submit Application