Job Description

Responsibilities-

Manage Kubernetes clusters, rollouts, and scaling.

- Implement observability using Dynatrace and logging pipelines.

- Develop IaC using Terraform/Bicep.

- Monitor SLOs, SLIs, error budgets, and service uptime.

- Support incident response and root cause analysis.

- Automate K8s and platform operational workflows.

- Collaborate with engineering teams on reliability improvements.

Skills

- Deep understanding of Kubernetes operations.

- Experience with Dynatrace or equivalent APM tools.

- Strong Terraform or Bicep IaC experience.

- Knowledge of reliability engineering principles.

- Understanding of distributed systems behavior.

- Experience with CI/CD deployment strategies.

- Good troubleshooting and incident management skills.


Skills Required
Reliability Engineering, Distributed Systems, Terraform, Dynatrace, Kubernetes

Apply for this Position

Ready to join APPIT Software Inc? Click the button below to submit your application.

Submit Application