Job Description

Mandatory skills: CI/CD; AWS and/or GCP; Python, Bash, or Groovy; monitoring tools such as Datadog; Ansible; JMeter.



Key Responsibilities

• Support and enhance observability (monitoring, logging, alerting) across production systems

• Help maintain SLIs/SLOs for key services

• Participate in evaluating services for production readiness

• Collaborate with development teams to identify reliability risks and improve system architecture

• Contribute to automation of operations, including CI/CD pipelines, incident response, and infrastructure provisioning

• Participate in incident response and on-call rotations for critical services

• Contribute to post-incident analysis and drive reliability improvements

• Partner with security, infrastructure, and product teams to support performance, compliance, and operational excellence



Apply for this Position

Ready to join TechDigital Corporation? Submit your application.