Job Description

The Senior Site Reliability Engineer (SRE) will be responsible for ensuring the availability, performance, scalability, and operational efficiency of the Informatix cloud platform. This role is focused on reducing manual operations work (toil), automating system reliability, and ensuring production-grade observability. The ideal candidate is a systems-focused engineer who is passionate about uptime, incident response, and continuous improvement through engineering solutions.

Essential Duties and Responsibilities

  • Serve as a primary contributor to the on-call rotation to maintain 24/7 uptime for production systems.
  • Proactively, monitor, and continuously improve SLAs, SLOs, and SLIs across critical services.
  • Develop and maintain robust observability tooling including logging, metrics, and tracing (e.g., Azure Monitor, OpenTelemetry, Prometheus).
  • Proactively conduct postmortems and root cause analysis; implement fixes to prevent repeat in...
  • Apply for this Position

    Ready to join ATEC Spine? Click the button below to submit your application.

    Submit Application