Job Description

Senior Site Reliability Engineer (SRE)

Summary

We are looking for a Senior Site Reliability Engineer (SRE) to build and operate scalable, reliable, and secure platform infrastructure. The ideal candidate will drive automation, observability, incident management, and cloud-native best practices to improve system reliability and operational excellence across distributed systems.


Roles & Responsibilities

  • Define and manage SLIs, SLOs, and error budgets for critical services
  • Design and enhance monitoring, logging, alerting, and tracing capabilities
  • Automate operational processes and improve platform efficiency
  • Participate in incident response, root cause analysis (RCA), and postmortem reviews
  • Support production environments through on-call rotations and reliability initiatives
  • Improve system performance, scalability, availability, and capacity planning
  • Collaborate wit...

Apply for this Position

Ready to join Snapmint? Click the button below to submit your application.

Submit Application