Job Description
Senior Site Reliability Engineer (SRE)
Summary
We are looking for a Senior Site Reliability Engineer (SRE) to build and operate scalable, reliable, and secure platform infrastructure. The ideal candidate will drive automation, observability, incident management, and cloud-native best practices to improve system reliability and operational excellence across distributed systems.
Roles & Responsibilities
- Define and manage SLIs, SLOs, and error budgets for critical services
- Design and enhance monitoring, logging, alerting, and tracing capabilities
- Automate operational processes and improve platform efficiency
- Participate in incident response, root cause analysis (RCA), and postmortem reviews
- Support production environments through on-call rotations and reliability initiatives
- Improve system performance, scalability, availability, and capacity planning
- Collaborate wit...
Apply for this Position
Ready to join Snapmint? Submit your application.