Job Description
Senior Site Reliability Engineer (SRE)

Summary
We are looking for a Senior Site Reliability Engineer (SRE) to build and operate scalable, reliable, and secure platform infrastructure. The ideal candidate will drive automation, observability, incident management, and cloud-native best practices to improve system reliability and operational excellence across distributed systems.

Roles & Responsibilities
Define and manage SLIs, SLOs, and error budgets for critical services
Design and enhance monitoring, logging, alerting, and tracing capabilities
Automate operational processes and improve platform efficiency
Participate in incident response, root cause analysis (RCA), and postmortem reviews
Support production environments through on-call rotations and reliability initiatives
Improve system performance, scalability, availability, and capacity planning
Collaborate with engineering teams to enhance application resiliency and operational readiness
Drive adoption of Infrastructure as Code (IaC...
Apply for this Position
Ready to join Snapmint? Click the button below to submit your application.