Job Description

ROLE OVERVIEW

The Site Reliability Engineer (SRE) is responsible for ensuring the availability, performance, and reliability of production systems hosted on Google Cloud Platform (GCP), with a strong focus on voice and real-time communication services. This role provides L2 production support, actively manages incidents, and drives root cause analysis to prevent recurrence. You will work closely with engineering, network, and operations teams to improve system resilience, automate operational tasks, and meet SLA commitments. The ideal candidate brings a strong mix of cloud reliability engineering and voice/VoIP technical expertise in a live production environment.

SPECIFIC DUTIES AND RESPONSIBILITIES

  • Monitor Production Systems: Use monitoring tools (e.g., Cloud Monitoring) to ensure the health and performance of cloud-based production systems on Google Cloud Platform (GCP).
  • Incident Management: Respond to production incidents, triage issues,...

Apply for this Position

Ready to join Quantrics Enterprises Inc.? Click the button below to submit your application.

Submit Application