Job Description

Site Reliability Engineers (SREs) are essential to PandaDoc's success, ensuring customers receive a reliable service with minimal downtime.

The SRE team achieves this by:

  • Owning the incident management processes and tools.
  • Managing the observability stack and alerting systems to enable timely investigation and mitigation.
  • Actively contributing to service codebases to proactively prevent incidents and resolve performance bottlenecks.
  • In essence, SREs are the cornerstone of production service resiliency, driving efforts in observability, incident management, capacity planning, and maintaining reliable operations. 

    In this role, you will:

  • Own and influence the incident management process end-to-end
  • Maintain and evolve on-prem observability stack
  • Keep production applications running smoothly by participating in the on-call rotation
  • Develop automations and tools to support platform reliability
  • Apply for this Position

    Ready to join PandaDoc? Click the button below to submit your application.

    Submit Application