Job Description
Description
:
Essential Responsibilities:
Take ownership of system performance monitoring, identify inefficiencies, and lead initiatives to improve the overall availability and reliability of digital platforms and applications.Lead and manage the response to complex, high-priority incidents, ensuring prompt resolution and a thorough root cause analysis to prevent future occurrences.Design and implement advanced automation frameworks to improve operational efficiency, streamline processes, and reduce human error.Lead reliability-focused initiatives, ensuring systems are highly available, resilient, and scalable, and promote best practices across engineering teams.Enhance the monitoring infrastructure by identifying key metrics, optimizing alerting, and improving system observability to ensure the reliability of large-scale systems.Forecast resource requirements and lead capacity planning activities to ens...
Apply for this Position
Ready to join PayPal? Click the button below to submit your application.
Submit Application