Job Description
Insight Global is seeking a highly skilled Remote Site Reliability Engineer to join the IT Operations team and maintain the reliability, performance, and availability of the enterprise platforms. In this role, you'll be responsible for monitoring platform health, responding to critical incidents, implementing automation, and continuously improving our observability and reliability tooling. You must have experience with the following tech stack: Prometheus, Grafana, Datadog, Python/PowerShell/Shell/Go, and Azure DevOps. To be successful in this role, you must be self-sufficient and proactive in resolving issues, comfortable working with incomplete details and driving clarity, and able to collaborate effectively with cross-functional teams.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative acti...