Job Description

**Introduction**

As a Site Reliability Engineer (SRE) in the REIS team, you will ensure the reliability, scalability, and security of edge deployments. You will collaborate with a global team to support a hybrid infrastructure of Linux and Windows VMs, containerized services, and cloud-native tools. This role emphasizes automation, observability, and proactive incident management.


Realtime Edge Infrastructure and Services (REIS) is responsible for deploying, monitoring, and supporting edge computing infrastructure on oil rigs. Our primary software platform, DrillOps, runs on these edge devices and enables real-time automation and optimization of well construction processes.

**Your role and responsibilities**

Key Responsibilities


· Design, implement, and maintain monitoring and observability using Elastic Cloud Stack and Elastic Fleet agents.


· Participate in on-call rotations using PagerDuty for incident response a...

Apply for this Position

Ready to join IBM? Click the button below to submit your application.
