Job Description

**Job Description**
**Job Summary**
Thank you for your interest in BAE Systems! We re looking for a seasoned Senior Site Reliability Engineer (SRE) to ensure the reliable deployment, operation, and continuous improvement of our digital engineering software tools across BAE Systems factories in North America. The role blends deep technical expertise with strong leadership, guiding crossfunctional teams to keep our missioncritical microservices, monitoring stacks, and data stores healthy and performant.
**Key Responsibilities**

+ Monitor, troubleshoot, and resolve production incidents, ensuring rapid rootcause analysis and longterm fixes.
+ Design, build, and maintain automated deployment pipelines for the digital engineering software suite using asset/inventory management tools.
+ Deploy, configure, and operate the observability stack (Prometheus, Grafana, FluentBit, Loki) to provide realtime metrics, logs, and tracing for all services.
+ Monitor and troublesh...

Apply for this Position

Ready to join BAE Systems? Click the button below to submit your application.

Submit Application