Job Description

**Job Description**



Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.



+ Design, implement, and operate scalable, secure, and highly available infrastructure for cloud and AI-driven applications on OCI.

+ Apply SRE best practices including SLI/SLO definition, error budgets, automated monitoring, incident response, and post-incident reviews.

+ Instrument systems using observability tools (Grafana, Prometheus, APM) to monitor performance, availability, latency, and resource utilization.

+ Lead major incident management, perform...

Apply for this Position

Ready to join Oracle? Click the button below to submit your application.

Submit Application