Job Description

We Looking For: An AI-driven Site Reliability Engineer who can turn ML insights into automated infrastructure actions.

Role Purpose: To operationalize and automate the Right‑Sizing platform’s recommendations, ensuring safe, reliable, and scalable adoption of cost optimization measures across Client’s cloud and on‑prem environments.

Key Responsibilities

  • Translate ML/forecasting model outputs into actionable system changes.
  • Implement automation to adjust Kubernetes deployment configs, VM allocations, and storage sizing.
  • Develop rollback and alerting mechanisms for failed or unsafe right‑sizing actions.
  • Build CI/CD pipelines to continuously integrate and deploy updated capacity recommendations.
  • Ensure compliance with OCC and internal governance requirements when applying changes.
  • Collaborate with Data/ML engineers to fine‑tune recommendations before production rollout.
  • Document automa...

Apply for this Position

Ready to join iVedha Inc.? Click the button below to submit your application.

Submit Application