Job Description
We Looking For: An AI-driven Site Reliability Engineer who can turn ML insights into automated infrastructure actions.
Role Purpose: To operationalize and automate the Right‑Sizing platform’s recommendations, ensuring safe, reliable, and scalable adoption of cost optimization measures across Client’s cloud and on‑prem environments.
Key Responsibilities
- Translate ML/forecasting model outputs into actionable system changes.
- Implement automation to adjust Kubernetes deployment configs, VM allocations, and storage sizing.
- Develop rollback and alerting mechanisms for failed or unsafe right‑sizing actions.
- Build CI/CD pipelines to continuously integrate and deploy updated capacity recommendations.
- Ensure compliance with OCC and internal governance requirements when applying changes.
- Collaborate with Data/ML engineers to fine‑tune recommendations before production rollout.
- Document automa...
Apply for this Position
Ready to join iVedha Inc.? Click the button below to submit your application.
Submit Application