Job Description

We are seeking a technical Azure Operations Engineer with a customer operations mindset to join our Public Cloud Services - DevOps & Automation team. The right candidate will be passionate about driving a cloud mindset and will lead operations and automation to deploy, support, and troubleshoot distributed services and applications on Azure infrastructure.



Lead technical efforts in DevOps, IaC and automation, providing end-to-end ownership and support for critical Azure services.



Manage and resolve a high volume of ServiceNow tickets. Perform expert triage to ensure accurate priority levels (High/Medium/Low) and timely resolution.



Develop automation runbooks for scheduled shutdowns of hybrid workers and other resources to drive measurable cost savings.



Enhance Tekton clusters, GitHub repository migrations, and automated subscription lifecycle management.



Remediate vulnerabilities, enable Managed Identities on Function Apps, and apply hardened Linux OS policies and secure communication protocols to production environments.



Lead migration planning and execution for China 21V, including Automation and Storage account migrations.



Develop Dynatrace synthetic monitoring and complex JSON configurations for ARM VM observability to ensure proactive alerting and system health.



Provide Knowledge Transfer (KT) sessions to junior engineers to maintain team technical standards.



Manage and implement cloud-automation solutions like Exception Frameworks, Cloud Health, Quota Management, and Cost Optimization tools to supply self-service capabilities and remove toil.



Ensure all key services implement metrics, are monitored



+ B.E / B.Tech. degree in Computer Science or equivalent.

+ 10 years of total experience in IT and IT operations.

+ Azure Administrator Associate (AZ-104) or Azure Security Engineer (AZ-500) certification (preferred).

+ 5+ years of software development experience, including experience in a technical leadership role.

+ Prior experience being part of Cloud Operations with a strong focus on Cloud Security (vulnerability remediation and threat protection).

+ Prior networking background/experience (familiarity with VNETs, NSGs, ExpressRoute, etc.).

+ Proficient in programming with Python, Go, or Java.

+ 3+ years of hands-on experience in IaC languages like Terraform and Ansible.

+ 2+ years of experience in designing and building CI/CD pipelines for IaC and microservices (e.g., Tekton, Azure DevOps, or GitHub Actions).

+ 2+ years of hands-on experience in building SRE platforms using Dynatrace or Azure Monitor/Log Analytics.

+ Experience in Linux, virtualization, and cloud security tools.

+ DevOps mindset: You are familiar with Site Reliability Engineering (SRE) concepts. You treat operational issues as if they are software problems. You view software as a primary tool to manage, fix, and extend systems.

+ Automation: You use automation, data analysis, and proactive monitoring to ensure high availability (HA) and remove manual toil from the system.

+ Problem Solver: You love tackling the most difficult challenges, identifying root causes, and implementing solutions to prevent recurrence.



**Requisition ID** : 57387
