Job Description

**Introduction**
At IBM Software, we transform client challenges into solutions, building the world’s leading AI-powered, cloud-native products that shape the future of business and society. Our legacy of innovation creates endless opportunities for IBMers to learn, grow, and make an impact on a global scale. Working in Software means joining a team fueled by curiosity and collaboration. You’ll work with diverse technologies, partners, and industries to design, develop, and deliver solutions that power digital transformation. With a culture that values innovation, growth, and continuous learning, IBM Software places you at the heart of IBM’s product and technology landscape. Here, you’ll have the tools and opportunities to advance your career while creating software that changes the world.
**Your role and responsibilities**
Key Responsibilities

Infrastructure Automation

Design, implement, and maintain Infrastructure as Code (IaC) solutions using tools such as Terraform to automate infrastructure provisioning and management.

CI/CD, Deployment & Orchestration

Build and manage CI/CD frameworks, platforms, and pipelines (Jenkins, ArgoCD) for infrastructure and application deployments, ensuring seamless and safe delivery across environments.

Troubleshoot infrastructure-level and product deployment-level issues.

Deploy, manage, and monitor containerized applications using Docker and Kubernetes (preferably OpenShift).

Ensure system reliability, scalability, and performance across distributed environments.

Manage infrastructure across hybrid cloud environments (e.g., AWS, Azure), ensuring consistency, scalability, and security.

Monitoring & Observability

Develop automation for monitoring setup, alerting, and self-healing/fault response, integrating with tools such as Prometheus and Grafana.

Documentation & Best Practices

Create and maintain comprehensive documentation for automation workflows, infrastructure patterns, and runbooks; contribute to developing cloud engineering best practices.
**Required technical and professional expertise**
6+ years of experience in infrastructure software engineering, systems engineering, or reliability engineering roles.

Infrastructure as Code: Strong hands-on experience with Terraform, Pulumi, CRDs, or similar tools.

Proficiency in a programming language (Go or Python).

Experience with container management and orchestration (Docker, Kubernetes, Helm).

Working knowledge of CI/CD pipelines, version control (Git), and related tooling.

Observability as Code: Good knowledge of monitoring, logging, and alerting stacks.

Solid understanding of networking, security, and troubleshooting.

Strong troubleshooting skills and a passion for automation and process improvement.

Excellent communication skills and ability to work in a collaborative team environment.

Exposure to hybrid/multi-cloud architectures.

Background in performance tuning, cost optimisation, or incident response automation.
**Preferred technical and professional experience**
Knowledge of Python for scripting and automation.

Experience working with public cloud platforms (IBM Cloud, AWS, Azure, or GCP).

Experience with Ansible or other configuration management tools.

IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
