Job Description
Senior Site Reliability / Gitops Engineer
Canonical is hiring a Senior Site Reliability / Gitops Engineer to join our Information Systems (IS) team. This role focuses on automation-first IT operations, infrastructure as code, and scalable service delivery across private and public clouds.
Overview
We are a leading provider of open source software and operating systems to the global enterprise. Our platform, Ubuntu, is widely used in enterprise initiatives across public cloud, data science, AI, engineering innovation, and IoT. The company has 1200+ colleagues in 75+ countries with few office-based roles. Teams meet in person a few times yearly to align on strategy and execution. We are hiring to strengthen the IS team that supports and maintains Canonical’s IT production services used by millions of Ubuntu users.
As a Senior SRE & Gitops Engineer you will drive operations automation to the next level, leveraging open source IaC, CI/CD practices, and Canonical products for software operation automation. You will contribute to product improvements by providing feedback to developers and collaborating across teams.
You will be part of a global team of SREs that support Canonical, its customers, and the Ubuntu Community.
Responsibilities
- Drive the development of automation and Gitops in your team as an embedded tech lead
- Collaborate closely with the IS architect to align solutions with the IS architecture vision
- Design and architect services that IS can offer to the organization as products
- Apply IaC to develop infrastructure as code practices within IS, increasing automation and improving IaC processes
- Automate software operations for re-use and consistency across private and public clouds, considering distributed systems complexities
- Maintain operational responsibility for Canonical's core services, networks, and infrastructure
- Develop skills in troubleshooting, capacity planning, and performance investigation; set up, maintain, and use observability tools (Prometheus, Grafana, Elasticsearch) and design/maintain monitoring and alerting
- Provide assistance and collaborate with globally distributed engineering, operations, and support peers
- Have uninterrupted development time for larger projects and automation of manual tasks
- Share knowledge and best practices with team members through design sessions, mentorship, and collaborative work
- Carry final responsibility for time-critical escalations
What we are looking for
- Modern view on hosting architecture driven by infrastructure as code across private and public clouds
- Product-minded approach focused on building products rather than just solutions
- Python software development experience with large projects
- Experience with Kubernetes or other container orchestration systems
- Proven ability to manage and deploy cloud infrastructure with code
- Practical knowledge of Linux networking, routing, and firewalls
- Familiarity with various Linux storage forms (e.g., Ceph, databases)
- Hands-on experience administering enterprise Linux servers
- Extensive knowledge of cloud computing concepts and technologies
- Bachelor’s degree or higher, preferably in computer science or related engineering
- Clear and effective communication in English (email, chat, video/voice, in-person)
- Motivated with strong troubleshooting skills from kernel to web, and willingness to ask for help when needed
- Flexible, quick to learn, and able to adapt to fast-changing environments
- Open to working within distributed teams and a passion for open source, especially Ubuntu or Debian
What we offer
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review and performance-driven bonus or commission
- Additional benefits aligned with our values
- Fair worldwide compensation with local adaptations as needed
- Holiday leave, maternity/paternity leave, and wellness programs
- Team Member Assistance Program & Wellness Platform
- Opportunities to travel to new locations to meet colleagues
About Canonical
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the publisher of Ubuntu, Canonical helps drive AI, IoT, and cloud innovation. We recruit on a global basis and expect excellence. Most colleagues have worked remotely since our inception in 2004. We foster a workplace free from discrimination and provide fair consideration to all applicants.
Seniority level
- Mid-Senior level
Employment type
- Full-time
Job function
- Engineering and Information Technology
- Software Development
Referrals increase your chances of interviewing at Canonical by 2x
Sign in to set job alerts for “Senior Site Reliability Engineer” roles.
We’re unlocking community knowledge in a new way. Experts add insights directly into articles, started with the help of AI.
#J-18808-LjbffrApply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application