Job Description

SRE – Site Reliability Engineer

We are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).

Responsibilities

  • Perform L1.5 activities such as monitoring, deployment, rollback.
  • Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage.
  • Troubleshoot Azure resources, and elevate to Level 3 (Software Development Team).

Qualifications

  • Understand the Microsoft Azure Cloud - ideally Azure Fundamentals certified OR Computer Science/Information Systems Management degree.
  • Familiar with PaaS and IaaS - VMs, Storage, EventHub, Service Fabric Cluster (SFC), Azure Kubernetes Service (AKS), CosmosDB, SQL Server, IoT Hub, Databricks, KeyVault, Datalake. Understand the concept of Internet of Things (IoT) - telemetry, ingestion, processing, data storage, reporting.
  • Understand the concept tools - Octopus, Bamboo, Terraform, Azure DevOps, Jenkins, Github, Ansible. Understand the concept of container orchestration platforms (e.g. Kubernetes). Understand the concept of scripts: Powershell, Python. Understand the difference between NoSQL and SQL databases, and how to maintain them. Understand monitoring and logging systems (LogAnalytics, Splunk, ELK, Prometheus, Nagios, Zabbix, etc.).
  • Independent thinker - why does it break, what can I proactively do to fix it.

#J-18808-Ljbffr

Apply for this Position

Ready to join ? Click the button below to submit your application.

Submit Application