Job Description
- Oversee the reliability and performance of our infrastructure systems to ensure optimal operation.
- Implement and maintain monitoring solutions using Dynatrace to proactively identify and resolve issues.
- Collaborate with development and operations teams to enhance system reliability and performance.
- Provide technical expertise in Site Reliability Engineering (SRE) practices to improve system availability.
- Develop and maintain automation scripts to streamline infrastructure operations and reduce manual intervention.
- Lead incident response efforts to quickly resolve system outages and minimize downtime.
- Conduct root cause analysis of system failures and implement corrective actions to prevent recurrence.
- Monitor system performance metrics and generate reports to provide insights into infrastructure health.
- Ensure compliance with security policies and standard processes in all infrastructure operations.
- Partici...
Apply for this Position
Ready to join Han Digital Solution? Click the button below to submit your application.
Submit Application