Job Description
Key Responsibilities
- Collaborate with cross-functional application and platform teams to understand system value chains and enhance observability practices
- Define and implement end-to-end monitoring, logging, tracing, and alerting strategies for distributed and microservices-based architectures
- Establish and enforce best practices for observability standards across application teams
- Evaluate, select, and implement observability tools aligned with organizational and project goals
- Continuously assess industry trends and adopt emerging observability technologies and best practices
- Identify system performance bottlenecks and work with development and operations teams to resolve them
- Conduct regular performance reviews and implement optimizations to improve reliability and responsiveness
- Support incident response by diagnosing production issues using observability data
- Develop and maintain incident response playbooks to streamline troubleshooting processes
- Contribute to DevOps initiatives including tool consolidation, CI/CD enablement, Agile work management, and continuous delivery
Skills Required
Monitoring, Logging, tracing, Distributed Systems, Dynatrace
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application