Job Description
<p>Key Responsibilities<br /> OMS Reliability: Maintain, monitor, and improve the performance of Sterling OMS or similar order management platforms to meet strict SLAs.<br /> Automation & Scripting: Develop scripts and automation tools to reduce operational toil, automate deployments, and streamline incident responses.<br /> Incident Management: Lead root cause analysis (RCA) and post-mortems for production incidents, applying fixes to prevent recurrence.<br /> Monitoring & Observability: Implement proactive monitoring, logging, and tracing to gain insights into system health and user experience.<br /> System Optimization: Conduct capacity planning and performance tuning to ensure the system handles peak order volumes efficiently.<br /> Collaboration: Work with development teams to ensure new features are reliable and deployable.<br /> Required Skills & Qualifications<br /> Technical Expertise...
Apply for this Position
Ready to join Programmers.io? Click the button below to submit your application.