Job Description
Mandatory:<br />
Strong proficiency in <b>Linux and Windows server environments</b><br />
Deep understanding of networking protocols (<b>TCP/IP, DNS, DHCP, VPN, routing</b>)<br />
Experience with <b>CI/CD tools</b> (e.g., Jenkins, GitLab CI), <b>monitoring platforms</b> (e.g., Prometheus, Grafana), and <b>infrastructure as code</b> (e.g., Terraform, Ansible)<br />
<b>Scripting skills</b> (e.g., Bash, PowerShell, Python)<br />
Experience with asset tracking tools and inventory management systems<br />
Preferred:<br />
Familiarity with <b>cloud platforms</b> (AWS, Azure, GCP) and containerization (Docker, Kubernetes) <ul> <li>Monitoring tools</li> </ul>
Responsibilities:<br />
Maintain and optimize server operating systems and applications in production environments<br />
Monitor and manage network capacity, performance, and reliability<br />
Integrate new products and services into existing server and network infrastructure<br />
Troubleshoot and resolve infrastructure issues across systems and networks<br />
Ensure compliance with security policies and operational standards<br />
Maintain accurate records of hardware, software, and licensing assets across staging and production environments<br />
Deploy SMOPs for production implementation of OS upgrades and patching, as well as application server upgrades<br />
Assist in the design and development of automation for CI/CD, testing, deployment, and monitoring<br />
Enhance system reliability, scalability, and performance in collaboration with development teams<br />
Own and evolve observability practices, including logging, metrics, and alerting<br />
Drive incident response processes and foster a postmortem culture to improve system resilience<br />
Support onboarding and enablement of engineering teams with platform tooling and documentation<br />