Job Description
Job Summary :
Experienced Systems Administrator with a strong foundation in Linux, infrastructure management, and incident response, skilled in monitoring, troubleshooting, and maintaining reliable systems across virtualized and cloud-based environments.
Responsibilities :
- Collaborate with the operations team to manage escalations and oversee incident management.
- Implement strategies and solutions to enhance daily operations, including system stability, security, and scalability.
- Drive real-time monitoring of system performance and capacity, addressing alerts and optimizing systems.
- Lead troubleshooting efforts, coordinating responses to network and system issues.
- Conduct and oversee server, application, and network equipment setup and maintenance.
- Ensure effective outage notification and escalation for prompt resolution.
- Mentor and train team members on technical skills and troubleshooting methods.
- Maintain up-to-date documentation of processes and procedures in the team wiki.
Key Skills :
- Familiarity with data center technologies and cloud platforms (AWS/GCP).
- Application deployment with Git, StackStorm, etc.
- Strong troubleshooting skills across networks and systems; familiarity with network protocols (TCP/IP, UDP, ICMP) and tools like tcpdump.
- Advanced diagnostic skills in network performance and system capacity monitoring.
- Proficient in Linux command-line and system administration.
- Analytical skills with an ability to interpret and act on data.
- Ability to prioritize and escalate issues effectively.
- Adaptability to shift work and capacity for multitasking in high-pressure scenarios.
- Excellent leadership, communication, and interpersonal skills.
Qualifications :
- Bachelor’s degree in Computer Science, Engineering (BE/B.Tech), MCA, or M.Sc (IT).
Must-Have :
- Configuration Management : Basic experience with Ansible, SaltStack, StackStorm, or similar.
- CI/CD : Basic experience with Jenkins or similar.
- Monitoring : Experience with Nagios, Sensu, Zabbix, or similar.
- Log Analytics : Basic experience with Splunk, Elasticsearch, Sumo Logic, Prometheus, Grafana, or similar.
- Virtualization : VMware, KVM, or similar.
- Linux & Networking : Strong fundamentals in Linux, troubleshooting, and networking.
- Containerization : Knowledge of Kubernetes, Rancher, or similar.
Good to Have :
- Cloud Providers : AWS or GCP.
- Networking : Advanced knowledge of BGP, F5 load balancers, and switching protocols.
- Certifications : RHCSA, CCNA, or equivalent.