Job Description
Key Responsibilities
- Operate, monitor, and troubleshoot Linux and Kubernetes platforms
- Perform Day-2 operational support, incident response, and root cause analysis
- Work with monitoring tools to improve system visibility and reliability and Support security, patching, and compliance activities
- Collaborate with cross-functional teams using ITSM processes (ServiceNow) and continuously improve operational processes and platform stability.
Key Requirements:
- Bachelor's degree in computer science, Information Technology or a related field.
- Strong hands-on experience with Linux and Kubernetes (mandatory)
- Hands-on Kubernetes troubleshooting skills, including Pod failures, CrashLoopBackOff, networking, storage, and resource issues, Node-level and cluster-level problem analysis
- Experience with DevOps monitoring and observability tools (metrics, logs, alerts)
- Working knowledge of ServiceNow for incident, problem, and change management
- Windows System Administration experience (AD, patching, services, basic troubleshooting)
- Basic understanding of Network Administration, including: TCP/IP, DNS, routing, firewalls, load balancing
- Knowledge of Cyber Security fundamentals, such as: Access control, patching, vulnerability awareness, and security best practices
- Exposure to or working knowledge of Google AI / ML services or platforms.
Good to Have :
- Certified Kubernetes Administrator (CKA) or equivalent Kubernetes certification.
- Experience supporting Kubernetes in production or mission-critical environments
- Familiarity with cloud platforms (GCP preferred)
- Experience with automation, scripting, or infrastructure-as-code tools
- Understanding of IT operations, SRE, or Day-2 managed services environments
Apply, please kindly email your updated resume toΒ
Only shortlisted applicants will be notified.
APBA TG Human Resource Pte Ltd (14C7275) || Akshya R (R
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application