Job Description
Overall Responsibilities:
Design, implement, and maintain scalable, reliable infrastructureAutomate deployment, scaling, and management of applications and servicesMonitor system health and troubleshoot issues proactivelyParticipate in on-call rotations to ensure uptime and incident managementDevelop runbooks, best practices, and automation scriptsCollaborate with development teams to improve system architecture and reliabilityConduct performance tuning and capacity planningImprove observability and monitoring across the stackDocument operational procedures and incident post-mortemsSoftware Requirements:
Strong experience with cloud platforms such as AWS, GCP, or AzureProficiency in Linux/Unix system administrationKnowledge of scripting languages: Python, Bash, or GoExperience with Infrastructure as Code (IaC) tools like Terraform, CloudFormationFamiliarity with container orchestration: Kubernetes, Docker SwarmMonitoring and alerting tools: Prometheus, Grafana, Nagios, DatadogConfiguration management tools: Ansible, Chef, PuppetCI/CD pipelines setup and management (Jenkins, GitLab CI, CircleCI)Log management and analysis: ELK Stack (Elasticsearch, Logstash, Kibana)Category-wise Technical Skills:
Cloud Platforms: AWS, GCP, AzureContainerization and Orchestration: Docker, KubernetesScripting & Automation: Python, Bash, GoInfrastructure as Code: Terraform, CloudFormationMonitoring & Logging: Prometheus, Grafana, Nagios, ELK StackConfiguration Management: Ansible, Chef, PuppetCI/CD Tools: Jenkins, GitLab CI, CircleCIOperating Systems: Linux/Unix administration skillsNetworking & Security: VPNs, Load balancers, firewalls, SSL/TLSExperience:
5+ years of experience in SRE, DevOps, or infrastructure engineering rolesDay-to-Day Activities:
Managing and scaling cloud infrastructure and servicesMonitoring system health, alerting, and incident responseAutomating deployment, updates, and infrastructure provisioningPerforming capacity planning and performance tuningTroubleshooting and resolving outages or performance issuesCollaborating with development teams to improve architecture and resilienceConducting post-incident reviews and implementing preventative measuresMaintaining documentation for infrastructure and processesQualifications:
Bachelor's or Master’s degree in Computer Science, Information Technology, or related fieldsProven experience working with cloud providers and infrastructure automation toolsRelevant certifications in cloud platforms (AWS Certified Solutions Architect, GCP Professional Cloud Architect, etc.) are preferredSoft Skills:
Strong analytical and problem-solving skillsExcellent communication and collaboration skillsAbility to work under pressure and handle incidents calmlyProactive mindset with a focus on automation and efficiencyEagerness to learn new tools and technologiesGood organizational and time-management skillsDiversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.
All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application