Job Description

This role is for one of the Weekday's clients

Location: Hyderabad 

Work Type: 5 Days WFO (On-Site)           
Work Timings: 2 PM – 11 PM 
Type: Full-Time 
Experience: 5+ Years (Senior Level) 
Domain: Digital Adoption Platform (DAP) 
We are looking for a highly skilled, hands-on Senior / Lead DevOps Engineer to own and manage the complete DevOps ecosystem end-to-end for a rapidly scaling B2B SaaS platform. The core product is a Digital Adoption Platform (DAP) delivered via browser extensions that overlay and guide users across enterprise applications. With millions of active users , the platform demands highly scalable, secure, and resilient infrastructure.

This role requires an engineer who can architect, build, optimize, and operate complex cloud environments , scale real-time data pipelines, and support multi-tenant SaaS as well as on-prem enterprise deployments . You will play a critical role in ensuring platform reliability, performance, and security at scale.

If you thrive in ownership-driven environments and enjoy designing infrastructure that supports large-scale SaaS systems, this role is for you.

Requirements

Key Responsibilities

Cloud Infrastructure & Architecture (AWS)

  • Architect, implement, and maintain scalable cloud infrastructure using AWS services including:
    VPC, Route53, CloudFront, S3, EKS, EC2, ALB/ELB, Lambda, RDS, Elasticache, API Gateway, Kinesis Streams, ClickHouse
  • Design and maintain a multi-tenant SaaS architecture supporting millions of active users
  • Optimize and scale high-throughput event pipelines for user activity tracking and analytics processing

DevOps & Automation

  • Own and enhance the complete CI/CD ecosystem using GitHub Actions
  • Build automated deployment pipelines for browser extensions, microservices, backend services, and frontend applications
  • Implement Infrastructure as Code (IaC) using Terraform or CloudFormation
  • Build and maintain Docker images and containerized services
  • Manage Kubernetes workloads using Helm charts

Monitoring, Security & Reliability

  • Implement end-to-end observability: metrics, logs, traces, and alerting
  • Enforce cloud and application security best practices across infrastructure and deployments
  • Ensure high availability through auto-scaling, disaster recovery, and backup strategies
  • Lead security hardening initiatives including CVE remediation, container security, and dependency management

On-Premise Deployments

  • Design and deliver on-prem enterprise deployments that mirror the cloud SaaS architecture
  • Collaborate with enterprise customers to customize deployment models as required
  • Build automation tools and scripts for installation, upgrades, monitoring, and maintenance

Collaboration & Technical Leadership

  • Partner with Engineering, QA, and Product teams to enable reliable and frequent releases
  • Mentor junior DevOps engineers and promote cloud and DevOps best practices across teams
  • Participate in architecture reviews and influence long-term technical decisions

Qualifications

Education

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field

Required Experience & Skills

  • 5+ years of DevOps experience , with at least 3+ years in a senior or lead role
  • Strong expertise in AWS cloud architecture (mandatory)
  • Hands-on experience with:
    • AWS VPC, EC2, RDS, Elasticache
    • Kubernetes (EKS), Docker, Helm
    • S3, CloudFront, Route53
    • API Gateway, Lambda, Kinesis Streams
    • ClickHouse or similar columnar databases
  • Strong CI/CD experience using GitHub Actions
  • Infrastructure as Code using Terraform
  • Proficiency in scripting and automation using Python, Shell, and Node.js
  • Solid understanding of distributed systems, caching, load balancing, and event-driven architectures

Scalability & Performance

  • Proven experience scaling distributed systems for high-traffic, large user bases
  • Hands-on experience designing high-throughput analytics and real-time data pipelines

On-Prem Deployment Experience

  • Demonstrated experience replicating SaaS architectures for on-prem environments
  • Ability to automate both containerized and non-containerized deployments

Other Requirements

  • Strong debugging, troubleshooting, and root-cause analysis skills
  • Ownership mindset with the ability to independently deliver end-to-end solutions
  • Excellent communication and cross-functional collaboration skills

Nice-to-Have Skills

  • Experience with Digital Adoption Platforms or browser-based SaaS products
  • Familiarity with observability tools such as Grafana, Prometheus, Datadog
  • Exposure to SOC 2, ISO, or similar compliance environments

Apply for this Position

Ready to join ? Click the button below to submit your application.

Submit Application