Job Description
This role is for one of the Weekday's clients
Location: Hyderabad
Work Type: 5 Days WFO (On-Site)
Work Timings: 2 PM – 11 PM
Type: Full-Time
Experience: 5+ Years (Senior Level)
Domain: Digital Adoption Platform (DAP)
We are looking for a highly skilled, hands-on Senior / Lead DevOps Engineer to own and manage the complete DevOps ecosystem end-to-end for a rapidly scaling B2B SaaS platform. The core product is a Digital Adoption Platform (DAP) delivered via browser extensions that overlay and guide users across enterprise applications. With millions of active users , the platform demands highly scalable, secure, and resilient infrastructure.
This role requires an engineer who can architect, build, optimize, and operate complex cloud environments , scale real-time data pipelines, and support multi-tenant SaaS as well as on-prem enterprise deployments . You will play a critical role in ensuring platform reliability, performance, and security at scale.
If you thrive in ownership-driven environments and enjoy designing infrastructure that supports large-scale SaaS systems, this role is for you.
Requirements
Key Responsibilities
Cloud Infrastructure & Architecture (AWS)
- Architect, implement, and maintain scalable cloud infrastructure using AWS services including:
VPC, Route53, CloudFront, S3, EKS, EC2, ALB/ELB, Lambda, RDS, Elasticache, API Gateway, Kinesis Streams, ClickHouse - Design and maintain a multi-tenant SaaS architecture supporting millions of active users
- Optimize and scale high-throughput event pipelines for user activity tracking and analytics processing
DevOps & Automation
- Own and enhance the complete CI/CD ecosystem using GitHub Actions
- Build automated deployment pipelines for browser extensions, microservices, backend services, and frontend applications
- Implement Infrastructure as Code (IaC) using Terraform or CloudFormation
- Build and maintain Docker images and containerized services
- Manage Kubernetes workloads using Helm charts
Monitoring, Security & Reliability
- Implement end-to-end observability: metrics, logs, traces, and alerting
- Enforce cloud and application security best practices across infrastructure and deployments
- Ensure high availability through auto-scaling, disaster recovery, and backup strategies
- Lead security hardening initiatives including CVE remediation, container security, and dependency management
On-Premise Deployments
- Design and deliver on-prem enterprise deployments that mirror the cloud SaaS architecture
- Collaborate with enterprise customers to customize deployment models as required
- Build automation tools and scripts for installation, upgrades, monitoring, and maintenance
Collaboration & Technical Leadership
- Partner with Engineering, QA, and Product teams to enable reliable and frequent releases
- Mentor junior DevOps engineers and promote cloud and DevOps best practices across teams
- Participate in architecture reviews and influence long-term technical decisions
Qualifications
Education
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
Required Experience & Skills
- 5+ years of DevOps experience , with at least 3+ years in a senior or lead role
- Strong expertise in AWS cloud architecture (mandatory)
- Hands-on experience with:
- AWS VPC, EC2, RDS, Elasticache
- Kubernetes (EKS), Docker, Helm
- S3, CloudFront, Route53
- API Gateway, Lambda, Kinesis Streams
- ClickHouse or similar columnar databases
- Strong CI/CD experience using GitHub Actions
- Infrastructure as Code using Terraform
- Proficiency in scripting and automation using Python, Shell, and Node.js
- Solid understanding of distributed systems, caching, load balancing, and event-driven architectures
Scalability & Performance
- Proven experience scaling distributed systems for high-traffic, large user bases
- Hands-on experience designing high-throughput analytics and real-time data pipelines
On-Prem Deployment Experience
- Demonstrated experience replicating SaaS architectures for on-prem environments
- Ability to automate both containerized and non-containerized deployments
Other Requirements
- Strong debugging, troubleshooting, and root-cause analysis skills
- Ownership mindset with the ability to independently deliver end-to-end solutions
- Excellent communication and cross-functional collaboration skills
Nice-to-Have Skills
- Experience with Digital Adoption Platforms or browser-based SaaS products
- Familiarity with observability tools such as Grafana, Prometheus, Datadog
- Exposure to SOC 2, ISO, or similar compliance environments
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application