Site Reliability Engineer)

📍 India, Karnataka, India
Full-time · Computer Occupations · Posted January 23, 2026
Job Description

<p>Role<b>: Senior Software Engineer (SRE Focus) (Site Reliability Engineer)</b></p>
<p>Experience<b>: 7+ Years (Mandatory)</b></p>
<p>Work Location<b>: Bangalore (hybrid: 3 days in office, 2 days work from home)</b></p>
<p>Budget<b>: 35 - 37 LPA</b></p>
<p>Number of Positions<b>: 2 - 3</b></p>
<p><b>Job Description:</b></p>
<p><b>Note: The client requires Go (Golang) experience combined with DevOps. Mandatory skills are highlighted below for reference.</b></p>
<p style="margin-bottom:16px">The Role<br />
We're building a team that owns production incident response, deep debugging, and permanent fixes across application, data, and deployment layers.<br />
This is not a tickets-only ops role. You will write code, ship fixes safely, and harden the platform so issues don't repeat.</p>
<p style="margin-bottom:16px">Note: This is an SRE/software engineering role with real production ownership. You'll combine engineering and operations to own outcomes end-to-end: investigate incidents, ship code fixes, and prevent repeat issues through tests, observability, and hardening.</p>
<p style="margin-bottom:16px">What you'll do<br />
• Lead and execute production incident response: triage, mitigation, stakeholder communication, and coordination across teams<br />
• Debug and fix issues across Go services (mandatory) and the broader stack (Node.js services where relevant)<br />
• Work across service boundaries: GraphQL/RPC, distributed tracing, dependency failures, performance bottlenecks, and safe degradation patterns<br />
• Troubleshoot Kubernetes workloads and deployments<br />
• Diagnose PostgreSQL/CNPG issues<br />
• Handle production bugs that span application and data pipelines (ETL/Snowflake mappings), including backfills/replays and data-quality validation<br />
• Build prevention: add regression tests, improve observability, and maintain runbooks/service passports<br />
• Drive reliability improvements: SLOs/SLIs, alert quality, release readiness checks, and operational standards across teams</p>
<p style="margin-bottom:16px"><b>What we're looking for (must-have)</b><br />
• 7+ years in SRE / Production Engineering / Platform Engineering (reliability-focused)<br />
• Strong Go (mandatory): ability to read, debug, and ship production fixes in Go codebases<br />
• Proven experience debugging distributed systems in production (latency, error rates, timeouts, retries, cascading failures)<br />
• Strong hands-on experience with Kubernetes in production environments<br />
• Experience with Helm and GitOps workflows (FluxCD preferred; ArgoCD acceptable)<br />
• Solid PostgreSQL troubleshooting experience (performance, incident patterns, migrations)<br />
• Observability experience (metrics/logging/tracing; Datadog/Grafana/Tempo/Loki experience is a plus)<br />
• Strong incident leadership: calm under pressure, clear communication, structured problem-solving<br />
• Engineering hygiene: PR discipline, reviews, testing mindset, safe rollouts/rollbacks<br />
• Comfortable with IAM/security fundamentals in real production systems: OAuth2/OIDC basics, RBAC/least privilege, and safe secrets handling</p>
<p style="margin-bottom:16px"><b>Nice-to-have</b><br />
• Node.js backend experience in production<br />
• Experience in FinTech / regulated environments / high-availability systems (auditability, change control, incident rigor)<br />
• Data reliability experience: ETL monitoring, reconciliation, Snowflake operations, schema/mapping drift handling<br />
• Reliability patterns common to trading/fintech platforms: correctness and data integrity mindset (idempotency, reconciliation), resilient partner integrations, and strong observability for critical user journeys</p>
<p><b>Why join</b><br />
• Build a new function with real impact on reliability and engineering culture<br />
• Work across the full production surface area: application + platform + database + data pipelines + the whole product<br />
• High-ownership role: you'll influence production standards, tooling, and release safety across teams.</p>
                    