Job Description

Role : Senior Software Engineer (SRE Focus) (Site Reliability Engineer)

Experience : 7+ Years (Mandatory)

Work Location : Bangalore (Work from office with 3 days office and 2 days’ Work From Home)

Number of Positions : 2 - 3 Positions

PFB Job Description :

Note: Client need Go Lang experience with Devops.

Mandatory

7+ years in SRE / Production Engineering / Platform Engineering (reliability-focused)

Strong Go (mandatory): ability to read, debug, and ship production fixes in Go codebases

Strong hands-on experience with Kubernetes in production environments

Experience with Helm and GitOps workflows (FluxCD preferred;
ArgoCD acceptable)

Solid PostgreSQL troubleshooting experience (performance, incident patterns, migrations)


The Role

We're building a team that owns production incident response, deep debugging, and permanent fixes across application, data, and deployment layers.

This is not a tickets-only ops role. You will write code, ship fixes safely, and harden the platform so issues don't repeat.


Note: This is a SRE/software engineering role with real production ownership. You’ll combine engineering and operations to own outcomes end-to-end: investigate incidents, ship code fixes, and prevent repeat issues through tests, observability, and hardening.


What you'll do

· Lead and execute production incident response: triage, mitigation, stakeholder communication, and coordination across teams

· Debug and fix issues across Go services (mandatory) and the broader stack (Node.Js services where relevant)

· Work across service boundaries: GraphQL/RPC, distributed tracing, dependency failures, performance bottlenecks, and safe degradation patterns

· Troubleshoot Kubernetes workloads and deployments

· Diagnose PostgreSQL/CNPG issues

· Handle production bugs that span application + data pipelines (ETL/Snowflake mappings), including backfills/replays and data-quality validation

· Build prevention: add regression tests, improve observability, and maintain runbooks/service passports

· Drive reliability improvements: SLOs/SLIs, alert quality, release readiness checks, and operational standards across teams


What we're looking for (must-have)

· 7+ years in SRE / Production Engineering / Platform Engineering (reliability-focused)

· Strong Go (mandatory): ability to read, debug, and ship production fixes in Go codebases

· Proven experience debugging distributed systems in production (latency, error rates, timeouts, retries, cascading failures)

· Strong hands-on experience with Kubernetes in production environments

· Experience with Helm and GitOps workflows (FluxCD preferred;
ArgoCD acceptable)

· Solid PostgreSQL troubleshooting experience (performance, incident patterns, migrations)

· Observability experience (metrics/logging/tracing;
Datadog/Grafana/Tempo/Loki experience is a plus)

· Strong incident leadership: calm under pressure, clear communication, structured problem-solving

· Engineering hygiene: PR discipline, reviews, testing mindset, safe rollouts/rollbacks

· Comfortable with IAM/security fundamentals in real production systems: OAuth2/OIDC basics, RBAC/least privilege, and safe secrets handling


Nice-to-have

· Node.Js backend experience in production

· Experience in FinTech / regulated environments / high-availability systems (auditability, change control, incident rigor)

· Data reliability experience: ETL monitoring, reconciliation, Snowflake operations, schema/mapping drift handling

· Reliability patterns common to trading/fintech platforms: correctness and data integrity mindset (idempotency, reconciliation), resilient partner integrations, and strong observability for critical user journeys


Why join

· Build a new function with real impact on reliability and engineering culture

· Work across the full production surface area: application + platform + database + data pipelines + Our Whole Product

· High ownership role: you'll influence production standards, tooling, and release safety across teams.

Apply for this Position

Ready to join ? Click the button below to submit your application.

Submit Application