Job Description
Location: Remote
Experience: 5 to 7 years
Role: Database Reliability Engineer
We are seeking a Database Reliability Engineer (DRE) with strong software engineering
and cloud database infrastructure experience to support and evolve our global AWS-based
database platforms. This is a contract role for a hands-on engineer passionate about
automation, reliability, and scalability — with deep expertise in Aurora PostgreSQL and
modern cloud database architectures.
Key Responsibilities
- Design, implement, and maintain highly available and performant Aurora
PostgreSQL systems in AWS.
- Build tools and scripts (Python, SQL) to automate upgrades or migrate databases
from 1 cluster/aws account to another with near zero downtime using DMS; create
contribute to terraform modules for DMS infrastructure deployment/management .
- Implement CI/CD pipelines workflows for repeatable, automated rebuild
deployments.
- Partner with SREs and service teams to design database solutions that meet
application performance and compliance requirements.
- Troubleshoot complex production database issues by analyzing logs, query plans,
metrics, and system-level behavior.
- Optimize schema design, queries, and indexes to ensure efficiency and scalability.
- Document operational runbooks, architecture decisions, and engineering best
practices.
- Contribute to a culture of reliability, continuous improvement, and technical
excellence.
Required Qualifications
- 5–7 years of hands-on experience managing large-scale cloud-native databases,
preferably Aurora PostgreSQL.
- Strong understanding of PostgreSQL internals, replication, and performance tuning.
- Experience supporting multi-region or zero-downtime database migrations or
upgrades using DMS.
- Proficiency in Python for automation and SQL for database development and
optimization.
- Experience with AWS database services (Aurora Postgresql & DMS).
- Solid grasp of CI/CD pipelines, Infrastructure-as-Code(Terraform), and
containers (Docker).
- Strong problem-solving and communication skills; ability to collaborate with
distributed teams.
Nice to Have
Familiarity with Elasticache Valkey/Redis, DynamoDB, or Keyspaces.
Exposure to performance testing platforms, APMs, and observability tooling
(e.g., Datadog, Prometheus, Grafana).
Experience contributing to engineering design reviews, mentoring peers, and
improving team-level processes.
Proven experience with monitoring, alerting, and incident management in
production environments.
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application