Job Description

Vivasoft is seeking experienced Data Engineers to design, build, and optimize large-scale data pipelines using AWS and PySpark. This role focuses on data reliability, performance, and scalability, working closely with cross-functional teams where the Data Engineer often serves as the primary data expert. The ideal candidate is a self-starter who can operate independently while contributing to a broader data ecosystem.

Responsibilities:

  • Design, build, and optimize large-scale data pipelines using PySpark, SQL, and AWS data services
  • Develop and maintain ETL/ELT workflows with strong focus on data quality, lineage, and auditability
  • Work extensively with AWS Glue, S3, Redshift, and Athena
  • Implement and support data processing using Databricks, SparkSQL, and streaming platforms (e.g., Kafka)
  • Independently triage, repair, and optimize data pipelines, including safe backfills and urgent production fixes
  • Collaborat...

Apply for this Position

Ready to join Vivasoft Limited? Click the button below to submit your application.

Submit Application