Job Description
Vivasoft is seeking experienced Data Engineers to design, build, and optimize large-scale data pipelines using AWS and PySpark. This role focuses on data reliability, performance, and scalability, working closely with cross-functional teams where the Data Engineer often serves as the primary data expert. The ideal candidate is a self-starter who can operate independently while contributing to a broader data ecosystem.
Responsibilities:
- Design, build, and optimize large-scale data pipelines using PySpark, SQL, and AWS data services
- Develop and maintain ETL/ELT workflows with strong focus on data quality, lineage, and auditability
- Work extensively with AWS Glue, S3, Redshift, and Athena
- Implement and support data processing using Databricks, SparkSQL, and streaming platforms (e.g., Kafka)
- Independently triage, repair, and optimize data pipelines, including safe backfills and urgent production fixes
- Collaborat...
Apply for this Position
Ready to join Vivasoft Limited? Click the button below to submit your application.
Submit Application