Job Description

We are seeking an experienced Senior Data Engineer with expert-level skills in PySpark and hands-on experience building ETL pipelines, data lake architectures, and data feed integrations on AWS to join our team. You will work with both structured and unstructured data, ingesting from multiple on-premises and enterprise data sources such as SAP, Intelex, SQL, and OSI PI into AWS. This role offers the opportunity to contribute to large-scale data solutions and collaborate with cross-functional teams in a dynamic environment.

Responsibilities

  • Design, develop, and optimize ETL pipelines using PySpark and AWS Glue Jobs to process large volumes of structured and unstructured data
  • Orchestrate data workflows with Apache Airflow, ensuring reliable scheduling, dependency management, and robust error handling
  • Build and maintain data feeds from on-premises and enterprise systems into AWS data lake environments
  • Integrate with enterprise data sou...

Apply for this Position

Ready to join EPAM Systems? Click the button below to submit your application.

Submit Application