Job Description

We are seeking a highly skilled Data Engineer with strong hands-on experience in Python, PySpark, Apache Spark, Kafka, and Databricks to design, build, and maintain scalable and high-performance data pipelines. The role focuses on developing robust batch and streaming data solutions for large-scale distributed data environments.

Key Responsibilities

  • Design, develop, and maintain batch and real-time data pipelines using PySpark and Apache Spark
  • Build and manage scalable data workflows and jobs on Databricks
  • Develop and optimize Python and SQL code to process large and complex datasets
  • Implement Kafka-based streaming solutions for real-time data ingestion and processing
  • Perform Spark job tuning, performance optimization, and resource management
  • Ensure data reliability, scalability, and efficiency across pipelines
  • Work closely with analytics, product, and engineering teams to support data requirements
  • Monitor, t...

Apply for this Position

Ready to join People Prime Worldwide? Click the button below to submit your application.

Submit Application