Job Description
We are seeking a highly skilled Data Engineer with strong hands-on experience in Python, PySpark, Apache Spark, Kafka, and Databricks to design, build, and maintain scalable and high-performance data pipelines. The role focuses on developing robust batch and streaming data solutions for large-scale distributed data environments.
Key Responsibilities
- Design, develop, and maintain batch and real-time data pipelines using PySpark and Apache Spark
- Build and manage scalable data workflows and jobs on Databricks
- Develop and optimize Python and SQL code to process large and complex datasets
- Implement Kafka-based streaming solutions for real-time data ingestion and processing
- Perform Spark job tuning, performance optimization, and resource management
- Ensure data reliability, scalability, and efficiency across pipelines
- Work closely with analytics, product, and engineering teams to support data requirements
- Monitor, t...
Apply for this Position
Ready to join People Prime Worldwide? Click the button below to submit your application.
Submit Application