Job Description
We are seeking an experienced Data Engineer (PySpark) to design, build, optimize, and maintain scalable data pipelines for production environments. The role requires strong hands-on experience in big data processing, pipeline optimization, and deployment using modern data engineering tools and frameworks.
Key Responsibilities
Design, develop, and maintain robust, scalable data pipelines using Python and PySpark
Perform data ingestion, transformation, cleansing, and validation across structured and unstructured datasets
Conduct Exploratory Data Analysis (EDA) to identify data patterns, anomalies, and quality issues
Apply data imputation techniques, data linking, and cleansing to ensure high data quality
Implement feature engineering pipelines to support analytics and downstream use cases
Optimize Spark jobs for performance, scalability, and cost ...
Apply for this Position
Ready to join Black Pearl? Click the button below to submit your application.
Submit Application