Job Description

  • Experience working with ML platforms such as CML, Spark MLlib, and Python ML libraries (scikit‑learn, XGBoost), including model deployment.
  • Design and develop highly scalable, real-time systems using Hadoop ecosystem components (Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink, and NiFi).
  • Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting to ingest multimodal data (image, audio, video, unstructured documents) in both batch and real-time modes.
  • Develop full‑stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).
  • Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).
  • Tune and optimize the performance of data applications on Hadoop to ensure efficient resource utilization.
  • 6 to 9 years of experience.

...

Apply for this Position

Ready to join Saksoft Pte Limited? Click the button below to submit your application.

Submit Application