Job Description
Candidate should be able to:
Coordinate Development, Integration, and Production deployments.
Optimize Spark code, Impala queries, and Hive partitioning strategy for better scalability, reliability, and performance.
Build applications using Maven, SBT and integrated with continuous integration servers like Jenkins to build jobs.
Execute Hadoop ecosystem and Applications through Apache HUE
Build Machine Learning Algorithms using Spark.
Perform migration from Legacy Databases RDBMS to Hadoop Ecosystem
Create mapping documents to outline data flow from source to target.
Use Cloudera Manager, an end-to-end tool to manage Hadoop operations in Cloudera Cluster
Design and deploy enterprise-wide scalable operations
Work on leading BI technologies like MSTR, Tableau over Hadoop Ecosystem through ODBC/JDBC connection
Perform Performance tuning of Impala queries
Work on hive performance optimizations like using distributed cach...
Coordinate Development, Integration, and Production deployments.
Optimize Spark code, Impala queries, and Hive partitioning strategy for better scalability, reliability, and performance.
Build applications using Maven, SBT and integrated with continuous integration servers like Jenkins to build jobs.
Execute Hadoop ecosystem and Applications through Apache HUE
Build Machine Learning Algorithms using Spark.
Perform migration from Legacy Databases RDBMS to Hadoop Ecosystem
Create mapping documents to outline data flow from source to target.
Use Cloudera Manager, an end-to-end tool to manage Hadoop operations in Cloudera Cluster
Design and deploy enterprise-wide scalable operations
Work on leading BI technologies like MSTR, Tableau over Hadoop Ecosystem through ODBC/JDBC connection
Perform Performance tuning of Impala queries
Work on hive performance optimizations like using distributed cach...
Apply for this Position
Ready to join Anicalls (Pty) Ltd? Click the button below to submit your application.
Submit Application