Job Description
About the Company
Capgemini Engineering is a global leader in engineering and R&D services, specializing in innovation and technology solutions across various industries.
About the Role
- Proficiency in SQL and database management
- Knowledge of data engineering concepts, including data pipelines, integration, warehousing, and architecture, is beneficial.
- Cloud computing platforms (Azure) and services like Azure Machine Learning can be advantageous.
- Proficiency in version control systems like Git is crucial for effective collaboration and code management.
Responsibilities
- Proficiency in programming languages like Python
- Skills in handling and analysing large datasets, including data cleaning, wrangling, transformation, and exploratory data analysis (EDA) using tools like pandas.
- Knowledge of statistical concepts such as hypothesis testing, regression analysis, ANOVA, experimental design, and probability theory.
- Understanding machine learning algorithms, supervised and unsupervised learning, feature engineering, model selection and evaluation, and hyperparameter tuning is essential, with libraries like scikit-learn, TensorFlow, or PyTorch commonly used. Knowledge of YOLO is desirable
- Data visualisation skills using libraries like Matplotlib, Seaborn, or ggplot
- Familiarity with big data technologies like Apache Hadoop, Apache Spark, or distributed databases enables the processing and analysis of large-scale datasets.
- Proficiency in SQL and database management
- Knowledge of data engineering concepts, including data pipelines, integration, warehousing, and architecture, is beneficial.
- Cloud computing platforms (Azure) and services like Azure Machine Learning can be advantageous.
- Proficiency in version control systems like Git is crucial for effective collaboration and code management.
Qualifications
BE/BTech/MTech
Required Skills
- Proficiency in programming languages like Python
- Skills in handling and analysing large datasets, including data cleaning, wrangling, transformation, and exploratory data analysis (EDA) using tools like pandas.
- Knowledge of statistical concepts such as hypothesis testing, regression analysis, ANOVA, experimental design, and probability theory.
Preferred Skills
- Proficiency in SQL and database management
- Knowledge of data engineering concepts, including data pipelines, integration, warehousing, and architecture, is beneficial.
- Cloud computing platforms (Azure) and services like Azure Machine Learning can be advantageous.
- Proficiency in version control systems like Git is crucial for effective collaboration and code management.
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application