Job Description
We are seeking a Data Engineer with 3 or more years of hands-on experience in data cleaning, transformation, and analysis using Python. The ideal candidate is comfortable working with large, messy datasets, has exposure to modern data technologies, and brings a strong analytical mindset. Experience with machine learning and LLMs is a strong plus.
Key Responsibilities
- Clean, preprocess, and transform structured and unstructured data using Python
- Perform exploratory data analysis (EDA) to uncover insights and trends
- Build reusable data pipelines and feature engineering workflows
- Work with SQL and/or cloud-based data warehouses to extract and prepare data
- Collaborate with stakeholders to translate business problems into data-driven solutions
- Develop and maintain analytical models and dashboards
- Apply basic to intermediate machine learning techniques where applicable
- Experiment with and support LLM-based solutions (prompting, embeddings, APIs) as needed
- Ensure data quality, reliability, and documentation
Required Skills & Qualifications
- 3+ years of experience as a Data Engineer / Data Analyst
- Strong proficiency in Python for data manipulation and analysis
  - Pandas, NumPy, SciPy
- Solid understanding of data cleaning, transformation, and feature engineering
- Experience with SQL (PostgreSQL, MySQL, BigQuery, Snowflake, etc.)
- Familiarity with data visualization tools
  - Matplotlib, Seaborn, Plotly, or Power BI/Tableau
- Understanding of statistics and data analysis fundamentals
- Experience working with APIs and external data sources
- Strong problem-solving and communication skills
Preferred Tech Stack
- Python (3.x)
- Pandas, NumPy, Scikit-learn
- Jupyter, VS Code
- Git / GitHub
- Cloud platforms: AWS / Azure / GCP
- Data tools: Airflow, dbt, Spark (basic exposure)
- Containerization: Docker (nice to have)
Good to Have
- Hands-on experience with machine learning models
  - Regression, classification, clustering, time series
- Exposure to LLMs and Generative AI
  - OpenAI / Azure OpenAI APIs
  - Prompt engineering
  - Embeddings, vector databases (FAISS, Pinecone, Chroma)
- Experience with NLP or text analytics
- Knowledge of MLOps basics (model versioning, monitoring)