Job Description

About the Role

We are looking for a Data Operations Engineer / Technical Data Analyst to ensure the accuracy, reliability, and quality of large-scale web and e-commerce data. This is a hands-on technical role focused on monitoring data pipelines, investigating data issues, and fixing web data extraction logic at the source.

The ideal candidate enjoys debugging, reverse-engineering websites, and working deeply with HTML, JSON, and web scraping logic.


Key Responsibilities

Data Operations & Quality (Daily)

Monitor ETL and data parsing pipelines to ensure data quality and availability

Act as first-level responder for data quality alerts and failed jobs

Investigate data anomalies by comparing system data with live websites

Perform root cause analysis to identify parsing issues, site changes, or source data errors

Data Extraction & Fixes

Develop, modify, and debug web data extraction logic

Write and maintain parsing rules using CSS Selectors, Regex, and JQ

Parse complex and nested HTML and JSON structures

Optimize extraction logic for performance and scalability

Development & Expansion

Handle ad hoc requests to add new data fields

Onboard new websites by building extraction logic from scratch

Run QA jobs and validate data before production release

Maintain documentation and track tasks using Jira

Collaborate with internal stakeholders to clarify requirements and provide updates

Required Skills

Must-Have

Strong experience in Web Scraping / Data Extraction / Data Parsing

Hands-on experience with HTML and JSON

Strong knowledge of CSS Selectors and Regular Expressions (Regex)

Experience using JQ / JSON Query / Jackson JQ

Experience with ETL pipelines and data quality monitoring

Ability to debug production data issues and perform root cause analysis

Experience working with issue-tracking tools such as Jira
