Job Description
About the Role
We are looking for a Data Operations Engineer / Technical Data Analyst to ensure the accuracy, reliability, and quality of large-scale web and e-commerce data. This is a hands-on technical role focused on monitoring data pipelines, investigating data issues, and fixing web data extraction logic at the source.
The ideal candidate enjoys debugging, reverse-engineering websites, and working deeply with HTML, JSON, and web scraping logic.
Key Responsibilities
Data Operations & Quality (Daily)
Monitor ETL and data parsing pipelines to ensure data quality and availability
Act as first-level responder for data quality alerts and failed jobs
Investigate data anomalies by comparing system data with live websites
Perform root cause analysis to identify parsing issues, site changes, or source data errors
Data Extraction & Fixes
Develop, modify, and debug web data extraction logic
Write and maintain parsing rules using CSS Selectors, Regex, and JQ
Parse complex and nested HTML and JSON structures
Optimize extraction logic for performance and scalability
Development & Expansion
Handle ad-hoc requests for adding new data fields
Onboard new websites by building extraction logic from scratch
Run QA jobs and validate data before production release
Maintain documentation and track tasks using Jira
Collaborate with internal stakeholders to clarify requirements and provide updates
Required Skills
Must-Have
Strong experience in Web Scraping / Data Extraction / Data Parsing
Hands-on experience with HTML and JSON
Strong knowledge of CSS Selectors and Regular Expressions (Regex)
Experience using JQ / JSON Query / Jackson JQ
Experience with ETL pipelines and data quality monitoring
Ability to debug production data issues and perform root cause analysis
Experience working with issue tracking tools like Jira
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application