Job Description
Health Care Data Engineer Job Description
Mission
Systematic collection of annual public hospital statistics from official sources
(websites, activity reports, national databases). The mission includes auditing, data
conversion to customer's standardized format, harmonization, and quality
assurance.
Timeline: Initial phase 6 months; could get extended based on mission success.
Responsibilities
Data Collection and Analysis
Perform structured web research on official hospital databases.
Register on platforms and retrieve relevant national datasets.
Audit available data (scope, completeness, frequency, format).
Assess subscription costs and data licensing constraints.
Evaluate data usability and perform source-to-model mapping analysis.
Document collection processes and maintain methodological transparency.
Perform manual scraping and data normalization when necessary.
Map collected datasets to the customer's data model.
Maintain and update project dashboards and sourcing trackers.
Data Harmonization
Harmonize data from different sources into a coherent dataset
Conduct consistency checks and rule-based quality audits.
Apply data governance rules (e.g., replace n< 5 with n=5 for compliance).
Enforce naming conventions and standardized variable structures.
Coordinate with Data Analyst for continuous QA feedback.
Data Integration
Review validated procedures and translate them into technical workflows.
Develop or support automation for data retrieval (API connections, secure
transfers, or structured downloads).
Ensure compatibility of formats, metadata specifications, and export
standards.
Implement secure data transfer protocols aligned with customer's
infrastructure.
Requited Skills and Knowledge
1. Good understanding of health care domain and data types
2. Familiarity with ICD10, SNOWMED
3. Web scraping skills using Python, JavaScript, Selenium
4. Ability to use Regex to extract ICD Codes and numbers and models such as
SpaCy, Tesseract etc for text extraction from PDFs (if needed)
5. Ability to use workflow automation tools Eg: Airflow, Prefect, AWS
Stepfunctions, Metaflow etc
6. Good communication skills and innovative, problem-solving skills
Apply for this Position
Ready to join ? Click the button below to submit your application.
Submit Application