Job Description
We are seeking a Data Engineer / Python Developer to lead the data acquisition and processing efforts for a high-stakes, agentic AI chatbot in the healthcare domain. This is not a traditional BI or ETL role; you will not be building dashboards or moving data for analytics. Instead, you will architect a robust, modular engine capable of crawling, parsing, and normalizing vast amounts of unstructured and structured healthcare data from diverse sources—ranging from dynamic JavaScript websites and PDFs to proprietary vendor formats.
Location: Toronto, ON (1 day/week onsite)
Key Responsibilities
Data Collection & Web Crawling
- Advanced Web Scraping: Build and maintain scalable scrapers for HTML and dynamic, JavaScript-heavy websites using Scrapy and BeautifulSoup.
- Multi-Format Ingestion: Develop custom parsers to ingest and normalize data from XML, RSS f...