Job Description
We're building an MVP dataset of all healthcare organizations (clinics, hospitals, therapy groups, etc.) in the U.S.
This project focuses on clean data, good structure, and human-friendly workflows, not heavy automation or complex infrastructure.
If you're comfortable working with large CSV datasets, SQL, and Google Sheets, this is likely a good fit.
What You'll Be Doing
1. Build the Core Provider Dataset
- Load public healthcare datasets (NPPES)
-- Filter to organization-level providers (Type 2 NPIs)
- Fuzzy match them against datasets from Google Maps API
- Normalize names, addresses, and categories
2. Categorize Providers
Group organizations into high-level categories:
- hospitals / health systems
- physical therapy organizations
- chiropractic organizations
- imaging centers, labs, etc.
Use taxonomy codes and name patterns
3 Google Sheets Integration
- Export selected records to Google Sheets if this list is manageable there.
Sk...
This project focuses on clean data, good structure, and human-friendly workflows, not heavy automation or complex infrastructure.
If you're comfortable working with large CSV datasets, SQL, and Google Sheets, this is likely a good fit.
What You'll Be Doing
1. Build the Core Provider Dataset
- Load public healthcare datasets (NPPES)
-- Filter to organization-level providers (Type 2 NPIs)
- Fuzzy match them against datasets from Google Maps API
- Normalize names, addresses, and categories
2. Categorize Providers
Group organizations into high-level categories:
- hospitals / health systems
- physical therapy organizations
- chiropractic organizations
- imaging centers, labs, etc.
Use taxonomy codes and name patterns
3 Google Sheets Integration
- Export selected records to Google Sheets if this list is manageable there.
Sk...
Apply for this Position
Ready to join Confidential? Click the button below to submit your application.
Submit Application