Job Description
Project Description
We are building a small internal real-estate acquisition pipeline in Python.
We already have a working underwriting / scoring model that computes:
• proximity tiers (Hot / Warm / Cold)
• internal rent clusters (Minimal / Limited / Dense)
• rent-based yield
• rule-based review flags (rent restrictions, new construction, etc.)
We need a developer to build the data ingestion and geocoding layer that feeds this model.
This is a file-based ETL project (CSV in / CSV out).
No scraping and no UI.
⸻
Scope of work
1. Read an initial MLS CSV export (baseline file)
2. Read daily MLS CSV exports (delta files)
• new listings
• modified listings
3. Merge baseline and daily deltas into a rolling master dataset
(keyed by MLS number)
4. Build a geocoding step using the Geoapify API
• only geocode addresses that do not already exist in a cache
• store geocoded results in a local cac...
We are building a small internal real-estate acquisition pipeline in Python.
We already have a working underwriting / scoring model that computes:
• proximity tiers (Hot / Warm / Cold)
• internal rent clusters (Minimal / Limited / Dense)
• rent-based yield
• rule-based review flags (rent restrictions, new construction, etc.)
We need a developer to build the data ingestion and geocoding layer that feeds this model.
This is a file-based ETL project (CSV in / CSV out).
No scraping and no UI.
⸻
Scope of work
1. Read an initial MLS CSV export (baseline file)
2. Read daily MLS CSV exports (delta files)
• new listings
• modified listings
3. Merge baseline and daily deltas into a rolling master dataset
(keyed by MLS number)
4. Build a geocoding step using the Geoapify API
• only geocode addresses that do not already exist in a cache
• store geocoded results in a local cac...
Apply for this Position
Ready to join Confidential? Click the button below to submit your application.
Submit Application