Skip to main content

Data Sources

ValiFit aggregates data from 28 data sources across federal agencies, health departments, real estate platforms, and infrastructure databases to deliver comprehensive housing intelligence for all 50 states.

28Data Sources
297,276,813+Total Records
50States Covered

Federal Government

15 sources

ACS 5-Year Demographics

U.S. Census Bureau

census_demographics_zcta

Population, income, education, housing, and commute data for every ZIP Code Tabulation Area nationwide.

33,174 ZCTAs
Annual

TIGER/Line Boundaries

U.S. Census Bureau

place_boundaries

Geographic boundary polygons for municipalities, counties, school districts, and ZIP codes.

32,037 boundaries
Annual

Census Places

U.S. Census Bureau

census_places

Official place definitions including cities, towns, CDPs, and consolidated city-counties with populations.

53,201 places
Annual

Common Core of Data (Schools)

NCES

national_schools

Every public school in America: location, enrollment, grade span, Title I status, and school type.

102,274 schools
Annual

F-33 School Finance

NCES

school_district_finance

Per-pupil expenditure and revenue data for every school district. Powers the Education ROI composite.

19,649 districts
Annual

School Performance

NCES

school_performance

Achievement data, graduation rates, and student-teacher ratios across all 50 states.

89,939 records
Annual

NIBRS Crime Data (2024)

FBI

crime_data_nibrs_2024

National Incident-Based Reporting System crime statistics by agency. Powers the Safety Infrastructure composite.

8,958 agencies
Annual

Law Enforcement Officers (LEE)

FBI

fbi_lee_2024

Sworn officer counts and civilian staff per agency. Used for police-per-capita safety scoring.

13,315 agencies
Annual

Fair Market Rents

HUD

hud_fair_market_rents

FMR and Small Area FMR rent estimates by bedroom count for every metropolitan and non-metro area.

51,895 areas
Annual

FHA Loan Limits

HUD

fha_loan_limits

Maximum FHA-insured mortgage amounts by county. Used in Budget composite affordability checks.

3,235 counties
Annual

National Flood Hazard Layer

FEMA

fema_flood_zones

Flood zone designations for every mapped area in the U.S. Powers the Environmental Safety composite.

4,448,886 zones
Quarterly

Superfund + Brownfield Sites

EPA

epa_sites

Active and archived contamination sites including NPL Superfund and brownfield redevelopment locations.

45,678 sites
Quarterly

Underground Storage Tanks

EPA

underground_storage_tanks

Registered underground storage tanks with leak status and cleanup records. Environmental hazard layer.

742,855 tanks
Quarterly

Smart Location Walkability Index

EPA

walkability_scores

Block-group-level walkability, transit access, and land-use mix scores. Powers the Livability composite.

220,739 block groups
Annual

FRED Mortgage Rates

Federal Reserve

mortgage_rates

Daily 30-year and 15-year fixed mortgage rate history from Freddie Mac Primary Mortgage Market Survey.

Daily rates
Daily

Department of Health

4 sources

Fire Stations

HIFLD

fire_stations

Every fire station in the United States with location and jurisdiction. Safety Infrastructure composite.

52,051 stations
Annual

EMS / Ambulance Stations

HIFLD

ems_stations

Emergency medical services and ambulance station locations with service type and coverage area.

7,045 stations
Annual

National Provider Registry (NPI)

CMS

health_providers

Every licensed healthcare provider in the country: physicians, dentists, therapists, specialists.

2,853,274 providers
Monthly

Health Professional Shortage Areas

HRSA

health_shortage_areas

Primary care, dental, and mental health shortage area designations by geography and population.

151,364 designations
Quarterly

Infrastructure

2 sources

Hospitals

HIFLD

health_hospitals

Hospital locations, bed counts, trauma center designations, and ownership type for all U.S. hospitals.

8,590 hospitals
Annual

Transit Stops (GTFS)

GTFS

transit_stops

Public transit stop locations aggregated from transit agency GTFS feeds nationwide.

348,353 stops
Quarterly

Real Estate & Property

4 sources

National Parcel Data

Regrid

Parcel boundaries, addresses, and property characteristics for virtually every property in the U.S.

278,831,446 parcels
Monthly

Market Data Center

Redfin

redfin_market_data

Median sale prices, days on market, sale-to-list ratios, and appreciation trends by region.

6,199,749 records
Monthly

Building Permits

State DCA / County Records

building_permits

Residential and commercial building permit filings with project type, value, and status. Currently covers NJ with more states being added.

2,638,208 permits
Monthly

Property Data (Live API)

ATTOM

Permits, automated valuation models (AVM), sales history, and comparable sales via live API.

Live
Live

Other

3 sources

Google Places API

Google

Points of interest, local businesses, restaurants, grocery stores, gyms, and lifestyle amenities.

Live
Live

Town Gallery Photos

Wikimedia Commons

town_gallery

Curated landmark and cityscape photography for town pages, verified by alt-text matching.

35,377 photos
Monthly

Weather & Climate

Open-Meteo

Live weather conditions, historical climate averages, and seasonal temperature data.

Live
Live

Fair Housing Act Compliant Algorithm

Designed to prevent steering and discriminatory outcomes

ValiFit's scoring algorithm is architected from the ground up to comply with the Fair Housing Act and applicable state anti-discrimination laws. Every scored metric measures government investment efficiency— how effectively tax dollars become public services — not demographics. No score uses race, ethnicity, national origin, religion, familial status, disability, or any protected class.

Scored & Compared (Enters Composites)

Government investment efficiency metrics — tax dollars in, public services out:

Property tax ratesGovernment spending per capitaPolice officers per capitaFire/EMS stationsCrime ratesPer-pupil school spendingStudent-teacher ratiosFlood zonesRadon levelsContamination sitesEnvironmental hazardsWalkabilityTransit accessAmenity densityProperty value appreciationCommute distance

Display Only (Not Scored)

Shown on town and property pages for context, never used in rankings:

School test scoresGraduation ratesHospital countsProvider densityShortage areasDemographicsAge distributionIncome dataEducation levelsIndividual property detailsSale historyTax assessment

Why this matters:Every scored metric measures a government service outcome — how many officers per capita, how much a district spends per student, whether flood infrastructure exists. Crime rates enter the Safety Infrastructure composite as a measure of public safety outcomes (a government service), not as a proxy for demographics. School test scores, demographics, and income are shown for context but never enter rankings. This design prevents the system from steering users toward or away from neighborhoods based on characteristics that correlate with protected classes.

Data Accuracy & Updates

All data is sourced directly from official government agencies and is updated regularly to ensure accuracy. Data refresh frequency varies by source, typically ranging from monthly to annually depending on the agency's publication schedule.

Attribution & Legal

Market data provided by Redfin. Property data via ATTOM. Local amenities powered by Google. Town images from Wikimedia Commons. Government data from U.S. Census Bureau, FEMA, FBI, HUD, EPA, NCES, HIFLD, CMS, HRSA, and the Federal Reserve.

Public Domain: Census, FEMA, FBI, HUD, TIGER/Line, FRED, HIFLDPublic Records: NCES, EPA, CMS, HRSA, GTFS, NJ DCA, RegridCommercial (with attribution): Redfin, ATTOM, Google, Open-Meteo