ML / Data
ML pipeline fusing Census, Zillow, Eviction Lab data to predict gentrification risk.
- ▪Fused US Census, Zillow ZHVI, Eviction Lab data into unified time-series geospatial DB via ETL.
- ▪Built binary classifiers (LR, RF, LightGBM) with 5-fold CV + GridSearchCV, achieving 0.77 ROC-AUC.
- ▪Used SHAP to interpret PCA-reduced features, mapping back to socioeconomic variables.
- ▪Built SARIMAX models to forecast 5-year SEIFA scores, flagging at-risk counties.
Show stack
Feature EngineeringGeospatialMLSHAPSARIMAX