from baltimore to the stars with data
play

FROM BALTIMORE TO THE STARS WITH DATA Tamas Budavari / Applied Math - PowerPoint PPT Presentation

FROM BALTIMORE TO THE STARS WITH DATA Tamas Budavari / Applied Math & Stats, JHU Breaking the Divestment Cycle: Predicting Abandonment & Fostering Neighborhood Revitalization in Baltimore Tams Budavri Applied Mathematics &


  1. FROM BALTIMORE TO THE STARS WITH DATA Tamas Budavari / Applied Math & Stats, JHU

  2. Breaking the Divestment Cycle: Predicting Abandonment & Fostering Neighborhood Revitalization in Baltimore Tamás Budavári Applied Mathematics & Statistics – The Johns Hopkins University

  3. Baltimore overview • Baltimore has lost 1/3 of its population since 1950 • Today, we have 16,500 boarded up vacant buildings • Of these, 13,000 are in distressed markets M. Braverman

  4. Boarded up vacants M. Braverman

  5. 1 data fusion data science geometry + history highly extensible flexible data platform predictive modeling & optimization

  6. 2 social science modeling transition estimating externalities evaluating policy

  7. 2 social science modeling transition estimating externalities evaluating policy

  8. 3 government rapid response queries assisting with strategic investments mapping “unoccupancy”

  9. Data in Baltimore  OpenBaltimore  Hundreds of public datasets online http://data.baltimorecity.gov  Plus more administrative data

  10. DHCD’s Data Infrastructure M. Braverman  Dept. of Housing & Community Dev J. D. Evans  Study changes over time  Support decision making  Statistics to help?  Inference & prediction

  11. Jim Gray’s 20 Questions  Data-driven studies  Low-level questions  What we see  High-level questions  Help hone policy making  Interventions

  12. Built a Unique Solution  Database of Baltimore City  Geospatial info for all parcels  Time history of real properties  Easily extendable  On the IDIES’s Data-Scope  Novel indexing for fast links

  13. Mapping Vacancy  2010  2015 Phil Garboden

  14. Mapping Vacancy  2010  2015 Phil Garboden

  15. Clustering of Vacancy  Probability of finding a vacant next to another  Quantitative comparison  Over time  Across town

  16. Similar Neighborhoods  Similarity graphs & eigenmaps

  17. What is a Neighborhood?  Are neighborhood boundaries meaningful?  Better grouping of houses?  Trends on a finer scale

  18. Collapsed Vacants

  19. Collapsed Vacant  Ends of contiguous blocks of rowhomes  Alleys, gaps and demos break rows  Need “sub-blockface” analysis  Time-dependent

  20. Neighborhood Revitalization  Modeling urban transitions  What factors catalyze reinvestment?  Disinvestment?  Innovative use of data  New sources of information  Zillow? Cell phone usage?

  21. Neighborhood Revitalization  Modeling urban transitions  What factors catalyze reinvestment?  Disinvestment?  Innovative use of data  New sources of information  Zillow? Cell phone usage?

  22. Strategic Investments  Governor’s budget  Unprecedented $75M  City scheduling  Spring 2016  JHU map of targets!

  23. Strategic Investments  Combinatorial Optimization  Improve some objective, e.g., or  Within a limited budget  Best objective? How to solve?

  24. Optimize the Impact  Different objectives  Same budget  Advanced tools  For decision makers Lenny Fan Amitabh Basu Phil Garboden

  25. Price  Longitudinal data  Environment  Prediction  Machine Learning

  26. Ambitious Next Steps Ben Seigel (21CC) Katalin Szlavecz Ben Zaitchik Keeve Nachman Katie O’Meara (MICA)

  27. Spatiotemporal Multi-Level Modeling  Hierarchical Bayesian statistics  Include all aggregated data  Joint inference for the  Individual houses and  Ensemble distributions Mengyang Gu

  28. Predicting Unoccupancy  Time-series data  Water usage  BG&E usage  USPS  Proxy for occupancy Phil Garboden Hana Clemens

  29. Satellite View  Missing roof?  Blue tarp = holes?

  30. Image behind the Atmosphere  Looking up! Coadded Image  Astronomy images  Blurred exposures  We solve for it  For high-res details Matthias Lee Charlie Gulian Rick White

  31. Image behind the Atmosphere  Looking up! Coadded Image  Astronomy images  Blurred exposures  We solve for it  For high-res details Matthias Lee Charlie Gulian Rick White

  32. Image behind the Atmosphere  Looking up! Deconvolved Image  Astronomy images  Blurred exposures  We solve for it  For high-res details Matthias Lee Charlie Gulian Rick White

  33. Image behind the Atmosphere  Looking up! Hubble Image  Astronomy images  Blurred exposures  We solve for it  For high-res details Matthias Lee Charlie Gulian Rick White

  34. Differential Chromatic Refraction  Even colors! Matthias Lee Andy Connolly Charlie Gulian

  35. Differential Chromatic Refraction  Even colors! Matthias Lee Andy Connolly Charlie Gulian

  36. At the Heart…  Applied Math & Stats  Data-Intensive Science  Data mining  Hardware platforms  Statistical modeling  Software solutions  Machine learning  Streaming algorithms  Optimization  Database technologies  Bayesian inference  GIS tools & indexing

  37. Limitations of Machine Learning  Many methods to choose from  And more knobs to tweak  Latching on known features  Manual intervention to refine  What’s left in the data? Missing the Human in the Loop!

  38. Use the Brain’s Detection Power

  39. Rapid Serial Visual Presentation  Current state-of-the-art is binary classification  Target / Distractor  We look for the interesting  Dynamic behavior of brain: looking for new Nick Carey

  40. Human-Machine Co-Learning  Hide wireframe of 3D cube in high-D  Looks like noise  Random projections Nick Carey

  41. Human-Machine Co-Learning  Hide wireframe of 3D cube in high-D  Looks like noise  Random projections  Trigger to explore locally Nick Carey

  42. Human-Machine Co-Learning  Hide wireframe of 3D cube in high-D  Looks like noise  Random projections  Trigger to explore locally  Converge on better view Nick Carey

  43. Human-Machine Co-Learning  Hide wireframe of 3D cube in high-D  Looks like noise  Random projections  Trigger to explore locally  Converge on better view Subconscious Navigation! Nick Carey

  44. Human-Machine Co-Learning  Hide wireframe of 3D cube in high-D  Looks like noise  Random projections  Trigger to explore locally  Converge on better view Subconscious Navigation! Nick Carey

  45. Summary  Promising first steps  With direct applications already deployed  Common data infrastructure & approaches  Surprisingly similar, e.g., across astro/city  Ambitious future plans  Need help! And need more data…

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend