Using deep learning and Using deep learning and Google Street View - - PowerPoint PPT Presentation

using deep learning and using deep learning and google
SMART_READER_LITE
LIVE PREVIEW

Using deep learning and Using deep learning and Google Street View - - PowerPoint PPT Presentation

Using deep learning and Using deep learning and Google Street View to estimate Google Street View to estimate the demographic makeup of the demographic makeup of neighbourhoods across the US neighbourhoods across the US Gebru , T. et al.


slide-1
SLIDE 1

Using deep learning and Using deep learning and Google Street View to estimate Google Street View to estimate the demographic makeup of the demographic makeup of neighbourhoods across the US neighbourhoods across the US

Gebru , T. et al. (2017) DOI: 10.1073/pnas.1700035114

1

slide-2
SLIDE 2

In a nutshell... In a nutshell...

The United States spends more than $250 million each year on the American Community Survey (ACS), a labor-intensive door-to-doors tudy that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although a comprehensive source of data, the lag between demographic changes and their appearance in the ACS can exceed several years. As digital imagery becomes ubiquitous and machine vision techniques improve, automated data analysis may become an increasingly practical supplement to the ACS. Here, we present a method that estimates socioeconomic characteristics of regions spanning 200 US cities by using 50 million images of street scenes gathered with Google Street View cars. Using deep learning-based computer vision techniques, we determined the make, model, and year of all motor vehicles encountered in particular neighborhoods. Data from this census of motor vehicles, which enumerated 22 million automobiles in total (8% of all automobiles in the United States), were used to accurately estimate income, race, education, and voting patterns at the zip code and precinct level. (The average US precinct contains 1,000 people.) The resulting associations are surprisingly simple and powerful. For instance, if the number of sedans encountered during a drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next presidential election (88% chance); otherwise, it is likely to vote Republican (82%). Our results suggest that automated systems for monitoring demographics may effectively complement labor-intensive approaches, with the potential to measure demographics with fine spatial resolution, in close to real time.

2

slide-3
SLIDE 3

In a nutshell... In a nutshell...

The United States spends more than $250 million each year on the American Community Survey (ACS), a labor-intensive door-to-doors tudy that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although a comprehensive source of data, the lag between demographic changes and their appearance in the ACS can exceed several years. As digital imagery becomes ubiquitous and machine vision techniques improve, automated data analysis may become an increasingly practical supplement to the ACS. Here, we present a method that estimates socioeconomic characteristics of regions spanning 200 US cities by using 50 million images of street scenes gathered with Google Street View cars. Using deep learning-based computer vision techniques, we determined the make, model, and year of all motor vehicles encountered in particular neighborhoods. Data from this census of motor vehicles, which enumerated 22 million automobiles in total (8% of all automobiles in the United States), were used to accurately estimate income, race, education, and voting patterns at the zip code and precinct level. (The average US precinct contains 1,000 people.) The resulting associations are surprisingly simple and powerful. For instance, if the number of sedans encountered during a drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next presidential election (88% chance); otherwise, it is likely to vote Republican (82%). Our results suggest that automated systems for monitoring demographics may effectively complement labor-intensive approaches, with the potential to measure demographics with fine spatial resolution, in close to real time.

3

slide-4
SLIDE 4

American Community Survey: American Community Survey:

Demographic statistics for all US cities/counties with population ≥ 65,000 Comprehensive, but demographic changes are typically reported with a lag of several years Expensive and labour-intensive

4

slide-5
SLIDE 5

American Community Survey: American Community Survey:

Demographic statistics for all US cities/counties with population ≥ 65,000 Comprehensive, but demographic changes are typically reported with a lag of several years Expensive and labour-intensive

Computational methods can be useful in tackling such challenges facing topics in social science

5

slide-6
SLIDE 6

Research question Research question

Can demographic statistics and voter preferences be inferred from objective characteristics of images from a neighbourhood?

6

slide-7
SLIDE 7

Research question Research question

Can demographic statistics and voter preferences be inferred from objective characteristics of images from a neighbourhood?

Machine learning methods can be used with public data to determine socioeconomic statistics and political preferences in the US

7

slide-8
SLIDE 8

Methodology: Methodology: overview

  • verview
  • 1. Establishment of dataset based on 50 million images

taken from Google Street View Exterior of houses, landscaping, vehicles parked

  • n the street etc.
  • 2. Machine vision framework based on convolutional

neural networks (CNN) to classify vehicles Recognise vehicles and determine their characteristics (e.g. make, model etc.)

  • 3. Classified vehicles were use to infer a range of

demographic statistics and socioeconomic attributes

8

slide-9
SLIDE 9

1.

  • 1. Dataset

Dataset

50 million Google street view images from 3068 ZIP codes and 39286 voting precincts across 200 cities Product shot images: 2657 visually distinct car categories Amazon Mechanical Turk to gather labeled car images from specific websites (cars.com, craigslist.org etc.) Expert annotation of a subset

  • f Google street view images

9

slide-10
SLIDE 10

2.

  • 2. Machine vision framework

Machine vision framework

Car detection Key challenge: balance accuracy and efficiency Deformable Part Model (DPM) as

  • bject recognition algorithm

Detection of 22 million distinct vehicles Car classification CNN trained to distinguish different types of vehicle Classify each vehicle into one of 2657 categories

10

slide-11
SLIDE 11

3.

  • 3. Demographic estimation

Demographic estimation

Based on 88 car-related attributes: average #cars per image, average car price, %hybrid cars, %foreign cars, %cars of a specific make, etc. Socioeconomic data obtained from ACS between 2008-2012 2008 election data from another study

Experimental procedure Dataset partitioned based on county into training (A-C; 35/200) and test (D-Z) sets Ridge regression model for income and voter affiliation estimation Logistic regression for race and education prediction Five models trained in each case using fivefold cross-validation to select regularisation parameter (average)

11

slide-12
SLIDE 12

Results Results (training set) (training set)

Strong associations between vehicle distribution and different demographic factors

12

slide-13
SLIDE 13

Results Results (training set) (training set)

Strong associations between vehicle distribution and different demographic factors

Toyotas/Hondas → Asian Sedans → Democrat Pickup trucks → Caucausian/Republican Chrysler/Buick → African American

13

slide-14
SLIDE 14

Results Results (test set) (test set)

Strong correlation between test results and ACS values for all demographic variables at city resolution; 0.54 ≤ r ≤ 0.87 Test results at ZIP code resolution also exhibited close correspondence with ACS values

14

slide-15
SLIDE 15

Voter Voter preferences preferences

Strong correlation between estimates and actual voting results of 2008 elections at city resolution; r =0.73 Precinct-level estimates also closely matched ground truth data; r =0.57

15

slide-16
SLIDE 16

Conclusions Conclusions

Neighbourhood images can be used to accurately estimate demographic statistics and voter preferences in the US via automated machine learning algorithms. Only data from a few cities are required to provide up-to-date statistics for many cities and ZIP codes. Model could be improved by expanding object recognition incorporate global image features integrating other data types.

16

slide-17
SLIDE 17

Thank you. Thank you.

17