Predicting poverty from satellite imagery Neal Jean, Michael Xie, - - PowerPoint PPT Presentation

predicting poverty from satellite imagery
SMART_READER_LITE
LIVE PREVIEW

Predicting poverty from satellite imagery Neal Jean, Michael Xie, - - PowerPoint PPT Presentation

Predicting poverty from satellite imagery Neal Jean, Michael Xie, Stefano Ermon Department of Computer Science Stanford University Matt Davis, Marshall Burke, David Lobell Department of Earth Systems Science Stanford University 1 Why


slide-1
SLIDE 1

Predicting poverty from satellite imagery

Neal Jean, Michael Xie, Stefano Ermon Department of Computer Science Stanford University Matt Davis, Marshall Burke, David Lobell Department of Earth Systems Science Stanford University

1

slide-2
SLIDE 2

Why poverty?

  • #1 UN Sustainable Development Goal

– Global poverty line: $1.90/person/day

  • Understanding poverty can lead to:

– Informed policy-making – Targeted NGO and aid efforts

2

slide-3
SLIDE 3

Data scarcity

3

slide-4
SLIDE 4

Lack of quality data is a huge challenge

  • Expensive to conduct

surveys:

– $400,000 to $1.5 million

  • Data scarcity:

– <0.01% of total households covered by surveys

  • Poor spatial and

temporal resolution

4

slide-5
SLIDE 5

Satellite imagery is low-cost and globally available

5

Simultaneously becoming cheaper and higher resolution (DigitalGlobe, Planet Labs, Skybox, etc.)

Shipping records Inventory estimates Agricultural yield Deforestation rate

slide-6
SLIDE 6

What if…

6

we could infer socioeconomic indicators from large-scale, remotely-sensed data?

slide-7
SLIDE 7

Standard supervised learning won’t work

7

  • Very little training data (few thousand data points)
  • Nontrivial for humans (hard to crowdsource labels)

Poverty, wealth, child mortality, etc.

Model

Input Output

slide-8
SLIDE 8

Transfer learning overcomes data scarcity

8

Transfer learning: Use knowledge gained from

  • ne task to solve a different (but related) task

Train here Perform here

Transfer

slide-9
SLIDE 9

Transfer learning bridges the data gap

9

  • A. Satellite images
  • C. Poverty measures

Not enough data!

  • B. Proxy outputs

Deep learning model

Plenty of data! Less data needed Would this work?

slide-10
SLIDE 10

Nighttime lights as proxy for economic development

10

slide-11
SLIDE 11

Why not use nightlights directly?

11

  • A. Satellite images
  • C. Poverty measures
  • B. Nighttime light intensities
slide-12
SLIDE 12

Not so fast…

12

Almost no variation below the poverty line

slide-13
SLIDE 13

Lights aren’t useful for helping the poorest

13

slide-14
SLIDE 14

Step 1: Predict nighttime light intensities

14

training images sampled from these locations

  • A. Satellite images
  • C. Poverty measures

Deep learning model

  • B. Nighttime light intensities
slide-15
SLIDE 15

Training data on the proxy task is plentiful

15

Millions of training images

training images sampled from these locations

Low nightlight intensity

,

High nightlight intensity

, …

( ( ) )

Labeled input/output training pairs

slide-16
SLIDE 16

Images summarized as low-dimensional feature vectors

16

f1 f2 … f4096

Night- Light

Nonlinear mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities Convolutional Neural Network (CNN) Linear regression

slide-17
SLIDE 17

Feature learning

17

f1 f2 … f4096

Night- Light

Nonlinear mapping

Linear regression

min

𝜄,𝜄′ 𝑗=1 𝑛 𝑚 𝑧𝑗,

𝑧𝑗 = min

𝜄,𝜄′ 𝑗=1 𝑛 𝑚 𝑧𝑗 , 𝜄𝑈 𝑔(𝑦𝑗, 𝜄′ )

Over 50 million parameters to fit Run gradient descent for a few days

𝜄 𝜄′

slide-18
SLIDE 18

Transfer Learning

18

Have we learned to identify useful features?

f1 f2 … f4096

Poverty

Nonlinear mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities Feature Learning

Target task

slide-19
SLIDE 19

Model learns relevant features automatically

19

Satellite image Filter activation map Overlaid image

slide-20
SLIDE 20
  • Living Standards Measurement Study (LSMS) data in

Uganda (World Bank)

– Collected data on household features

  • Roof type, number of rooms, distance to major road, etc.

– Report household consumption expenditures

  • Task: Predict if the majority of households in a cluster

are above or below the poverty line

Target task: Binary poverty classification

20

slide-21
SLIDE 21

How does our model compare?

21

Survey-based model is the gold standard for accuracy but…

– Relies on expensively collected data – Is difficult to scale, not comprehensive in coverage

0.75 0.53 0.71 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8

Survey Nightlights Transfer

Accuracy

slide-22
SLIDE 22

Transfer learning model approaches survey accuracy

22

0.75 0.53 0.71 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8

Survey Nightlights Transfer

Accuracy

Advantages of transfer learning approach:

– Relies on inexpensive, publicly available data – Globally scalable, doesn’t require unifying disparate datasets

slide-23
SLIDE 23

Our model maps poverty at high resolution

23

Case study: Uganda

  • Most recent poverty map over a decade old
  • Lack of ground truth highlights need for more data

Smoothed predictions District aggregated Official map (2005)

slide-24
SLIDE 24

We can differentiate different levels of poverty

24

2 continuous measures of wealth:

  • Consumption expenditures
  • Household assets

We outperform recent methods based on mobile call record data

Blumenstock et al. (2015) Predicting Poverty and Wealth from Mobile Phone Metadata, Science

slide-25
SLIDE 25

Transfer Learning

25

f1 f2 … f4096

Expenditures Assets

Nonlinear mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities Feature Learning

Target task

slide-26
SLIDE 26

Models travels well across borders

26

Models trained in one country perform well in

  • ther countries

Can make predictions in countries where no data exists at all

slide-27
SLIDE 27

What do we still need?

  • Develop models that account for spatial and

temporal dependencies of poverty and health

27

slide-28
SLIDE 28

Take full advantage of incredible richness of images

28

slide-29
SLIDE 29

We have introduced an accurate, inexpensive, and scalable approach to predicting poverty and wealth

29

A new approach based on satellite imagery

slide-30
SLIDE 30

30