Context-aware Real-time Population Estimation for Metropolis Fengli - - PowerPoint PPT Presentation

context aware real time population estimation for
SMART_READER_LITE
LIVE PREVIEW

Context-aware Real-time Population Estimation for Metropolis Fengli - - PowerPoint PPT Presentation

Context-aware Real-time Population Estimation for Metropolis Fengli Xu FIB, Tsinghua University www.1ppt.com Outline Motivation 1 Material 2 Method 3 Application 4 Motivation Significance 1 Achieving real-time population


slide-1
SLIDE 1

www.1ppt.com

Context-aware Real-time Population Estimation for Metropolis

Fengli Xu FIB, Tsinghua University

slide-2
SLIDE 2

Outline Motivation Material Method Application 1 2 3 4

slide-3
SLIDE 3

Significance 1 2 3 4 Motivation

Achieving real-time population distribution benefits: n Transportation scheduling n Anomaly detection n Urban planning

Urban citizens are mobile in their daily lives. The population distribution varies during the day.

slide-4
SLIDE 4

Problems of population census

nVery expensive nHigh latency

Motivation It’s not feasible to achieve real-time population distribution through census.

slide-5
SLIDE 5

Limitations of previous attempts 1 2 3 4 Motivation Call Detail Records1

  • Low spatial resolution
  • High latency(sparse records)

Remote Sensing Images2

  • Require multiple datasets
  • Can’t track day-time variation
  • 1. Deville, et al. Dynamic population mapping using mobile phone data[J]. PNAS, 2014.
  • 2. Stevens F R, et al. Disaggregating census data for population …[J]. PloS one, 2015.
slide-6
SLIDE 6

Appropriate datasets Advanced method

Key points to address these problems Motivation Limitations of previous attempts

slide-7
SLIDE 7

Cellular data access records Material

Device’s ID||Start time||End time||BS ID||Location||Traffic volume From August 1 to August 31, 2014

9600 BSs 150,000 users

Large-scale Long duration

Fine-grained

Start and end time accurate to second

Contains 1.96 billion logs, total size over 300GB

We extract the number of access of each base station at granularity of one hour.

slide-8
SLIDE 8

Dataset features Material

High Sampling Rate

n 85% of consecutive records happen in last than 10 Mins.

Extensive records

n Most of users have more than 1,000 records in total.

8.2 hours on average for call records 1

  • 1. Gonzalez M C, et al. Understanding individual human mobility patterns[J]. Nature, 2008.
slide-9
SLIDE 9

Visualization 1 2 4 Material

Mobile users’ behavior is related to the type of their location(physical context).

Different

Can we achieve context-aware segmentation

  • f the cities?
slide-10
SLIDE 10

Context-aware segmentation 1 2 3 4 Material Road network forms a natural segmentation of urban environment. 1

  • 1. Yuan J, et al. Discovering regions of different functions …,SIGKDD, 2012.
slide-11
SLIDE 11

n Collect POI data though APIs1 n Extract TF-IDF features n Using kmeans to detect clusters

Labeling the type of regions Material POI — a specific point location of a certain function.

  • 1. http://map.baidu.com/
slide-12
SLIDE 12

Ground truth Material

n Only provide night-time population. n Accurate(State of the art). n High resolution 100mX100m.

𝑋𝑝𝑠𝑚𝑒𝑞𝑝𝑞 𝑞𝑠𝑝𝑘𝑓𝑑𝑢1:

  • 1. http://www.worldpop.org.uk/.
slide-13
SLIDE 13

Data fusion Material

Mapping cellular data and worldpop data into segmented regions based on overlapping area.

slide-14
SLIDE 14

Estimation model 1 2 3 4 Method

n Inspiration: superlinear effect has been discovered in many fields in urban area, which is considered to be the result of intensive cooperation. 1

𝜄. = 𝛽(𝜍.)4 ln 𝜄. = ln 𝛽 + βln 𝜍.

  • 1. Bettencourt L M A. The origins of scaling in cities[J]. science, 2013.
slide-15
SLIDE 15

Estimation model 1 2 3 4 Method

n Context-aware estimation model.

𝜄. = 𝛽9(𝜍.)4: 𝑘 = 1, 2 … 7

Users’ behavior is spatial heterogeneous.

slide-16
SLIDE 16

Estimation model 1 2 3 4 Method

n Expand the model into a dynamic one. 𝑆A = B 𝜄.

  • .

B 𝛽9 𝜍.

4:

  • D

𝛽9

A = 𝑆A×𝛽9

𝜄 F.

A = 𝛽9 A(𝜍. A )4:

𝛽9

A is scaled to model the temporal inhomogeneity of users’

behavior, while 𝛾𝑘 is fixed to model spatial characteristics.

slide-17
SLIDE 17

Evaluation method Method

  • Worldpop dataset.
  • Transportation dataset(10 million taxi

trips, 1 month)1.

Evaluation datasets

  • Validate the night-time estimation with

Worldpop data.

  • Evaluate the real-time estimation with

transportation dataset.

Evaluation schemes

  • 1. http://soda.datashanghai.gov.cn/.
slide-18
SLIDE 18

Evaluating night-time estimation 1 2 Method

n Reduce 22.5% estimation error, enhance 12.5% correlation. n Performance gain is most significant in education, scenery and business regions.

slide-19
SLIDE 19

Evaluating dynamic estimation 1 2 3 4 Method

n Estimated population has a high correlation with taxi data in central area of urban. n The correlation is significantly higher during day-time.

slide-20
SLIDE 20

Evaluating dynamic estimation Method

n The correlation monotonically increases with taxi density. n Underlying reasons: taxi data can’t capture population well when its density is low.

The dynamic estimation matches well with taxi data.

slide-21
SLIDE 21

Observing urban dynamics Application Simple visualization can quantify the phenomenon

  • f morning and evening rush.
slide-22
SLIDE 22

Observing urban dynamics Application

n Morning rush is more intensive than evening rush. n Different functional regions have distinct population patterns.

slide-23
SLIDE 23

Locate new subway station 1 2 3 4 Application

n Warmer color represents higher variation of population. n The regions with high population variation and no subway station are recommended identified.

slide-24
SLIDE 24

Summary

  • Appropriate data: collect 3G/LTE data access records of over

9,600 BSs with 150,000 subscribers for one month

  • Advanced method: First estimation model to produce accurate

real-time population estimation.

  • Applications:
  • Visualizing and quantifying the dynamics of urban population.
  • Recommending locations for new subway stations.
slide-25
SLIDE 25

Thanks you!

For Data Sample, Please Contact xfl15@mails.tsinghua.edu.cn liyong07@tsinghua.edu.cn FIB-LAB: http://fi.ee.Tsinghua.edu.cn