De Detec%ng and ec%ng and An Analyzing Urb Urban an Regions - - PowerPoint PPT Presentation

de detec ng and ec ng and an analyzing urb urban an
SMART_READER_LITE
LIVE PREVIEW

De Detec%ng and ec%ng and An Analyzing Urb Urban an Regions - - PowerPoint PPT Presentation

De Detec%ng and ec%ng and An Analyzing Urb Urban an Regions with High Imp mpact of Weather Ch Change on e on T Transpor ort Ye Ding, Yanhua Li, Ke Deng, Haoyu Tan, Mingxuan Yuan, Lionel M. Ni Presenta;on by Karan Somaiah Napanda,


slide-1
SLIDE 1

De Detec%ng and ec%ng and An Analyzing Urb Urban an Regions with High Imp mpact of Weather Ch Change on e on T Transpor

  • rt

Ye Ding, Yanhua Li, Ke Deng, Haoyu Tan, Mingxuan Yuan, Lionel M. Ni

Presenta;on by Karan Somaiah Napanda, Suchithra Balakrishnan, Zhaoning Su

slide-2
SLIDE 2

URBAN comp mpu%ng

connects urban sensing, data ma

manageme ment, data analy%c and service providing

*Urban compu;ng with taxis, MSRA *A Cloud-Based Knowledge Discovery System for Monitoring Fine-Grained Air Quality, MSRA

slide-3
SLIDE 3

URBAN comp mpu%ng

connects urban sensing, data ma

manageme ment, data analy%c and service pr providing viding

  • infer the real-;me and fine grained air quality

informa;on throughout a city

  • iden;fy the hot spots of moving vehicles in an

urban area

  • propose a framework, called DRoF, to discover

regions of different func;ons in a city

  • try to sense the refueling behavior and

citywide petrol consump;on in real-;me…

  • Focus
  • Traffic conges;on
  • Energy consump;on
  • Pollu;on
  • Based on data
  • traffic flow
  • human mobility
  • geographical data
slide-4
SLIDE 4

The imp mpact of incleme ment weather to traffic ffic

  • May slow down the traffic
  • May cause conges;ons due

to low visibility and high demand of

  • May influence the transport

performance,

slide-5
SLIDE 5

the imp mpact of incleme ment weather to traffic ffic

*Changes in Climate and Weather Relevant on US Transport, “The impact of climate change and weather on transport: An overview of empirical fin”

slide-6
SLIDE 6

Mo Mo%v %va%o a%on n

how can we identify those regions being highly influenced by weather change on transport?

*Google Opera;ng System *IBM Smarter Traveler Traffic Predic;on So^ware

slide-7
SLIDE 7

Ch Challen enges es

lacking of effec%ve traffic monitoring system in city-wide scale

  • To enable weather-traffic index

throughout a city and factor analysis

  • extract traffic informa;on from

numerous taxis driving on roads due to its availability, wide-coverage and low-cost.

  • A taxi tracking system con;nuously

record the informa;on including loca;on, speed, occupancy status, and orienta;on of the taxis

slide-8
SLIDE 8

Vo Voronoi d diagr iagram am

Ch Challen enges es lacking of effec%ve traffic monitoring system in city-wide scale

slide-9
SLIDE 9

Ch Challen enges es lacking of effec%ve traffic monitoring system in city-wide scale

Voronoi diagram

*The Voronoi diagrams par;;ons in Shanghai. The under layer represents the road networks.

Equal-sized rectangles Ø some cells are highly dense and in others are highly sparse Ø seeds are the road intersec;ons Ø every cell include at least one road intersec;on and a number of roads connected to this intersec;on Ø if several road intersec;ons are very close to each other, for example within 50 meters, they are grouped together as a complex intersec;on

slide-10
SLIDE 10

Qu Ques%ons

  • The average of taxi can reflect the traffic

informa%on? In the area of residence community, everyone have their own car so they don’t need the taxi

  • By calcula%ng the average driving speeds of all taxis

in each Voronoi cell can reflect the average speed? However maybe some cells have a lower average speed of taxis is just because people are likely get on

  • r get off from the taxis here
slide-11
SLIDE 11

Ch Challen enges es

how to disclose the key factors behind the weather-traffic index

density of roads number of road intersec;ons number of POIs(points of interest) traffic volume average age of the household density of buildings and more in the surrounding regions

slide-12
SLIDE 12

Th The Goa e Goal of

  • f t

this p paper er

To develope weather-traffic index (WTI) system

The first is to set up a weather-traffic index throughout a city, which indicates the impact of weather to traffic from light to heavy. The second is to reveal the key factors behind the weather-traffic index throughout the city and their rela;ve weights. Previous works mainly focus on the analysis of road segments; on the contrary, this paper is the first study on local traffic-weather sensi;vity throughout a city and the inves;ga;on to reveal the key factors behind the sensi;vity

slide-13
SLIDE 13

OVERVIEW

  • 1. Data Prepara;on
  • 2. Weather-traffic

Index Establishment

  • 3. Factor Analysis
slide-14
SLIDE 14
  • 1. DATA PREPARATION
  • i. Regional Par;;oning

Voronoi Diagram

  • ii. Source Data

Road Network Traffic Data Regional features Weather Report Data

slide-15
SLIDE 15

DATA PREPARATION - Regional Par%%oning

VORONOI DIAGRAM Par;;oning of a plane into cells/ regions based on the distance between seeds Shape and size of each cells is different from each other SEEDS- road intersec;ons ROAD- INTERSECTION- ORIENTED Par;;oning Proper;es

  • Even distribu;on of road networks
  • Potray the rela;on of weather and

traffic

slide-16
SLIDE 16

DATA PREPARATION – Source Data

i. Road Networks - G(V, E) E- set of road segments V- set of road intersec;ons E- type, length, speed limit, two end points V- loca;on (la;tude and longitude), type i. Road Networks ii. Traffic Data

  • iii. Regional Features
  • iv. Weather Report Data
slide-17
SLIDE 17

DATA PREPARATION – Source Data

ii. Traffic Data Traffic Parameters of interest are extracted

  • Average Speed
  • Quan;ty measures
  • Quality assessment measures
  • Movement measures
  • Composi;on/ Classifica;on measures

Time Mean Speed/ Average Speed- Traffic Parameter Arithme;c mean of individual spot speeds that are recorded over a selected ;me period Extracted from taxi trajectories in each cell Split into 7 classes

  • <10 km/h
  • 10- 30 km/h
  • 30- 50 km/h
  • 50- 70 km/h
  • 70- 90 km/h
  • 90- 110 km/h
  • >110 km/h

The average speed of one road segment is subject to the traffic parameter of that road segment only, not comparable with other road segments

slide-18
SLIDE 18

DATA PREPARATION – Source Data

  • iii. Weather Report Data
  • State of atmosphere
  • Degree to which it is hot or cold, wet
  • r dry, calm or stormy, clear or cloudy
  • iv. Regional Features

Four Categories a) Points of Interest b) Structure c) Density d) Community

slide-19
SLIDE 19
  • 2. WEATHER TRAFFIC INDEX ESTABLISHMENT

Input – Weather data and Traffic data Indicates the impact of weather to traffic in different cells Given a cell g, its value in the weather traffic index is the correla;on between traffic and weather, denoted as ρ(g). ρ(g) takes a value from the discrete range [1, 2, 3, 4, 5]

slide-20
SLIDE 20

WEATHER TRAFFIC INDEX ESTABLISHMENT Correla%on Detec%on

Correla;on between traffic speed Ft and weather Fw. Classifier is trained with Ft and Fw Input- Weather as a feature vector Output- one of the seven speed classes Inference accuracy Correla;on between Ft and Fw in that cell Inference accuracy Correla;on between Ft and Fw in that cell Weakness of this method, Does not consider other reasons that affect traffic

slide-21
SLIDE 21

Granger Causality test whether one ;me series is useful in forecas;ng another A ;me series X is said to Granger-cause Y if it can be shown that those X values provide sta;s;cally significant informa;on about future values of Y. A variable X that evolves over ;me Granger-cause another evolving variable Y if predic;ons of the value of Y based on its own past values and on the past values of X are beuer than predic;ons of Y based only on its own past values.

WEATHER TRAFFIC INDEX ESTABLISHMENT Correla%on Detec%on

slide-22
SLIDE 22
  • Traffic Predic;on models are trained separately

for different ;me

  • Weather-traffic index value ρ(g) is assigned to

each cell to indicate the extent to which traffic is impacted by weather

  • Cells are organised in ascending order of traffic

predic;on accuracy improvement and then divided into k- equal sized subsets

  • k- quan;les show the correla;on between traffic

and weather from weak to strong

WEATHER TRAFFIC INDEX ESTABLISHMENT Correla%on Detec%on

slide-23
SLIDE 23
  • Classifica;on Problem
  • Methods used

Support Vector Machine (SVM) Logis;c Regression (LOGIT) Perceptron

  • 10-fold cross valida;on
  • SVM is applied and LOGIT and Perceptron is used

to verify the output of SVM

WEATHER TRAFFIC INDEX ESTABLISHMENT Traffic Predic%on

slide-24
SLIDE 24
  • 3. FACTOR ANALYSIS

AssumpRon – weather traffic index of all cells have been certainly assigned This method iden;fies the key factors and their weights contribu;ng to the weather- traffic indices of cells. Discloses what regional features make the traffic in some cells vulnerable to inclement weather Steps:

  • 1. Key Factor Verifica;on by Index Inference (KFVII)
  • 2. Weight Es;ma;on of Regional Features
slide-25
SLIDE 25
  • Intui;on
  • Weather- traffic index of one region can be inferred from the indices of its closely

located (or adjacent) cells

  • Given a set of regional features Fr, if the inference accuracy is sa;sfactory with Fr as

input, it indicates that such set of regional features are the key features

  • This model is not symmetric
  • Naïve Bayes classifier

FACTOR ANALYSIS Key Factor Verifica%on by Index Inference (KFVII)

slide-26
SLIDE 26

Marginal DistribuRon Probability distribu;on of the regions contained in a similarity subset Probability of one region being the index of i given one of its adjacent regions with index j, if two regions have a certain similarity Cosine Similarity is used to describe the similarity muv between two regions gu and gv Similarity ranges of k- groups b0 is minimum similarity Bk is maximum similarity

FACTOR ANALYSIS Key Factor Verifica%on by Index Inference (KFVII)

slide-27
SLIDE 27

Pairs of cells in the group of [bi-1, bi] are in marginal distribu;on matrix Bi Rows of Bi- weather traffic indices of gu Columns of Bi- weather traffic indices of gv pij- probability that the weather-traffic index of gv is ρj when the weather- traffic index of gu is ρi

FACTOR ANALYSIS Key Factor Verifica%on by Index Inference (KFVII)

slide-28
SLIDE 28

Index- Index Inference From marginal distribu;on, weather traffic index of a par;cular cell is inferred from adjacent cell using naïve Bayes classifier. Given a cell gu, the marginal distribu;on allows naïve Bayes classifier to infer which value the weather- traffic index of gu is most likely to be, based on the weather- traffic indices of its adjacent cells ρ(g1), ρ(g2),…

FACTOR ANALYSIS Key Factor Verifica%on by Index Inference (KFVII)

slide-29
SLIDE 29

Consider a regional feature with nontrivial impact to weather-traffic index,

  • 1. Remove this regional features
  • 2. Using KFVII method test if remaining features have high
  • verall accuracy
  • 3. If not, then removed regional feature is very important
  • 4. δ(Fi

r) weight of the regional feature Fi r by compu;ng the

difference of the inference accuracy with and without Fi

r

FACTOR ANALYSIS Weight Es%ma%on of Regional Features

slide-30
SLIDE 30

EMPIRICA EMPIRICAL S L STUD UDY Y

Data Sets

  • Road Network Data of Shanghai
  • Taxi Trajectory Data
  • Weather Report Data (of the same period of ;me as taxi trajectory

data)

  • Regional Informa;on Data
slide-31
SLIDE 31

More about the Data

Road Network Data

  • Obtained from the government
  • 7 levels of roads:

§ Na;onal Expressway § City Expressway § Regular Highway § Large Avenue § Primary Way § Secondary Way § Regular Road Weather Report Data

  • Obtained from Weather

Underground (wunderground.com)

  • Includes 14 weather features
  • For data alignment, recorded during

the same ;me frame as taxi trajectory.

Major Roads Minor Roads

slide-32
SLIDE 32

More about the Data

Trajectory Data

  • Spa;al-Temporal points
  • Provides addi;onal informa;on

including average speed. Regional Informa;on Data

  • Real Estate Data & POI (Points of

Interest) data.

  • Real estate (soufun.com) provides

informa;on about loca;on, price and residen;al communi;es.

  • Dianping.com provides local

merchant informa;on and other aurac;ons around the locality

slide-33
SLIDE 33

Weather – Traffic Index

What is the WTI

  • Average speed is inferred with/without weather.
  • Difference indicates sensi;vity of the cell at that ;me slot.

Robustness of the model:

  • Accuracy of traffic predic;on is test with 3 models
  • SVM gives 0.5
  • Perceptron and Logis;c regression give low accuracy, hence they will be

used to test the robustness of WTI (with/without weather).

  • A Posi;ve value indicates high impact or weather and a nega;ve value

indicates low impact.

slide-34
SLIDE 34

The above diagram shows most

  • f the cells have an average

accuracy of 0.5 for SVM. The above diagram gives changes in the predic;on accuracy which is the least for

  • SVM. Thus, SVM can be used as

the base model and Logis;c Regression and Perceptron are used to test. The changes in predic;on accuracy are close to mean for SVM and Logis;c Regression and it’s a regular distribu;on where as for Perceptron is greater.

Robustness Evalua;on

slide-35
SLIDE 35

Valida%on

  • The five labelled regions indicate high impact
  • f weather on traffic.
  • Most of them are tourist aurac;ons.
  • Not all tourist aurac;ons have high impact due

to inclement weather. eg: Xin;andi is not influenced by weather

slide-36
SLIDE 36

Valida%on

  • The rain aspect has been separated to show

that it has significant impact on traffic.

  • Example area 1: It shows that weather and

rain both have significant impact on traffic.

  • Rain label gives more informa;on about

rain affec;ng traffic than weather affec;ng traffic.

  • Example area 2: Average speed is more on

rainy days.

slide-37
SLIDE 37

Effec%veness

  • Effec;veness verifies the regional features and es;mates their weights.
  • Reciprocal method is used to evaluate
  • It is considerably a fairer method than maximum likelihood since it widens the

gap between different likelihoods.

  • For this, Naive Bayes, Ar;ficial Neural Networks and Random Guess is used.
  • Bayes performs the best, followed by ANN and then finally Random Guess.
slide-38
SLIDE 38

Factor Analysis

  • Four categories of regional features are tested:

POI, structure, density and community.

  • Figure shows that community regional features

are more important than structure regional features.

  • House Age & Number of Neighboring Cells

are important factors.

slide-39
SLIDE 39

Conclusion

  • This paper fills the gap in the study on the impact of weather on

traffic.

  • The effec;veness of the system is verified.
  • Regional factor analysis has significant impact:

Regional House Age is an important factor

  • The factors retrieved from this study can be applied to other ci;es to

generate same knowledge.

  • Further developments are expected since research in this field is

always emerging.

slide-40
SLIDE 40

Ques%ons?