Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information
Xianyuan Zhan* Satish V. Ukkusuri*
*Civil Engineering, Purdue University
24/04/2014
Partial Information Xianyuan Zhan * Satish V. Ukkusuri * * Civil - - PowerPoint PPT Presentation
Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan * Satish V. Ukkusuri * * Civil Engineering, Purdue University 24/04/2014 Introduction Study region Base model Probabilistic model Numerical
*Civil Engineering, Purdue University
24/04/2014
Introduction Study region Base model Probabilistic model Numerical results Conclusion
2
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Limousine Commission (NYTLC)
Introduction Study region Base model Probabilistic model Numerical results Conclusion
3
− 450,000 to 550,000 daily trip records − More than 180 million taxi trips a year − Providing a lot of opportunities!
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
4
Trip Origin Trip Destination
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
− Loop detector data − Automatic Vehicle Identification tags − Video camera data − Remote microwave traffic sensors
− Novel large-scale data sources − Ideal probes monitoring traffic condition − Large coverage − Do not need fixed sensors − Cheap!
Introduction Study region Base model Probabilistic model Numerical results Conclusion
5
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
− Contains only OD coordinate, trip travel time and distance, etc. − Path information not available − Large-scale data with partial information The problem
− Map data to the network − Path inference − Estimate link travel time based on OD data
Introduction Study region Base model Probabilistic model Numerical results Conclusion
6
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
7
MPE 2013+ MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
8
− 193 nodes − 381 directed links
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
9
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
200 400 600 800 1000 1200 Frequency
Histogram for day 1
100 200 300 400 500 600 Frequency
Histogram for day 6
Introduction Study region Base model Probabilistic model Numerical results Conclusion
10
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
* Zhan, X., Hasan, S., Ukkusuri, S. V., & Kamga, C. (2013). Urban link travel time estimation using large-scale taxi data with partial information.Transportation Research Part C: Emerging Technologies, 33, 37-49.
Introduction Study region Base model Probabilistic model Numerical results Conclusion
11
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
12
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
* Y. Yen, Finding the K shortest loopless paths in a network, Management Science 17:712–716, 1971.
Introduction Study region Base model Probabilistic model Numerical results Conclusion
13
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
− Each driver wants to minimize both trip time and distance to make more trips thus make more revenue
𝑄
𝑛
𝑢, 𝑒, 𝜄 = 𝑓−𝜄𝐷𝑛
𝑢,𝑒𝑛
𝑘∈𝑆𝑗 𝑓−𝜄𝐷𝑘
𝑢,𝑒𝑘
𝐷𝑛 𝑢, 𝑒𝑛 = 𝛾1 ∙ 𝑛 𝑢 + 𝛾2 ∙ 𝑒𝑛 𝑛 𝑢 = 𝛽1𝑢𝑃 + 𝛽2𝑢𝐸 +
𝑚∈𝑀
𝜀𝑛𝑚𝑢𝑚
Introduction Study region Base model Probabilistic model Numerical results Conclusion
14
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
𝑗|𝑆𝑗 ) and
𝑗 path travel times
𝐹 𝑍
𝑗|𝑆𝑗 = 𝑛∈𝑆𝑗
𝑛( 𝑢)𝑄
𝑛
𝑢, 𝑒, 𝜄 𝑢 = arg min
𝑢 𝑗∈𝐸
𝑗|𝑆𝑗 2
Introduction Study region Base model Probabilistic model Numerical results Conclusion
15
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
16
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
2)
𝑄 𝑧𝑗|𝑙, 𝒚 = 𝑄 𝑧𝑗|𝑙, 𝝂, 𝜯 = 𝑂 𝛽1𝜈0 + 𝛽2𝜈𝐸 +
𝑚∈𝑙
𝜈𝑚 , 𝛽1𝜏𝑃 2 + 𝛽2𝜏𝐸 2 +
𝑚∈𝑙
𝜏𝑚
2
𝜌𝑙
𝑗 𝝂, 𝜸, 𝑒𝑗 =
exp −𝐷𝑙
𝑗 𝝂, 𝜸, 𝑒𝑗
𝑡∈𝑆𝑗 exp −𝐷𝑡
𝑗 𝝂, 𝜸, 𝑒𝑗
Introduction Study region Base model Probabilistic model Numerical results Conclusion
17
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
𝐼 𝒛|𝝂, 𝜯, 𝑬 =
𝑗=1 𝑜 𝑙∈𝑆𝑗
𝜌𝑙
𝑗 𝝂, 𝜸, 𝑒𝑗 𝑄 𝑧𝑗|𝑙, 𝝂, 𝜯
𝑗 as the latent variable
Plate notation
Introduction Study region Base model Probabilistic model Numerical results Conclusion
18
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
− Prior on 𝝂: − Priors on 𝝂 and variance 𝜯
𝐼 𝒛|𝝂, 𝜯, 𝑬 =
𝑗=1 𝑜 𝑙∈𝑆𝑗
𝜌𝑙
𝑗 𝝂, 𝜸, 𝑒𝑗 𝑄 𝑧𝑗|𝑙, 𝝂, 𝜯 ∙ 𝑘∈𝑀
𝑞 𝜈𝑘 𝐼 𝒛|𝝂, 𝜯, 𝑬 =
𝑗=1 𝑜 𝑙∈𝑆𝑗
𝜌𝑙
𝑗 𝝂, 𝜸, 𝑒𝑗 𝑄 𝑧𝑗|𝑙, 𝝂, 𝜯 ∙ 𝑘∈𝑀
𝑞 𝜈𝑘 𝑞 𝜏
𝑘 2
Introduction Study region Base model Probabilistic model Numerical results Conclusion
19
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
− E-step:
𝔽 𝑨𝑙
𝑗
= 𝑨𝑙
𝑗 𝑨𝑙
𝑗 𝜌𝑙 𝑗 𝝂, 𝜸, 𝑒𝑗 𝑄 𝑧𝑗|𝑙, 𝝂, 𝜯 𝑨𝑙
𝑗
𝑨𝑙
𝑗 𝑡∈𝑆𝑗 𝜌𝑡
𝑗 𝝂, 𝜸, 𝑒𝑗 𝑄 𝑧𝑗|𝑡, 𝝂, 𝜯 𝑨𝑡
𝑗 = 𝛿 𝑨𝑙
𝑗
− M-step: Let 𝜐𝑚 = 𝜏𝑚
2, 𝝊 = 𝜯,
𝑅 𝝂, 𝛖 = 𝔽𝒜 ln 𝑄 𝒛, 𝒜|𝝂, 𝛖 =
𝑗=1 𝑜 𝑙∈𝑆𝑗
𝛿 𝑨𝑙
𝑗
ln 𝜌𝑙
𝑗 𝝂, 𝜸, 𝑒𝑗 + ln 𝑄 𝑧𝑗|𝑙, 𝝂, 𝛖
𝝂𝑜𝑓𝑥, 𝝊𝑜𝑓𝑥 = 𝑏𝑠 𝑛𝑏𝑦
𝝂,𝛖
𝑅 𝝂, 𝛖
Introduction Study region Base model Probabilistic model Numerical results Conclusion
20
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
− Solve for large-scale data input − Solve for large network − Short term link travel time estimation (say 15min)
− Alternating Direction Method of Multiplier (ADMM) to decouple the problem into smaller sub-problems − Solve decomposed sub-problems in parallel − Deals with large size of network and data − Faster model estimation
Introduction Study region Base model Probabilistic model Numerical results Conclusion
21
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
𝑗=1 𝑜
𝑄𝑠 − 𝑈𝑗 𝑃𝑐 2
MAPE = 1 𝑜
𝑗=1 𝑜
𝑈𝑗
𝑄𝑠 − 𝑈𝑗 𝑃𝑐
𝑈𝑗
𝑃𝑐
× 100%
Introduction Study region Base model Probabilistic model Numerical results Conclusion
22
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Day Error Time Period 9:00-10:00 13:00-14:00 19:00-20:00 21:00-22:00 Monday RMSE (min) 2.614 1.981 1.937 1.372 MAPE 29.51% 24.22% 26.27% 21.87% Tuesday RMSE (min) 2.461 2.302 1.827 1.437 MAPE 29.63% 25.59% 23.33% 22.20% Wednesday RMSE (min) 3.827* 3.216* 2.18 1.691 MAPE 41.32%* 34.97%* 28.73% 24.40% Thursday RMSE (min) 2.468 2.699 2.49 1.382 MAPE 27.28% 27.92% 28.54% 21.05% Friday RMSE (min) 2.26 2.179 1.692 1.334 MAPE 27.76% 27.04% 25.17% 22.26% Saturday RMSE (min) 1.034 1.69 1.839 1.584 MAPE 16.84% 24.58% 27.14% 21.61% Sunday RMSE (min) 2.041 1.518 1.395 1.16 MAPE 25.44% 23.70% 22.72% 19.87% * Traffic disturbance caused by Patrick's Day Parade.
Introduction Study region Base model Probabilistic model Numerical results Conclusion
23
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
24
MPE 2013+ Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information Xianyuan Zhan
Introduction Study region Base model Probabilistic model Numerical results Conclusion
25