Forecasting bus ridership with trip planner usage data
Acknowledgement:
- Dr. Chintan Amrit (UTwente)
- Dr. Engin Topan (UTwente)
- Dr. Niels van Oort ( Smart
Public Transport Lab)
1
a machine learning application
Jop van Roosmalen
trip planner usage data a machine learning application - - PowerPoint PPT Presentation
Forecasting bus ridership with trip planner usage data a machine learning application Acknowledgement: Jop van Roosmalen Dr. Chintan Amrit (UTwente) Dr. Engin Topan (UTwente) Dr. Niels van Oort ( Smart Public Transport Lab) 1 9292 Trip
Acknowledgement:
Public Transport Lab)
1
Jop van Roosmalen
1 2
2
Objective
3
Models
π‘π’ππ = πππ‘π‘πππππ π‘π’ππβ1 + πΆπππ πππππ‘π’ππ β π΅πππβπ’ππππ‘π’ππ = Οπ=0 π‘
πΆπ β Οπ=0
π‘
π΅π Machine learning
Comparison with simple rules 1. Predicted number equals number last week 2. Predicted number equals historical average
4
Undersampling using stratified K-fold
5
Performance metrics
1 π Οπ=1 π (π§π β ΰ·
π§π)2
Ο(π§πβ ΰ· π§π)2 Ο(π§πβ ΰ΄€ π§π)2
6
Legend
Number of habitants
Scope
and Drenthe
Groningen City
two smaller holidays
7
Structure
Trip planner Smart card AVL data
16:50 - 17:31 - 16:56 - 17:13 - 17:18 - 17:20 - 17:27 -
+ 1
Journey question Journey parts 8 17:20 - 17:27 - Smart card trips 17:20 - 17:27 - Planned + recorded
All on vehicle level
11,447,562 11,694,849 6,814,907 4,946 stops
Merging trip planner with bus data
Metric: Difference boarding times + difference alighting times
Line 1 Trip 1001 Trip planner: Stop A to B at boarding to alighting time with line 1
Boarding Alighting
Line 1 Trip 1003 Line 2 Trip 1041 Line 3 Trip 1013
Time
9
Exploratory data analysis
10
Exploratory data analysis
11
Data selection
Forecasting demand for trips of line configuration g554-1-0 on workdays around 8 AM
(56 line configurations, 4173 trips and 138,694 records)
(83 line configuration, 51,471 trips and 1,523,115 records)
(1 line configuration, 2275 trips and 97,825 records)
(1 line configuration, 239 trips and 10,277 records)
12
Line configuration g554-1-0
13
central Station to Hospital
RMSE
14
Boarding Alighting Passenger MLR DT RF NN SVR Last week Historical avg
RMSE Passengers
15
Passenger prediction example
16
Percentage correct maximum passenger count predictions
β₯ β€
17
Random Forests
Limitations
18
Research question
Can one forecast short-term ridership of buses using data containing the consulted travel advices from a widely used trip planner for public transport and what accuracy can one achieve in different scenarios?
19
Recommendations Practice
analysis
number, operation date and stop
Research
20 Training data:
Models:
(bias/flexible) Performance metric:
Features:
Forecasting performance
jop.j@hotmail.com linkedin.com/in/jop-van-roosmalen/ nielsvanoort.weblog.tudelft.nl essay.utwente.nl/77590/
21
Slides Thesis