Real-time Lane Configuration with Coordinated Reinforcement Learning - - PowerPoint PPT Presentation

real time lane configuration with coordinated
SMART_READER_LITE
LIVE PREVIEW

Real-time Lane Configuration with Coordinated Reinforcement Learning - - PowerPoint PPT Presentation

Real-time Lane Configuration with Coordinated Reinforcement Learning Presenter: Udesh Gunarathna Authors: Udesh Gunarathna, Hairuo Xie, Egemen Tanin, Shanika Karunasekara, Renata Borovica-Gajic University of Melbourne Udesh Gunarathna


slide-1
SLIDE 1

Real-time Lane Configuration with Coordinated Reinforcement Learning

Presenter: Udesh Gunarathna

Authors: Udesh Gunarathna, Hairuo Xie, Egemen Tanin, Shanika Karunasekara, Renata Borovica-Gajic University of Melbourne

Udesh Gunarathna Real-time Lane Configuration 1 / 12

slide-2
SLIDE 2

How Often Have You Stuck In Traffic Like This?

Figure: Directionally imbalanced traffic. Congested traffic in one direction and

  • ppose direction having less traffic.1

1https://i.dailymail.co.uk Udesh Gunarathna Real-time Lane Configuration 2 / 12

slide-3
SLIDE 3

Real-time Lane-direction Configuration with Connected Autonomous Vehicles

What is real-time lane-direction configuration? Changing the travelling direction of lanes in road segments based on real-time traffic information in short time intervals. Why consider this problem now? Capabilities of Connected Autonomous Vehicles!

Udesh Gunarathna Real-time Lane Configuration 3 / 12

slide-4
SLIDE 4

Difficult to Compute?

Yes! Lane-direction change in one road segment may affect traffic flow in neighboring road segments.

A B C

Figure: Three road segments A, B, C with different lane-configurations.

What makes lane configuration computation difficult in real-time? Computation needs to be lightweight

Udesh Gunarathna Real-time Lane Configuration 4 / 12

slide-5
SLIDE 5

Proposed Architecture: Coordinated Learning-based Lane Allocation

We propose an efficient multi-agent, scalable solution.

RL RL RL RL

C C C

Lane-directions proposed by RL Agents Coordinated lane-directions by Coordinatiing Agents RL Agents Coordinating Agents Road network

Figure: Architecture of CLLA consists of RL Agents that operate at the intersection level and Coordinating Agents who evaluate the global impact of local lane-direction changes.

Udesh Gunarathna Real-time Lane Configuration 5 / 12
slide-6
SLIDE 6

Why Existing Methods Fail?

Existing approaches use mathematical programming to compute lane-direction allocation based on pre-known traffic patterns. Why existing methods cannot compute real-time lane-direction allocations?

Inability to work with real-time data Computation cost is very high Microscopic simulation vs Macroscopic simulation gap

Udesh Gunarathna Real-time Lane Configuration 6 / 12

slide-7
SLIDE 7

Why Multi-agent Reinforcement Learning?

Why reinforcement learning?

Real-time control Lack of lane-changing traffic models

Why not a single reinforcement learning agent?

Exponential growth of state-space Difficulty of learning

Coordination is the key!

Network level impact of changes needs to be considered Distributed RL Agents’ action may conflict with each other

Udesh Gunarathna Real-time Lane Configuration 7 / 12

slide-8
SLIDE 8

Coordinated Learning-based Lane Allocation

C C C Bottom Layer Upper Layer

Traffic information Coordinated Lane-directions by Coordinating Agents Coordinating Agents Road network

RL RL RL RL RL RL RL RL RL

Area contolled by a RL Agent Aggregated traffic information Lane-direction changes proposed by RL Agents RL Agents

Figure: Architecture of CLLA consists of RL Agents that operate at the intersection level and Coordinating Agents that evaluate the global impact of local lane-direction changes.

Udesh Gunarathna Real-time Lane Configuration 8 / 12
slide-9
SLIDE 9

CLLA Algorithm

I

Increase/ Decrease Incoming Vechicles Outgoing Vehicles Pre-trained RL Agents

Global Impact Evaluation

Clear LLC Reset t

At every time step After every ta time steps

LLC : List of changes Vechile Paths Queue lengths Current lane-configurations Coordinating Agent CLC : Coordinated changes

slide-10
SLIDE 10

Global Impact Evaluation : Complexity

Complexity O(m × n)

m : number of proposed changes from RL Agents n : Number of neighbors per road segment

n does not increase with the network size O(m × n) → O(m) Worst case: O(|E|), |E|: total number of road segments Distributed Version A distributed version can reduce the complexity further with a communication layer.

Udesh Gunarathna Real-time Lane Configuration 10 / 12

slide-11
SLIDE 11

Results from Manhattan Road Network

Simulated using SMARTS [1], a microscopic simulator Using one hour of New York taxi data on Manhattan road network Baseline Travel Time(s) % of Vehicles with DFFT>6 no-LA 604.32 45.9 LLA 585.83 48.6 DLA 496.12 50.7 CLLA 471.28 45.87

Table: Performance of baselines evaluated using New York taxi data. noLA is a baseline

with no lane-direction allocations, LLA is similar to CLLA, without the upper-layer coordination and DLA is a baseline algorithm which allocates lane-directions based on aggregated traffic demand.

Udesh Gunarathna Real-time Lane Configuration 11 / 12

slide-12
SLIDE 12

Thank you Q & A

Udesh Gunarathna Real-time Lane Configuration 12 / 12