Query-Efficient Imitation Learning for End-to-End Simulated Driving
Jiakai Zhang, Kyunghyun Cho New York University
for End-to-End Simulated Driving Jiakai Zhang, Kyunghyun Cho New - - PowerPoint PPT Presentation
Query-Efficient Imitation Learning for End-to-End Simulated Driving Jiakai Zhang, Kyunghyun Cho New York University Overview Introduction End-to-end learning for self-driving Related work Learning method Convolutional
Jiakai Zhang, Kyunghyun Cho New York University
Steering Brake
Dataset πΈ0 Policy π1
Policy ππ = πΎππβ + (1 β πΎπ) ππ Dataset πΈπ = πΈβ² βͺ πΈπβ1 Dataset πΈβ² Policy ππ
Best policy ππ
Dataset πΈ0 Policy π1
Dataset πΈπ = πΈβ² βͺ πΈπβ1 Dataset πΈβ² not safe Policy ππ
Best policy ππ Safety classifier π1 Policy ππ = πΎππβ + (1 β πΎπ) ππ Safety classifier ππ Safety classifier π1 Safety classifier ππ
Training tracks Test tracks
Input image β 3x160x72 Convolutional layer β 64x3x3 Max Pooling β 2x2 Convolutional layer β 128x5x5 Fully connected layer Control signals Environment variables x 4 x 2
Feature map Fully connected layer Safety value x 2
Safe Frames Unsafe Frames
Dashed curve β with traffic Solid curve β without traffic
# of Dagger Iterations MSE (Steering Angle)
Dashed curve β with traffic Solid curve β without traffic
# of Dagger Iterations Damage per Lap
Dashed curve β with traffic Solid curve β without traffic
# of Dagger Iterations
Dashed curve β with traffic Solid curve β without traffic
# of Dagger Iterations % of csafe = 0