SLIDE 1
Towards a Virtual Stuntman Xue Bin (Jason) Peng UC Berkeley - - PowerPoint PPT Presentation
Towards a Virtual Stuntman Xue Bin (Jason) Peng UC Berkeley - - PowerPoint PPT Presentation
Towards a Virtual Stuntman Xue Bin (Jason) Peng UC Berkeley Animation Animation Computer Animation [Geijtenbeek et al. 2013] [Brown et al. 2013] [Ju et al. 2013] [Tan et al. 2014] [Kwon and Hodgins 2017] [Peng et al. 2018] Physics-Based
SLIDE 2
SLIDE 3
Animation
SLIDE 4
Computer Animation
[Ju et al. 2013] [Brown et al. 2013] [Tan et al. 2014] [Geijtenbeek et al. 2013] [Kwon and Hodgins 2017] [Peng et al. 2018]
SLIDE 5
Physics-Based Animation
[Coros et al. 2011]
SLIDE 6
Deep RL
[Schulman et al. 2016] [Mnih et al. 2015] [Chebotar et al. 2017]
SLIDE 7
Motion Quality
[Heess et al. 2017] [Schulman et al. 2016]
SLIDE 8
Motivation
SLIDE 9
Which is Mocap?
SLIDE 10
Simulation Mocap
Which is Mocap?
SLIDE 11
Overview
+ +
Character Reference Motion Task: Hit Target
SLIDE 12
Overview
SLIDE 13
Overview
SLIDE 14
Overview
SLIDE 15
Reference Motion
SLIDE 16
Reference Motion
SLIDE 17
Reference Motion
SLIDE 18
Reference Motion
SLIDE 19
Reference Motion
SLIDE 20
Reference Motion
SLIDE 21
Reference Motion
𝑏0 𝑏1 𝑏2 𝑏3 𝑏4
SLIDE 22
State + Action
State:
- link positions
- link velocities
SLIDE 23
State + Action
State:
- link positions
- link velocities
SLIDE 24
State + Action
State:
- link positions
- link velocities
Action:
- PD targets
SLIDE 25
State + Action
State:
- link positions
- link velocities
Action:
- PD targets
SLIDE 26
Reward
SLIDE 27
Reward
Imitation Objective
SLIDE 28
Reward
Imitation Objective
SLIDE 29
Reward
Imitation Objective
SLIDE 30
Reward
Imitation Objective
SLIDE 31
Reward
Imitation Objective Task Objective
SLIDE 32
Proximal Policy Optimization (PPO)
[Schulman et al. 2017]
SLIDE 33
Proximal Policy Optimization (PPO)
[Schulman et al. 2017]
SLIDE 34
Proximal Policy Optimization (PPO)
[Schulman et al. 2017]
SLIDE 35
Proximal Policy Optimization (PPO)
[Schulman et al. 2017]
SLIDE 36
Proximal Policy Optimization (PPO)
[Schulman et al. 2017]
SLIDE 37
Humanoid: Walk
SLIDE 38
Humanoid: Run
SLIDE 39
Comparison
[Merel et al. 2017] Ours
SLIDE 40
No Reference Motion
SLIDE 41
Locomotion
SLIDE 42
Humanoid: Cartwheel
SLIDE 43
Humanoid: Backflip
SLIDE 44
Humanoid: Frontflip
SLIDE 45
Humanoid: Roll
SLIDE 46
Humanoid: Crawl
SLIDE 47
Humanoid: Dance A
SLIDE 48
Humanoid: Kip-Up
SLIDE 49
Humanoid: Headspin
SLIDE 50
Humanoid: Vault 1-Handed
SLIDE 51
Humanoid: Flare
SLIDE 52
20+ Skills
SLIDE 53
Keyframe Animation
SLIDE 54
T-Rex: Walk
Simulated Character
SLIDE 55
Dragon: Walk
Simulated Character
SLIDE 56
Lion: Run
Simulated Character
SLIDE 57
Lion++
SLIDE 58
Tasks
SLIDE 59
Tasks
Reference Motion
SLIDE 60
Tasks
Reference Motion
+
Task
SLIDE 61
Humanoid: Spinkick - Strike
SLIDE 62
Humanoid: Baseball Pitch - Throw
SLIDE 63
No Reference Motion
SLIDE 64
Humanoid: Balance Beam
SLIDE 65
Humanoid: Run – Dense Gaps
SLIDE 66
Retargeting
SLIDE 67
Character Retargeting
+
Reference Motion Atlas
SLIDE 68
Atlas: Spinkick
SLIDE 69
Atlas: Run
SLIDE 70
Atlas: Getup-Facedown
SLIDE 71
Atlas: Backflip
SLIDE 72
Multi-Clip Integration
SLIDE 73
Multi-Clip Integration
SLIDE 74
Multi-Clip Integration
SLIDE 75
Learning from Mocap
SLIDE 76
Mocap is a Hassle
[Holden 2018]
SLIDE 77
Skills From Videos
SLIDE 78
Simulation
Learning from Videos
Video Simulation
SLIDE 79
Overview
SLIDE 80
Overview
SLIDE 81
Overview
SLIDE 82
Overview
SLIDE 83
Overview
SLIDE 84
Pose Estimation
SLIDE 85
Pose Estimation
Pose Prediction Video: Handspring A
SLIDE 86
Human Mesh Recovery (HMR)
[Kanazawa et al., 2018]
SLIDE 87
Pose Estimation
Pose Prediction Video: Backflip A
SLIDE 88
Motion Imitation
SLIDE 89
Motion Imitation
SLIDE 90
Motion Imitation
SLIDE 91
Motion Imitation
SLIDE 92
Humanoid: Cartwheel B
Video: Cartwheel B Reference Motion Policy
SLIDE 93
Humanoid: Jumping Jack
Video: Jumping Jack Policy
SLIDE 94
Humanoid: Backflip B
Video: Backflip B Policy
SLIDE 95
Humanoid: Frontflip
Video: Frontflip Policy
SLIDE 96
Humanoid: Roll
Video: Roll Policy
SLIDE 97
Humanoid: Spin
Video: Spin Policy
SLIDE 98
Humanoid: Kip-Up
Video: Kip-Up Policy
SLIDE 99
Humanoid: Vault
Video: Vault Policy
SLIDE 100
Atlas: Handspring A
Video: Handspring A Policy
SLIDE 101
Atlas: Jump
Video: Jump Policy
SLIDE 102
Atlas: Vault
Video: Vault Policy
SLIDE 103
Atlas: Dance
Video: Dance Policy
SLIDE 104
Environment Retargeting
SLIDE 105
Environment Retargeting
SLIDE 106
Failure Cases
SLIDE 107
SLIDE 108
Skills from Videos
Policy Video: Backflip A
SLIDE 109
Concluding Remarks
- Simple method can learn a large repertoire of skills
- Minimizing tracking error works (surprisingly) well
SLIDE 110
Concluding Remarks
- Simple method can learn a large repertoire of skills
- Minimizing tracking error works (surprisingly) well
- A lot of room for improvement for video imitation
– More end-to-end approach – Outdoor sports – Multiple actors
SLIDE 111
Concluding Remarks
- Simple method can learn a large repertoire of skills
- Minimizing tracking error works (surprisingly) well
- A lot of room for improvement for video imitation
– More end-to-end approach – Outdoor sports – Multiple actors
- Code: https://github.com/xbpeng/DeepMimic
SLIDE 112
Collaborators
Pieter Abbeel Sergey Levine Michiel van de Panne Angjoo Kanazawa Jitendra Malik
SLIDE 113