Inferring Human Interaction from Motion Trajectories
University of California, Los Angeles, USA
1 Department of Statistics 2 Department of Psychology
Tianmin Shu1 Yujia Peng2 Hongjing Lu2 Song-Chun Zhu1 Lifeng Fan1
Inferring Human Interaction from Motion Trajectories Tianmin Shu 1 - - PowerPoint PPT Presentation
Inferring Human Interaction from Motion Trajectories Tianmin Shu 1 Yujia Peng 2 Lifeng Fan 1 Hongjing Lu 2 Song-Chun Zhu 1 University of California, Los Angeles, USA 1 Department of Statistics 2 Department of Psychology People are adept at
University of California, Los Angeles, USA
1 Department of Statistics 2 Department of Psychology
Tianmin Shu1 Yujia Peng2 Hongjing Lu2 Song-Chun Zhu1 Lifeng Fan1
Heider and Simmel (1944)
low-level motion cues, e.g., speed and motion direction. (Dittrich & Lea, 1994; Scholl &
Tremoulet, 2000; Tremoulet & Feldman, 2000, 2006; Gao, Newman, & Scholl, 2009; Gao, McCarthy, & Scholl, 2010…)
[Gao & Scholl, 2011] Chasing vs. Stalking
[Shu et al., 2015] [Choi et al., 2009]
(Shu, et al., 2015)
Tracking human trajectories and labeling group human interactions.
[Shu et al., CVPR 2015]
interacting at each moment.
Interactive instances Non-interactive instances
Video frame
Interactive action 4 Non-interactive action 40
N = 33
Video frame
Interactive action 4 Non-interactive action 40
(Baker, Goodman, & Tenenbaum, 2008; Baker, Saxe, and Tenenbaum, 2009; Ullman et al., 2009; Baker, 2012...)
[Ullman et al., 2009] [Baker, 2012]
(Baker, Goodman, & Tenenbaum, 2008; Baker, Saxe, and Tenenbaum, 2009; Ullman et al., 2009; Baker, 2012...)
S: latent sub-interactions Y: interaction labels Г : input layer, motion trajectories of two agents
(0: interactive, 1: non-interactive)
Linear Dynamic System:
(0: interactive, 1: non-interactive)
(0: interactive, 1: non-interactive)
(0: interactive, 1: non-interactive)
(0: interactive, 1: non-interactive)
The model infers the current status of latent variables
Infer st under the assumption of interaction (i.e., yt = 1) The model infers the current status of latent variables
Infer st under the assumption of interaction (i.e., yt = 1) The posterior probability of yt = 1 given st ∈ S The model infers the current status of latent variables
Predict/synthesize xt+1 given yt and st Predict/synthesize st+1 given yt+1 and all previous s yt+1 = 1: interactive trajectories yt+1 = 0: non-interactive trajectories
http://www.stat.ucla.edu/˜tianmin.shu/AerialVideo/AerialVideo.html
the stimuli)
http://www.stat.ucla.edu/˜tianmin.shu/AerialVideo/AerialVideo.html
the stimuli)
Comparison of online predictions by our full model (|S| = 15) (orange) and humans (blue) over time (in seconds) on testing videos.
Time (s) Time (s) Time (s) Interactive ratings
Comparison of online predictions by our full model (|S| = 15) (orange) and humans (blue) over time (in seconds) on testing videos.
Time (s) Time (s) Time (s) Interactive ratings
Trained on Heider-simmel stimuli, tested on aerial video stimuli: r = 0.640 and RMSE of 0.227
Synthesized interactive video (y=1)
Model predicted interactiveness
5x
Synthesized interactive video (y=1)
Model predicted interactiveness
5x
Model predicted interactiveness
Synthesized non-interactive video (y=0) 5x
N = 17
accurately by human observers.
between agents.
http://www.stat.ucla.edu/~tianmin.shu/HeiderSimmel/CogSci17/