keypoint based action keypoint based action recognition
play

Keypoint-Based Action Keypoint-Based Action Recognition - PowerPoint PPT Presentation

Keypoint-Based Action Keypoint-Based Action Recognition Recognition Presenter: Jianchao Yang Presenter: Jianchao Yang Course Instructor: Prof. Derek Hoiem Papers to discuss Papers to discuss Behavior recognition via sparse Behavior


  1. Keypoint-Based Action Keypoint-Based Action Recognition Recognition Presenter: Jianchao Yang Presenter: Jianchao Yang Course Instructor: Prof. Derek Hoiem

  2. Papers to discuss Papers to discuss • Behavior recognition via sparse • Behavior recognition via sparse spatio-temporal features. • Learning realistic human actions from movies. actions from movies.

  3. Behavior Recognition via Sparse Spatio-Temporal Features Spatio-Temporal Features • Motivated by the success application of key points in object recognition • Designed a spatio-temporal feature for behavior recognition recognition

  4. Approach Approach • Similar to what seen in object recognition • Similar to what seen in object recognition Key Points Feature Histogram Classifier Detection Extraction Prototypes Prototypes

  5. Keypoints detection Keypoints detection • Extension from 2D • Extension from 2D • Localization proceeds along the spatial dimensions x and the spatial dimensions x and y , as well as the temporal dimension t . dimension t . • 3D corners too rare

  6. Keypoints detection (cont’) Keypoints detection (cont’) • Response function: • Response function: – Spatial kernel is 2D Gaussian – Temporal kernel

  7. Keypoints detection (cont’) Keypoints detection (cont’) • Keypoints • Keypoints – By pooling maxima of the filter responses – Emphasize temporal information other than spatial information – Strong response to periodic motions – Strong response to periodic motions – Does not respond to pure translation motion – Totally unsupervised – Totally unsupervised

  8. Cuboid descriptor Cuboid descriptor Spatio-temporal Spatio-temporal Key points Key points Cuboids Cuboids descriptor Descriptor: Descriptor: Normalized pixel values; Gradients; Windowed optical flow, etc. Windowed optical flow, etc. Transform Transform Transform: Vectorize directly; Vectorize directly; Histogram (global or local).

  9. Cuboid descriptor (cont’) Cuboid descriptor (cont’) Gradient is best! Gradient is best! Vectorize directly is best! ??

  10. Behavior descriptor Behavior descriptor Spatio-temporal Spatio-temporal Behavior Behavior Key points Key points Cuboids Cuboids descriptor descriptor Transform Transform Prototypes Prototypes

  11. Experiment results Experiment results • Datasets: facial expression, mouse, human actions • Datasets: facial expression, mouse, human actions

  12. Experiments results (cont’) Experiments results (cont’) Mouse Database. Human Facial Expression Database.

  13. Learning realistic human actions from movies movies • Automatic annotation of human actions in video. • Automatic annotation of human actions in video. • Video classification by space-time features. • Video classification by space-time features.

  14. Bag-of-feature approach Bag-of-feature approach • Extension of recent advances in bag-of-feature • Extension of recent advances in bag-of-feature approaches – Spatial pyramid � more general spatial grids Spatial pyramid more general spatial grids – Fixed weights for each pyramid level � optimized – Spatial grid � space-time grids

  15. Space-time features Space-time features • Interest point detection: Harris operator • Interest point detection: Harris operator Space-time Key points Cuboids feature Histogram of oriented gradient ( HoG ) gradient ( HoG ) Local histogram Histogram of optical flow Histogram of optical flow ( HoF )

  16. Spatio-temporal bag-of-features Spatio-temporal bag-of-features • Hierarchical structure • Hierarchical structure Space-time Spatio-temporal Key points Cuboids feature feature Local Local Local Local histogram histogram

  17. Spatio-temporal grids Spatio-temporal grids

  18. Experiment results Experiment results • Evaluation of spatio-temporal grids • Evaluation of spatio-temporal grids

  19. Experiments results (cont’) Experiments results (cont’)

  20. Experiment results (cont’) Experiment results (cont’) • KTH action database • KTH action database

  21. Experiment results (cont’) Experiment results (cont’)

  22. Comments Comments • The two methods are extensions of key-points based • The two methods are extensions of key-points based image classification. Will dense descriptors be better? • Key-points based methods work surprisingly well for image and sequence classification, why? image and sequence classification, why? • Issues needed to address: – Discriminative key-points learning or design for the given task – Discriminative key-points selection for the given task – More efficient way to use location information

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend