actom sequence models for efficient action detection
play

Actom Sequence Models for Efficient Action Detection LEAR INRIA - PowerPoint PPT Presentation

Actom Sequence Models for Efficient Action Detection LEAR INRIA Grenoble Adrien Gaidon Zaid Harchaoui Cordelia Schmid Presentation by Benoit Mass Introduction Video : Big Data Automatisation ? Semantic analysis Retrieval


  1. Actom Sequence Models for Efficient Action Detection LEAR – INRIA Grenoble Adrien Gaidon Zaid Harchaoui Cordelia Schmid Presentation by Benoit Massé

  2. Introduction ● Video : Big Data ● Automatisation ? – Semantic analysis – Retrieval Problem : Find if and when a specific action happen

  3. State of the art Training ● – Define the action – Choose the features – Train Retrieval ● – Classification – Detection

  4. State of the art Training ● – Define the action => Spatio-temporal extent – Choose the features => HoG, HoF, SP interest Point – Train => Bag-of-Feature Retrieval ● – Classification => SVM, Bayesian Network – Detection => ?

  5. Actoms ● Actom : short atomic action

  6. Actoms An actom has – A location t – A radius r Actom descriptors : Set of visual words – Bag of Features applied on HoG, HoF, Harris Interest points... – Ponderated sum from t - r to t + r

  7. Interest of Actoms ● An action is composed of several actoms – New goal : find an ordered sequence of actoms – No temporal dependance inside an action ● Gap between actoms ● Overlap ● An action can be composed of very different parts => Classic methods compute the average

  8. Actom Sequence Model (ASM) One Action = One Actom Sequence – The radius r i of actom i depends on its distance to the closer other actoms : min(t i - t i-1 , t i+1 - t i ) – ASM : concatenation of actoms words (x 11 , …, x 1k , x 21 , …, x 2k , x 31 , …, x 3k )

  9. Classification ● Given a new ASM (x 11 , ... x nk ), does it corresponds to the trained action ? (for instance : « drinking ») – Classic machine learning problem – Chosen solution : SVM – Including negative examples improves the classifier

  10. Detection ● Given a video, find all the occurences of the trained action. (for instance : « drinking ») For every 5 frames Set the current frame as the middle actom Generate candidates for other actoms Apply classification on the result End Delete non-maximal overlapping actions

  11. Detection Tricky step : Generating the other actoms We must estimate the distance between actoms – Training : Build the multivariate distribution {t i+1 – t i } Remove the outliers – Estimation : Try all the possible combinations (starting from the middle limit the error propagation)

  12. Experiments 4 kind of actions Criteria Drinking OV20 (20 % Overlap) – – Smoking OVAA (All Actoms Overlap) – – Open a door – Sit down – State of the art Comparison Bag of Features – Bag of Features with a grid – Other published methods –

  13. Results

  14. Conclusion ASM gives better result than state-of-the-art, using the same data sets. => Actoms are particularly adapted for representing the temporal structure of actions into videos

  15. QUESTIONS ?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend