End-to-end Learning of Action Detection from Frame Glimpses in Videos




SLIDE 1

End-to-end Learning of Action Detection from Frame Glimpses in Videos

CVPR 2016. Serena Yeung, Olga Russakovsky, Greg Mori, Li Fei-Fei. Presenter: Wei-Jen Ko

SLIDE 2

Action detection

  • Predict which action occurs in the video, and when.

SLIDE 3

Related Work

  • Motion features: dense trajectories
  • Appearance features: CNN + SIFT + color
  • Audio features: MFCC + ASR
  • Classified by an SVM over exhaustive segments of varying scale and temporal position.


  • D. Oneata, J. Verbeek, and C. Schmid. The LEAR Submission at THUMOS 2014.
  • L. Wang, Y. Qiao, and X. Tang. Action Recognition and Detection by Combining Motion and Appearance Features.
  • J. Yuan, Y. Pei, B. Ni, P. Moulin, and A. Kassim. ADSC Submission at THUMOS Challenge 2015.
SLIDE 4

Related Work

  • Dynamic feature prioritization
  • Predictive-corrective networks

  • A. Dave, O. Russakovsky, and D. Ramanan. Predictive-Corrective Networks for Action Detection, CVPR 2017.
  • Y.-C. Su and K. Grauman. Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video, ECCV 2016.

SLIDE 5

Proposed method

  • Recurrent neural network-based end-to-end model
  • Decides which frame to observe next and when to emit a prediction.

SLIDE 6

Observation Network

  • Video frame vln → VGG → image feature
  • Image feature + frame location ln → FC → observation vector on
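A minimal sketch of this step, assuming simple concatenation for fusing the image feature with the frame location; the toy `cnn_features` random projection merely stands in for the VGG features, and the fusion scheme is an illustration, not the paper's exact observation network:

```python
import numpy as np

rng = np.random.default_rng(0)

def cnn_features(frame, dim=1024):
    """Stand-in for the VGG image feature of the slide: a fixed-seed random
    projection of the flattened frame (a real model would run a CNN here)."""
    w = np.random.default_rng(42).standard_normal((dim, frame.size))
    return (w @ frame.ravel()) / np.sqrt(frame.size)

def observation(frame, l_n, dim=1024):
    """Build the observation vector o_n from the frame v_{l_n} and its
    normalized location l_n in [0, 1]; concatenation (plus the FC layer that
    would follow) is an assumption for illustration."""
    feat = cnn_features(frame, dim)
    return np.concatenate([feat, [l_n]])  # 1024-d feature + 1 location entry

frame = rng.standard_normal((8, 8, 3))  # tiny stand-in for a video frame
o_n = observation(frame, l_n=0.25)      # o_n has shape (1025,)
```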

SLIDE 7

Recurrent Network

  • sn: start location of the action
  • en: end location of the action
  • ln+1: location of the video frame to observe next
  • cn: confidence level of the prediction
  • pn: prediction indicator
  • sn, en, and ln+1 are normalized to [0, 1]
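One recurrence of the agent can be sketched as follows; the single tanh cell and the shared sigmoid head are simplifications (the paper uses an LSTM-based recurrent network), and all sizes are toy values:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
H, D = 16, 32                           # toy hidden and observation sizes
Wh = 0.1 * rng.standard_normal((H, H))  # recurrent weights
Wo = 0.1 * rng.standard_normal((H, D))  # observation weights
Wy = 0.1 * rng.standard_normal((5, H))  # heads: sn, en, cn, pn, l_{n+1}

def agent_step(h, o_n):
    """Update the hidden state from observation o_n, then emit the candidate
    detection (sn, en), confidence cn, prediction indicator pn, and the next
    frame location l_{n+1}; sigmoid keeps sn, en, l_{n+1} in [0, 1]."""
    h = np.tanh(Wh @ h + Wo @ o_n)
    s_n, e_n, c_n, p_logit, l_next = sigmoid(Wy @ h)
    p_n = int(p_logit > 0.5)            # binary: emit a prediction or not
    return h, (s_n, e_n, c_n, p_n, l_next)

h = np.zeros(H)
h, (s_n, e_n, c_n, p_n, l_next) = agent_step(h, rng.standard_normal(D))
```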

SLIDE 8

SLIDE 9

Loss function

  • Lcls(dn): cross-entropy loss on the confidence cn
  • Lloc(dn, gm): L2 regression loss minimizing the distance between the predicted (sn, en) and the matched ground-truth segment gm
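The supervised part of the objective can be sketched like this, assuming dn = (sn, en, cn) is matched to a ground-truth segment gm with label y, and that the localization term only applies to positives; the 1:1 weighting of the two terms is an assumption:

```python
import numpy as np

def detection_loss(d_n, g_m, y):
    """L = Lcls + Lloc: cross-entropy on the confidence c_n plus, for
    positive matches (y = 1), an L2 regression pulling the predicted
    (s_n, e_n) toward the matched ground-truth segment g_m (equal
    weighting of the two terms is assumed here)."""
    s_n, e_n, c_n = d_n
    s_gt, e_gt = g_m
    l_cls = -(y * np.log(c_n) + (1 - y) * np.log(1.0 - c_n))
    l_loc = (s_n - s_gt) ** 2 + (e_n - e_gt) ** 2 if y == 1 else 0.0
    return l_cls + l_loc

loss = detection_loss((0.2, 0.6, 0.9), (0.25, 0.55), y=1)
```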

SLIDE 10

Reward function

  • pn and ln+1 are non-differentiable decisions, so they are trained with REINFORCE.
  • A negative reward is given if the model does not emit predictions for videos containing action instances.
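A toy version of the REINFORCE update for the location decision might look like this; the Gaussian policy over l_{n+1}, its fixed standard deviation, and the rewarded segment [0.4, 0.6] are all illustrative assumptions, not values from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
SIGMA = 0.1  # assumed fixed std of the Gaussian policy over l_{n+1}

def logp_grad_mu(l, mu):
    """Score function d/dmu log N(l; mu, SIGMA^2): the direction REINFORCE
    scales by the reward to train the non-differentiable location choice."""
    return (l - mu) / SIGMA ** 2

def reinforce_update(mu, episodes=500, lr=0.005):
    """Sample a location, reward +1 if it lands inside a hypothetical action
    segment [0.4, 0.6] and -1 otherwise (echoing the negative reward for
    missed instances), then ascend the reward-scaled score; locations are
    clipped so they stay normalized in [0, 1]."""
    for _ in range(episodes):
        l = rng.normal(mu, SIGMA)
        r = 1.0 if 0.4 <= l <= 0.6 else -1.0
        mu = float(np.clip(mu + lr * r * logp_grad_mu(l, mu), 0.0, 1.0))
    return mu

mu = reinforce_update(0.2)
```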

SLIDE 11

THUMOS’14 Results

SLIDE 12

If the observed frames are not determined dynamically, fixed sampling does not provide sufficient resolution to localize action boundaries.

SLIDE 13

ActivityNet Results

SLIDE 14

Strengths:

  • First end-to-end training approach for action detection
  • Selects important frames to observe; no exhaustive search over segments
  • Better results on THUMOS'14 and ActivityNet

