A spatiotemporal model with visual attention for video - PowerPoint PPT Presentation

Sep 10, 2022 •41 likes •161 views

A spatiotemporal model with visual attention for video classification Mo Shan and Nikolay Atanasov Department of Electrical and Computer Engineering July 16, 2017 Outline Motivation Proposed model Experiment Conclusion Motivation Video

A spatiotemporal model with visual attention for video classification Mo Shan and Nikolay Atanasov Department of Electrical and Computer Engineering July 16, 2017
Outline Motivation Proposed model Experiment Conclusion
Motivation Video classification ◮ Semantic understanding of sequential visual input is important for robots in localization and object detection. ◮ Eg, search for a cat in a living room, instead of in a gym.
Motivation Rotation and scale ◮ Existing benchmark contains videos of daily scenes. ◮ Objects in real world could be rotated and scaled.
Motivation Visual attention ◮ Attention mechanism reduces complexity and avoids cluttering. This makes it easier to deal with rotated and scaled images.
Proposed model Architecture ◮ The proposed model concatenates CNN to RNN. ◮ The CNN stage is augmented with attention modules.
Proposed model Attention modules ◮ STN (Jaderberg, 2015) learns a global affine transformation. ◮ DCN (Dai, 2017) learns offsets locally and densely.
Experiment Dataset ◮ Moving MNIST is augmented with rotation and scaling.
Experiment Quantitative analysis ◮ Results are shown in Table 1. ◮ DCN-LSTM consistently performs the best in all cases. Table: Comparison of cross entropy loss and test accuracy for the proposed model and baseline. Moving MNIST LeNet-LSTM STN-LSTM DCN-LSTM Normal 1 . 44 , 97 . 96% 1 . 98 , 87 . 26% 1 . 27 , 99 . 62% Rotation 1 . 42 , 98 . 43% 1 . 97 , 90 . 47% 1 . 29 , 99 . 70% Scaling 1 . 52 , 96 . 28% 1 . 99 , 86 . 90% 1 . 28 , 99 . 41% Rotation+Scaling 1 . 51 , 96 . 82% 1 . 99 , 89 . 10% 1 . 25 , 99 . 46%
Experiment Qualitative analysis ◮ STN could not attend to each digit individually.
Experiment Digit gesture classification ◮ Elastic deformation simulates oscillations of hand muscles. ◮ Results are shown in Table 2. ◮ DCN could learn the deformation field explicitly. ◮ DCN-LSTM has the potential to handle articulated objects. Table: Cross entropy loss and test accuracy for deformed digits. LeNet-LSTM STN-LSTM DCN-LSTM 1 . 48 , 97 . 19% 1 . 48 , 97 . 19% 1 . 28 , 99 . 30%
Conclusion Key insights ◮ DCN-LSTM achieves high accuracy compared to baseline. ◮ Attention isuseful to deal with rotation and scale changes. ◮ STN-LSTM performs poorly due to global transformation. ◮ Future work: how to train the entire model end to end.

Recommend

Attention in NLP CS 6956: Deep Learning for NLP Overview What is attention Attention in

Attention in NLP CS 6956: Deep Learning for NLP Overview What is attention Attention in encoder-decoder networks Various kinds of attention 2 Overview What is attention? Attention in encoder-decoder networks 3 Visual

971 views • 73 slides

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza, OD, FCOVD , , , , A Model of Visual Imagery A Model of Visual Imagery A Model of Visual Imagery A Model of Visual Imagery What shape are

533 views • 39 slides

Attention, Transformer and BERT Prof. Kuan-Ting Lai 2020/6/16 Attention is All You Need! A.

Attention, Transformer and BERT Prof. Kuan-Ting Lai 2020/6/16 Attention is All You Need! A. Waswani et al., NIPS , 2017 Google Brain & University of Toronto 2 Attention Visual attention and textual attention

628 views • 21 slides

Spatiotemporal Regulation of ERK by Spatiotemporal Regulation of ERK by Dual- -specificity

Spatiotemporal Regulation of ERK by Spatiotemporal Regulation of ERK by Dual- -specificity Phosphatases specificity Phosphatases Dual University of Bristol IN Cell 1000 University of Bristol IN Cell 1000 WT Equipment Grant WT Equipment

551 views • 20 slides

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification Xiaofang Wang, Xuehan

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification Xiaofang Wang, Xuehan Xiong, Maxim Neumann, AJ Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua Convolutional networks are dominant C3D [ICCV

746 views • 26 slides

Attention Eye tracking seminar 2/19/15 Presented by Tatiana Emmanouil Outline What is

Attention Eye tracking seminar 2/19/15 Presented by Tatiana Emmanouil Outline What is attention? How is attention allocated? How are eye movements related to attention? Further questions Attention Attention

331 views • 18 slides

The Attention Economy What is the attention economy? A business model where you (as the

The Attention Economy What is the attention economy? A business model where you (as the company) want to hold the users attention as much as possible. Attention is treat like a scarce resource What are ethical issues that have emerged

170 views • 3 slides

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3 285 millions visually impaired people Retina Visual cortex 3 285 millions visually impaired people Retina Visual cortex

744 views • 63 slides

Visual Attention FEF V4 spatial attention: simultaneous neural recordings in V4

10/29/14 Visual Attention FEF V4 spatial attention: simultaneous neural recordings in V4 & Frontal Eye Fields (monkeys) Gregoriou, Gotts, Zhou, Desimone (2010) object attention: MEG & fMRI in FFA (faces),

238 views • 4 slides

A spatiotemporal stochastic model for tropical precipitation and water vapor dynamics. Scott

A spatiotemporal stochastic model for tropical precipitation and water vapor dynamics. Scott Hottovy and Sam Stechmann (UW) shottovy@math.wisc.edu University of Wisconsin ONR DURIP grant N00014-14-1-0251 S. Hottovy, UW Spatiotemporal

382 views • 16 slides

ViAMoD Visual Spatiotemporal Pattern Analysis of Movement and Event Data Prof. Dr. Stefan Wrobel

ViAMoD Visual Spatiotemporal Pattern Analysis of Movement and Event Data Prof. Dr. Stefan Wrobel Dr. Natalia Andrienko Prof. Dr. Daniel Keim Dr. Gennady Andrienko Dr. Peter Bak NN Slava Kiselevich http://visual-analytics.info

455 views • 26 slides

Attention! 1. Definitions and behavioral effects 2. Effects on neural firing rates: Spatial

4/14/17 Attention! 1. Definitions and behavioral effects 2. Effects on neural firing rates: Spatial attention Attention to features 3. Directing attention: Posterior parietal cortex Frontal eye fields Top-down and bottom-up attention 1

336 views • 17 slides

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory pathway y p y Refractive errors Refractive errors Cloudy of ocular media Cloudy of ocular media Functional visual loss Functional visual loss

1.99k views • 187 slides

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and tactile displays Tactile displays Tactile displays Integrated displays Integrated displays Alan Liu Alan Liu Applications

80 views • 5 slides

An Overview of Models and Methods for Spatiotemporal Data Analysis Jim Zidek- U British

An Overview of Models and Methods for Spatiotemporal Data Analysis Jim Zidek- U British Columbia, Vancouver, Canada May 30, 2012 Jim Zidek- (UBC) An Overview of Models and Methods for Spatiotemporal Data Analysis May 30, 2012 1

1.3k views • 111 slides

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made was in the early 1940-1950s. Also the most popular video game back then was Cathode Ray Tube. Video Game Research. Video Games are sometimes

419 views • 11 slides

GPU Acceleration on the 3D Elastic RTM Method Lin Gan, Tsinghua University May 8 st , 2017, GTC

High Performance Geo-Computing Group GPU Acceleration on the 3D Elastic RTM Method Lin Gan, Tsinghua University May 8 st , 2017, GTC 2017 About Tsinghua HPGC High Performance Geo-Computing Group Interdisciplinary research group High

856 views • 43 slides

Linear Elastic Model for Generating Wavy Structure Wavy in Lipid Membrane by Peripheral Proteins

Linear Elastic Model for Generating Linear Elastic Model for Generating Wavy Structure Wavy in Lipid Membrane by Peripheral Proteins Structure in Lipid Membrane by by Paritosh Mahata Paritosh Mahata PRAVARTANA - 2016 Indian Institute

1.15k views • 18 slides

PTC India Financial Services Limited May 2012 Our Vision and Mission Be the most preferred

PTC India Financial Services Limited May 2012 Our Vision and Mission Be the most preferred financial services Vision partner in the entire energy value chain. To partner and forge strong relationships with credible stakeholders to provide

141 views • 13 slides

../ DEEPAK FERTILISERS AND PETROCHEMICALS CORPORATION LIMITED 4 September 2019 BSE Limited

Regd. Ofgice: Sai Hira, Survey No. 93, Thanking you, Conference on 5 th September, 2019 in Mumbai. A copy of the presentation is enclosed in this regard. NOTE: Dates are subject to changes. Changes may happen due to exigencies on the part of

417 views • 37 slides

MATRIX DAMAGE IN LAMINATED COMPOSITES UNDER BIAXIAL STRESS M. Salavatian, L.V. Smith* 1 School of

18 TH INTERNATIONAL CONFERENCE ON COMPOSITE MATERIALS MATRIX DAMAGE IN LAMINATED COMPOSITES UNDER BIAXIAL STRESS M. Salavatian, L.V. Smith* 1 School of Mechanical and Materials Engineering, Washington State University, Pullman, USA *Corresponding

251 views • 4 slides

Dynamics of harmonically excited irregular cellular metamaterials S. Adhikari 1 , T. Mukhopadhyay

Dynamics of harmonically excited irregular cellular metamaterials S. Adhikari 1 , T. Mukhopadhyay 2 , A. Al` u 3 1 Zienkiewicz Centre for Computational Engineering, College of Engineering, Swansea University, Bay Campus, Swansea, Wales, UK, Email:

255 views • 22 slides

How Businesses Survive How Businesses Survive after a Disaster after a Disaster EPI CC

How Businesses Survive How Businesses Survive after a Disaster after a Disaster EPI CC April 28, 2009 April 28, 2009 EPI CC Fredric Kropp, PhD Fredric Kropp, PhD Monterey Institute of International Monterey Institute of

545 views • 50 slides

Thailand CGE Model Thailand CGE Model AI M/ Material AI M/ Material Sunil Malla Malla Sunil

Thailand CGE Model Thailand CGE Model AI M/ Material AI M/ Material Sunil Malla Malla Sunil Wongkot Wongsapai Wongsapai Wongkot Asian I nstitute of Technology Dec 2, 2004 APEI S Training Workshop NI ES/ Japan 1 Outline Outline

335 views • 31 slides