Temporal Gaussian Mixture Layer for Videos AJ Piergiovanni and - PowerPoint PPT Presentation

Nov 12, 2022

Temporal Gaussian Mixture Layer for Videos
AJ Piergiovanni and Michel S. Ryoo
Indiana University
Motivation – Video Representation Learning
• Learning good video representations has many applications
• Robot perception, activity recognition, smart cities, sports analysis
• Videos are high-dimensional spatio-temporal data, so abstracting representations is critical for many tasks
• Standard methods use CNNs with temporal convolution (e.g., 1D or 3D convolution)
Temporal Information is Needed
• Standard CNNs only capture short-term information
• 2D CNNs use a single frame
• 3D CNNs capture only 2-3 seconds
• Short clips can be ambiguous
• Extending 3D/1D conv to longer durations leads to many parameters and poor performance
Temporal Gaussian Mixture Layer
• Can learn longer-term temporal structures without increasing parameters
• Learns a set of Gaussians and mixing weights which generates the temporal convolutional kernel
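To make the kernel construction concrete, here is a minimal NumPy sketch of how a set of Gaussians and mixing weights could generate temporal kernels. The function name `tgm_kernels`, the argument names, and the specific parameterization (tanh to place centers inside the window, exp to keep variances positive, softmax for the mixing weights) are assumptions for illustration and are not stated in the slides; the authors' code at the linked repository is the reference.

```python
import numpy as np

def tgm_kernels(mu, log_var, mix_logits, length):
    """Build temporal kernels as mixtures of Gaussians (illustrative sketch).

    mu, log_var : arrays of shape (M,), unconstrained center and log-variance
    mix_logits  : array of shape (C_out, M), unconstrained mixing weights
    length      : temporal extent L of each kernel
    Returns an array of shape (C_out, L).
    """
    # Assumed parameterization: squash centers into [0, L-1], keep variances positive.
    centers = (length - 1) * (np.tanh(mu) + 1.0) / 2.0
    var = np.exp(log_var)

    # Evaluate each Gaussian at the integer time steps 0..L-1 -> shape (M, L).
    t = np.arange(length, dtype=np.float64)
    gauss = np.exp(-((t[None, :] - centers[:, None]) ** 2) / (2.0 * var[:, None]))
    gauss /= gauss.sum(axis=1, keepdims=True)  # each Gaussian sums to 1 over time

    # Softmax over the M Gaussians gives one set of mixing weights per output kernel.
    w = np.exp(mix_logits - mix_logits.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)

    # Each kernel is a convex combination of the M Gaussians -> shape (C_out, L).
    return w @ gauss

# Hypothetical usage: 3 Gaussians, 4 output kernels, window of 15 time steps.
kernels = tgm_kernels(np.array([-1.0, 0.0, 1.0]), np.zeros(3), np.zeros((4, 3)), 15)
```

Under this parameterization the learnable values are `mu`, `log_var`, and `mix_logits` only, so changing `length` changes the kernel's temporal extent without changing the number of parameters.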
Using TGMs
• Can apply TGM as standard 1D convolution or as grouped 2D convolution
• Loses some information when combining the base CNN channels
[Figure panels: Standard 1D Conv; 1D Conv with TGM kernels; TGM + TC-Grouping]
Temporal Channel Grouped Convolution
• TC-Grouping adds a new temporal channel axis
• Allows for learning of different temporal structures with base CNN feature channels
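One way to picture the extra temporal channel axis is the sketch below, which applies each temporal kernel to every base CNN feature channel separately instead of summing over channels. This is an assumed reading of the slide, not the authors' implementation: the function name `apply_temporal_kernels`, the (D, T) input layout, and the use of `np.convolve` with `mode="same"` are all illustrative choices.

```python
import numpy as np

def apply_temporal_kernels(features, kernels):
    """Apply each temporal kernel to every feature channel independently.

    features : array of shape (D, T), per-frame base CNN features over T steps
    kernels  : array of shape (C_out, L), temporal kernels with L <= T
    Returns an array of shape (C_out, D, T): a new leading "temporal channel" axis.
    """
    d, t = features.shape
    c_out, length = kernels.shape
    if length > t:
        raise ValueError("kernel length must not exceed the number of time steps")

    out = np.empty((c_out, d, t))
    for c in range(c_out):
        for j in range(d):
            # The feature channels are never mixed: each one is filtered along
            # time only. np.convolve flips the kernel, which makes no difference
            # for kernels that are symmetric in time.
            out[c, j] = np.convolve(features[j], kernels[c], mode="same")
    return out

# Hypothetical usage: 3 feature channels, 10 time steps, 2 temporal kernels.
feats = np.ones((3, 10))
kerns = np.array([[0.0, 1.0, 0.0], [1 / 3, 1 / 3, 1 / 3]])
responses = apply_temporal_kernels(feats, kerns)
```

Because the D feature channels are kept separate, each of the C_out kernels yields its own temporal response for every base feature, which is the property the slide attributes to TC-Grouping.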
Activity Detection with TGMs
• Applies a base CNN, then TGM layers to learn longer-term temporal structure, then a classification layer.
Fewer Parameters
• LSTMs and 1D Conv with fewer parameters lead to nearly random performance.
• Stacking 1D conv reduces performance, but stacking TGMs is beneficial.
Results on MultiTHUMOS
[Figure; labels as extracted: Super-Events Ground Truth Baseline TGM Full]
Results on Charades
[Figure; labels as extracted: Super-Events Ground Truth Baseline TGM Full]
Increasing temporal resolution
• Increasing 1-D conv size reduces performance
• Increasing TGMs adds no parameters, improves performance and focuses on important intervals
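The parameter argument can be checked with a little arithmetic. The sketch below compares a plain 1D temporal convolution, whose weight count grows with the kernel length, against a TGM-style layer under the assumption that it learns one center and one width per Gaussian plus one mixing weight per (output kernel, Gaussian) pair. Both helper functions and the example sizes are hypothetical and not taken from the slides.

```python
def conv1d_params(c_in, c_out, length):
    """Weights in a standard 1D temporal convolution (bias ignored)."""
    return c_in * c_out * length

def tgm_params(num_gaussians, c_out):
    """Assumed TGM-style count: a center and a width per Gaussian,
    plus one mixing weight per (output kernel, Gaussian) pair."""
    return 2 * num_gaussians + c_out * num_gaussians

# Hypothetical sizes: 1024 input features, 8 output kernels, 16 Gaussians.
for length in (5, 30):
    print(length, conv1d_params(1024, 8, length), tgm_params(16, 8))
```

The 1D convolution count scales linearly with `length`, while the assumed TGM count has no dependence on it, which matches the claim that a longer temporal extent adds no parameters.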
Thank you
Please visit our poster #149 for more details.
Code and models: https://github.com/piergiaj/tgm-icml19
