LEARNING TEMPORAL EMBEDDINGS FOR COMPLEX VIDEO ANALYSIS BY - PowerPoint PPT Presentation

Oct 14, 2023

LEARNING TEMPORAL EMBEDDINGS FOR COMPLEX VIDEO ANALYSIS BY RAMANATHAN, TANG, MORI, AND LI Chad Voegele
PROBLEM What can we learn about videos without supervision?
MOTIVATION ... quick fox jumps over dog ...
WORD2VEC FOR VIDEOS? words ≈ frames, sentences ≈ video segments
WORD2VEC FOR VIDEOS? ISSUES 1. Frames are not discrete. 2. Visual similarity between neighboring frames. 3. Representation of context.
FRAME EMBEDDING ⟶
FRAME EMBEDDING input Alex Magic Net fc7 ReLU LRN
EMBEDDING OBJECTIVE $\mathrm{similarity}(a, b) = \frac{a \cdot b}{\|a\| \|b\|} = a \cdot b$ (the last equality holds when $a$ and $b$ have unit norm)
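A minimal sketch of why the cosine similarity on this slide collapses to a plain dot product. It assumes the embeddings have been scaled to unit length; the helper names (`dot`, `norm`, `normalize`, `cosine_similarity`) and the toy vectors are illustrative and do not come from the slides.

```python
import math

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    """Euclidean norm of a vector."""
    return math.sqrt(dot(a, a))

def normalize(a):
    """Scale a vector to unit length (assumes a is not the zero vector)."""
    n = norm(a)
    return [x / n for x in a]

def cosine_similarity(a, b):
    """similarity(a, b) = (a . b) / (||a|| ||b||)."""
    return dot(a, b) / (norm(a) * norm(b))

# Hypothetical toy vectors, not real frame embeddings.
raw_a, raw_b = [3.0, 4.0], [1.0, 0.0]
a, b = normalize(raw_a), normalize(raw_b)

# For unit-norm vectors the denominator is 1, so the two values coincide.
print(cosine_similarity(raw_a, raw_b), dot(a, b))
```

Normalizing once up front means every later comparison needs only a dot product.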
EMBEDDING OBJECTIVE $f_{v_j} \cdot h_{v_j} \gg f_{v^-} \cdot h_{v_j}$
EMBEDDING OBJECTIVE $\min_{\text{embedding}} \sum_{v \in V} \sum_{v_j \in v} \sum_{v^- \neq v_j} \max\left(0,\ 1 - (f_{v_j} - f_{v^-}) \cdot h_{v_j}\right)$
EMBEDDING OBJECTIVE WANT $1 - (f_{v_j} - f_{v^-}) \cdot h_{v_j} < 0 \iff f_{v_j} \cdot h_{v_j} > 1 + f_{v^-} \cdot h_{v_j}$
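The hinge term in the objective can be sketched in a few lines. This is an illustration only, not the authors' training code: `hinge_loss` and `frame_loss` are hypothetical names, the vectors are toy values, and no optimizer or network is involved.

```python
def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def hinge_loss(f_pos, f_neg, h):
    """max(0, 1 - (f_pos - f_neg) . h) for one frame, one negative, one context."""
    diff = [p - n for p, n in zip(f_pos, f_neg)]
    return max(0.0, 1.0 - dot(diff, h))

def frame_loss(f_pos, negatives, h):
    """Sum of the hinge term over all negatives for a single frame."""
    return sum(hinge_loss(f_pos, f_neg, h) for f_neg in negatives)

h = [1.0, 0.0]  # hypothetical context vector

# Margin satisfied: f_pos . h exceeds f_neg . h by more than 1, so the loss is zero.
satisfied = hinge_loss([2.0, 0.0], [0.0, 0.0], h)

# Margin violated: f_pos . h - f_neg . h = 0.5, so the loss is 1 - 0.5 = 0.5.
violated = hinge_loss([0.5, 0.0], [0.0, 0.5], h)

print(satisfied, violated)
```

The loss is zero exactly when the WANT condition holds, so only negatives that score within the margin of the true frame contribute to the objective.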
FRAME CONTEXT $h_{v_j} = \frac{1}{2T} \sum_{t=1}^{T} \left( f_{v_{j-t}} + f_{v_{j+t}} \right)$; $h_{v_j} = \frac{1}{T} \sum_{t=1}^{T} f_{v_{j-t}}$; $h_{v_j} \in \{ f_{v_k} \mid k \neq j \}$
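The three context definitions can be compared side by side with a small sketch. The function names are hypothetical, and the boundary handling (rejecting windows that run past the ends of the video) is an assumption, since the slide does not say how edges are treated.

```python
def mean_vectors(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def check_window(frames, j, T, need_future):
    """Assumption: windows that fall outside the video are rejected."""
    if j - T < 0 or (need_future and j + T >= len(frames)):
        raise ValueError("context window extends past the video boundary")

def context_past_and_future(frames, j, T):
    """h = 1/(2T) * sum_{t=1..T} (f_{j-t} + f_{j+t})."""
    check_window(frames, j, T, need_future=True)
    neighbors = []
    for t in range(1, T + 1):
        neighbors.append(frames[j - t])
        neighbors.append(frames[j + t])
    return mean_vectors(neighbors)

def context_past_only(frames, j, T):
    """h = 1/T * sum_{t=1..T} f_{j-t}."""
    check_window(frames, j, T, need_future=False)
    return mean_vectors([frames[j - t] for t in range(1, T + 1)])

def context_pairwise_candidates(frames, j):
    """h is any single other frame: the set {f_k | k != j}."""
    return [f for k, f in enumerate(frames) if k != j]

# Hypothetical 1-D "embeddings" for five consecutive frames.
frames = [[0.0], [1.0], [2.0], [3.0], [4.0]]
both = context_past_and_future(frames, j=2, T=2)
past = context_past_only(frames, j=2, T=2)
pairs = context_pairwise_candidates(frames, j=2)
print(both, past, len(pairs))
```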
MULTI-RESOLUTION & NEGATIVES
EVENT RETRIEVAL
TASK $v \to \{ v_j \in V \mid \mathrm{event}(v) = \mathrm{event}(v_j) \}$
METHOD For each $v_j \in V$: 1. Uniformly sample 4 frames from $v_j$. 2. Compute and average the frame embeddings. Then: 1. Sort $\{ \bar{f}_v \cdot \bar{f}_{v_k} \mid v_k \neq v \}$.
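The retrieval method above can be sketched as follows. Two points are assumptions rather than details from the slides: "uniformly sample" is read here as evenly spaced frame indices, and the video names and 2-D embeddings are made-up toy data standing in for real frame embeddings.

```python
def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def mean_vectors(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def sample_indices(n_frames, k=4):
    """Evenly spaced frame indices (one reading of 'uniformly sample')."""
    return [(i * n_frames) // k for i in range(k)]

def video_embedding(frame_embeddings, k=4):
    """Average the embeddings of k sampled frames."""
    indices = sample_indices(len(frame_embeddings), k)
    return mean_vectors([frame_embeddings[i] for i in indices])

def rank_videos(query, video_embeddings):
    """Sort all other videos by dot product with the query, highest first."""
    q = video_embeddings[query]
    others = [name for name in video_embeddings if name != query]
    return sorted(others, key=lambda name: dot(q, video_embeddings[name]), reverse=True)

# Hypothetical per-frame embeddings for three videos.
videos = {
    "parkour_1": [[1.0, 0.0]] * 8,
    "parkour_2": [[0.9, 0.1]] * 8,
    "cooking_1": [[0.0, 1.0]] * 8,
}
embeddings = {name: video_embedding(frames) for name, frames in videos.items()}
ranking = rank_videos("parkour_1", embeddings)
print(ranking)
```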
EVENT RETRIEVAL
Method                    mAP (%)
Chance                    6.53
Two-stream pre-trained    20.09
fc6                       20.08
fc7                       21.24
Model (no future)         21.30
Model (no hard neg.)      24.22
Model (best)              25.07
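For readers unfamiliar with the metric, mean average precision (mAP) can be computed as below. This is the standard definition, not code from the presentation, and the relevance lists are made-up examples rather than the evaluation data behind the table.

```python
def average_precision(relevance):
    """Mean of precision@k taken at each rank k where a relevant item appears."""
    hits = 0
    precisions = []
    for rank, is_relevant in enumerate(relevance, start=1):
        if is_relevant:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0

def mean_average_precision(relevance_lists):
    """Average of per-query average precision."""
    return sum(average_precision(r) for r in relevance_lists) / len(relevance_lists)

# Each list marks, in ranked order, whether a retrieved video shares the query's event.
queries = [
    [1, 0, 1],  # relevant items at ranks 1 and 3
    [0, 1],     # relevant item at rank 2
]
print(mean_average_precision(queries))
```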
EVENT RETRIEVAL
SAMPLE VIDEOS Awesome Parkour and Freerunning 20... Skateboarding Montage 2015
TEMPORAL ORDER RECOVERY 2 1 4 3 ⟶ 1 2 3 4
TEMPORAL ORDER RECOVERY METHOD Given $\{ s_{v_j} \mid s_{v_j} \in v_j \}$. Until done: 1. Average the last two frame embeddings. 2. Find the next frame as the frame with the highest similarity.
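The greedy procedure can be sketched as below. The slide does not say how the sequence is seeded, so this sketch assumes the index of the first frame is given; with only one frame placed, the "average of the last two" is just that frame. The function name and the toy embeddings (unit vectors whose angle grows with time) are illustrative.

```python
import math

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def mean_vectors(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def recover_order(embeddings, start):
    """Greedily order frames: average the last two placed, pick the most similar remaining."""
    order = [start]
    remaining = set(range(len(embeddings))) - {start}
    while remaining:
        context = mean_vectors([embeddings[i] for i in order[-2:]])
        # sorted() makes tie-breaking deterministic (lowest index wins).
        nxt = max(sorted(remaining), key=lambda i: dot(context, embeddings[i]))
        order.append(nxt)
        remaining.remove(nxt)
    return order

def unit(angle_degrees):
    """Unit vector at the given angle; a stand-in for a frame embedding."""
    r = math.radians(angle_degrees)
    return [math.cos(r), math.sin(r)]

# Shuffled frames: true temporal order follows the angles 0, 20, 40, 60.
shuffled = [unit(40), unit(0), unit(60), unit(20)]
order = recover_order(shuffled, start=1)
print(order)
```

The returned list holds indices into the shuffled input, so here it reads the frames back in increasing angle.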
TEMPORAL ORDER RECOVERY
Method               Kendall Tau
Chance               50
Two-stream           42.05
fc6                  42.43
fc7                  41.67
Model (pairwise)     42.03
Model (no future)    40.91
Model (best)         40.41
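The slide does not define the metric precisely. One reading consistent with a chance value of 50 is the normalized Kendall tau distance expressed as a percentage, that is, the share of frame pairs placed in the wrong relative order (lower is better). The sketch below computes that quantity under this assumption; the function name is illustrative.

```python
from itertools import combinations

def kendall_tau_distance_percent(predicted, truth):
    """Percentage of item pairs whose relative order differs between two orderings."""
    if len(truth) < 2:
        return 0.0
    rank = {item: position for position, item in enumerate(predicted)}
    pairs = list(combinations(truth, 2))  # (a, b) with a before b in the true order
    discordant = sum(1 for a, b in pairs if rank[a] > rank[b])
    return 100.0 * discordant / len(pairs)

truth = [1, 2, 3, 4]
print(kendall_tau_distance_percent([1, 2, 3, 4], truth))  # identical ordering
print(kendall_tau_distance_percent([4, 3, 2, 1], truth))  # fully reversed
print(kendall_tau_distance_percent([2, 1, 4, 3], truth))  # two swapped pairs out of six
```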
TEMPORAL ORDERING FOR PHOTOS
DISCUSSION How are long-distance dependencies captured? Can we estimate the quality of embeddings independent of application? Hyper-parameter tuning: fps sampling, embedding dimension, negative selection, context representation
SOURCES
Word2Vec: An Introduction
Unsupervised Learning of Visual Representations using Videos by Nitish Srivastava
Visualizing Data using t-SNE by van der Maaten
Fox Over Dog Picture
Groundhog Day, 1993, Columbia Pictures
Efficient Estimation of Word Representations in Vector Space by Mikolov
