Video to Text Description Jia Chen 1 , Shizhe Chen 2 , Qin Jin 2 , - PowerPoint PPT Presentation

Oct 13, 2023 •34 likes •174 views

INF@TRECVID2017 Video to Text Description Jia Chen 1 , Shizhe Chen 2 , Qin Jin 2 , Alexander Hauptmann 1 Carnegie Mellon University 1 Renmin University of China 2 Main focus in this year: cross-dataset generalization Last year: As the

INF@TRECVID2017 Video to Text Description Jia Chen 1 , Shizhe Chen 2 , Qin Jin 2 , Alexander Hauptmann 1 Carnegie Mellon University 1 Renmin University of China 2
Main focus in this year: cross-dataset generalization • Last year: • As the video caption pilot task provides no training captions for videos, we treat it as an opportunity to test the generalization ability of the caption models. • This year: • We found that the performance of caption model begins to saturate within one dataset by comparison to human reference • opportunity->problem that we must face now
Motivation • human reference on MSRVTT • leave-one-out test on groundtruth • on par with the human reference on caption metrics • metric issue? • dataset issue (coupling with generalization issue)?
Motivation • eliminate the metric issues • on par with the human reference on tagging metrics (stop words removed)
Motivation • preliminary cross dataset expriment • pitfall in the dataset MSRVTT: • train/test clips could come from the same video • The median number of shots for single video clip is 2 in MSRVTT • information leakage • MSVD • too few videos • too many duplicate groundtruth sentences, which reduce the number of unique (video, caption) pairs
Cross-dataset Generalization Property of Models • Q1: Which one is more promising for better generalization on unseen datasets, higher quality training dataset or more robust model? • Q2: Could we get more stable generalization ability by ensembling more different models?
Basic Setting • Feature: • resnet200 • i3d • mfcc (bow + fv) • RNN with LSTM Cell • 512 hidden dimension, 512 input dimension • Train scheme • batch size of 64
Q1: Higher quality training dataset or more robust model for better generalization? • fix the model architecture to study its influence by treating TRECVID2016 as unseen dataset • fix the training datasets to study its influence by treating TRECVID2016 as unseen dataset
Q1: Higher quality training dataset or more robust model for better generalization? • Models: • Vanilla Encoder-decoder (MP) • Attention Encoder-decoder (ATT) • Training dataset: • MSRVTT+MSVD • TGIF
Q1: Higher quality training dataset or more robust model for better generalization? • the performance gain from dataset >> the gain from the caption model
Q1: Higher quality training dataset or more robust model for better generalization? • TGIF Dataset collection instruction:
Q2 Could we get more stable generalization ability by ensembling more different models? • more replicas of models: • varying the detailed settings such as tuning dropout rate and using different epochs in training • ensemble: • rerank sentences using the submitted model in the retrieval subtask
Q2 Could we get more stable generalization ability by ensembling more different models? • by ensembling more and more models from source domain datasets, the performance on the target domain dataset TRECVID16 improves consistently
Challenge Result

Recommend

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample

207 views • 10 slides

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here Enter Text Here Enter Text Here CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here Enter Text

697 views • 66 slides

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B Benefits C Take-Aways D Research Areas Add text add text add text add text add text add text add text add text add text add text add text E Research

513 views • 12 slides

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50 Inventory of ICANNs Accountability Efforts Text *Non-exhaustive inventory #ICANN50 Inventory of ICANNs Accountability Efforts Text

456 views • 29 slides

Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1

COMPANY NAME Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1 2 Your text Your text Replace your text here! Replace your text here! Replace your text here! Replace your text here! Replace

364 views • 12 slides

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program Senior Manager #ICANN51 Agenda Text Text Introduction Sarmad Hussain Need, Limitations and Mechanisms for the Root Zone LGR Marc

817 views • 65 slides

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update Wednesday, 15 October 2014 #ICANN51 Agenda Text Text Learn More about Compliance Metrics Audit Program Update Registrar Related Update

847 views • 57 slides

Text Text #ICANN50 Contractual Compliance Text Text GNSO Council Meeting Wednesday, Jun 25

Text Text #ICANN50 Contractual Compliance Text Text GNSO Council Meeting Wednesday, Jun 25 2014 #ICANN50 Objective Text Text To provide an update to the GNSO council on the Contractual Compliance efforts regarding 20130516-1 Address

675 views • 56 slides

God Rescues Daniel from the Lions Daniel 6 Here is some test text Here is some test text Here

Here is some test text Here is some test text Here is some test text God Rescues Daniel from the Lions Daniel 6 Here is some test text Here is some test text Here is some test text 1. Dedication to the Lord in prayer Here is some test text

574 views • 46 slides

5. Text CHAPTER HIGHLIGHTS Text tradition. Codes for computer text. C d f t t t

10/12/2016 CHAPTER 5. Text CHAPTER HIGHLIGHTS Text tradition. Codes for computer text. C d f t t t Font technologies. Multimedia text. Guidelines for use of text in multimedia. 2 1 10/12/2016 POWERS OF TEXT

597 views • 13 slides

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap Heap Data Data Text Text Text Program A Program B Physical Memory Stack Stack Stack Heap Stack Kernel Heap Heap Data Heap Data

1.17k views • 62 slides

Business Proposal Infographic Style Your Text Here Your Text Here Your Text Here Your Text

Business Proposal Infographic Style Your Text Here Your Text Here Your Text Here Your Text Here Your Text Here You can simply You can simply You can simply You can simply You can simply impress your impress your impress

392 views • 23 slides

How to Stay Faithful in Exile Daniel 1 Here is some test text Here is some test text Here is

Here is some test text Here is some test text Here is some test text How to Stay Faithful in Exile Daniel 1 Here is some test text Here is some test text Here is some test text 1. Remember your true identity Here is some test text Here is

661 views • 37 slides

Nehemiah Prays Nehemiah 1-2 Here is some test text Here is some test text Here is some test

Here is some test text Here is some test text Here is some test text Nehemiah Prays Nehemiah 1-2 Here is some test text Here is some test text Here is some test text 1. Nehemiah prays out of a burden for his people Here is some test text

503 views • 35 slides

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made was in the early 1940-1950s. Also the most popular video game back then was Cathode Ray Tube. Video Game Research. Video Games are sometimes

419 views • 11 slides

Title of an article [16 pt] Introduction [14 pt] Text. Text. Text. Text. Text. Text. Text. Text.

REQUIREMENTS FOR POSTER PRESENTATION 1. Volume of poster presentation (or Article) from 4 to 10 pages (A4 format). Please submit an electronic version by email. Along with the article must be submitted a review of a scientist . A review

71 views • 3 slides

Lecture 4.5: Generalized Fourier series Matthew Macauley Department of Mathematical Sciences

Lecture 4.5: Generalized Fourier series Matthew Macauley Department of Mathematical Sciences Clemson University http://www.math.clemson.edu/~macaule/ Math 4340, Advanced Engineering Mathematics M. Macauley (Clemson) Lecture 4.5: Generalized

748 views • 7 slides

Michael Spece Departments of Machine Learning and Statistics Carnegie Mellon University June 11,

Generalization Martingale Bounds Ongoing Work Generalization for Streaming Data Michael Spece Departments of Machine Learning and Statistics Carnegie Mellon University June 11, 2015 1 / 12 Generalization Martingale Bounds Ongoing Work

489 views • 12 slides

The Effect of Network Width on Stochastic Gradient Descent and Generalization Daniel S. Park

The Effect of Network Width on Stochastic Gradient Descent and Generalization Daniel S. Park Google ICML 2019 Daniel S. Park (Google) ICML 2019 1 / 9 Work with Jascha Sohl-Dickstein, Quoc V. Le and Samuel L. Smith. Daniel S. Park (Google)

377 views • 9 slides

On the Generalization Benefjt of Noise in Stochastic Gradient Descent Samuel L. Smith, Erich

On the Generalization Benefjt of Noise in Stochastic Gradient Descent Samuel L. Smith, Erich Elsen and Soham De ICML 2020 Joint work with Soham De Erich Elsen With thanks to: Esme Sutherland, James Martens, Yee Whye Teh Sander Dieleman,

778 views • 25 slides

VC GENERALIZATION BOUND VC GENERALIZATION BOUND Matthieu Bloch March 12, 2020 1 LOGISTICS (AND

VC GENERALIZATION BOUND VC GENERALIZATION BOUND Matthieu Bloch March 12, 2020 1 LOGISTICS (AND BABY PICTURE) LOGISTICS (AND BABY PICTURE) Problem Set 4 Assigned very soon, but no work expected during Spring break Project proposal Deadline

372 views • 11 slides

Lecture 4: Linear Regression Optimization Generalization Model complexity

Lecture 4: Linear Regression Optimization Generalization Model complexity Regularization Aykut Erdem October 2018 Hacettepe University 1 , 1 , , ,

840 views • 56 slides

Learning From Data Lecture 5 Training Versus Testing The Two Questions of Learning Theory of

Learning From Data Lecture 5 Training Versus Testing The Two Questions of Learning Theory of Generalization ( E in E out ) An Effective Number of Hypotheses A Combinatorial Puzzle M. Magdon-Ismail CSCI 4100/6100 recap: The Two Questions

700 views • 18 slides

Generalization Error MACH IN E LEARN IN G W ITH TREE-BAS ED MODELS IN P YTH ON Elie Kawerk

Generalization Error MACH IN E LEARN IN G W ITH TREE-BAS ED MODELS IN P YTH ON Elie Kawerk Data Scientist Supervised Learning - Under the Hood Supervised Learning: y = f ( x ) , f is unknown. MACHINE LEARNING WITH TREE-BASED MODELS IN PYTHON

601 views • 37 slides