video to text task Jia Chen 1 , Shizhe Chen 2 , Qin Jin 2 , Alexander - PowerPoint PPT Presentation

Jan 26, 2024 •221 likes •363 views

INF entrance to TRECVID2018 video to text task Jia Chen 1 , Shizhe Chen 2 , Qin Jin 2 , Alexander Hauptmann 1 1 Carnegie Mellon University 2 Renmin University of China Content Recap and what's new Network architecture Limitation of

INF entrance to TRECVID2018 video to text task Jia Chen 1 , Shizhe Chen 2 , Qin Jin 2 , Alexander Hauptmann 1 1 Carnegie Mellon University 2 Renmin University of China
Content • Recap and what's new • Network architecture • Limitation of cross-entrpy loss • Bridging the exposure bias • Two losses • Experiments
Recap and What's New • Last year • Dataset vs. Network Architecture • dataset: low hanging fruit • network architecture: not too much improvement* (performance plateu) • What's new in this year • Change the loss used in the caption task • brings large gain *Knowing yourself: Improving video caption via in-depth recap. ACM MM 2017
Network Architecture • Vanilla encoder-decoder architecture[2] [2] Show and tell: A neural image caption generator. O Vinyal etc al. CVPR 2015
Network Architecture (cont'd) • temporal attention[2] [2] Describing videos by exploiting temporal structure. Yao Li etc al. ICCV 2015
Limitation of cross-entrpy loss train stage: test stage: [3] Sequence level training with recurrent neural networks. Ranzato, Marc'Aurelio, et al. ICLR 2015
Bridging the exposure gap • Solution • feed step t-1's output to step t's input through sampling • use evaluation metric as reward* • use REINFORCE to train model (an algorithm of policy gradient in reinforcement learning) 7
Bridging the exposure gap • Caveat • sometimes the algorithm may exploit the loopholes in the reward • Design a robust reward • CIDEr (closer to human evluation compared to BLEU and METEOR) • BCMR • weighted average of BLEU, CIDEr, METEOR, ROUGE
Two losses • self-critique loss • greedy decoding as baseline to reduce variance [4] Self-critical sequence training for image captioning. SJ Rennie, et al. CVPR 2017
Two losses • PROS (partially observable set) loss* distance of two captions s_i and s_j • *work under progress
Experiments • Training set • TGIF (all) • TRECVID16 (optional) • Validation set • TRECVID17 • Feature • Resnet200 (pretrained on ImageNet) • I3D (pretrained on Kinetics-400)
Experiments • performance on validation set model loss BLEU4 METEOR CIDEr vanilla cross entropy 7.1 12.4 27.6 self critique 7.7 13.2 31.3 PROS 8.1 13.9 32.5 temporal attention cross entropy 7.6 12.5 28.9 self critique 7.4 13.0 32.1
Experiments • performance on TRECVID18 model loss BLEU4 METEOR CIDEr vanilla PROS 2.4 23.1 41.6 attention self critique 1.8 22.1 40.8
Conclusion • Reformulate the problem (e.g. by loss) from scratch brings improvement over the current performance plateu

Recommend

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample text Sample

207 views • 10 slides

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here Enter Text Here Enter Text Here CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here Enter Text

697 views • 66 slides

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B Benefits C Take-Aways D Research Areas Add text add text add text add text add text add text add text add text add text add text add text E Research

513 views • 12 slides

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50 Inventory of ICANNs Accountability Efforts Text *Non-exhaustive inventory #ICANN50 Inventory of ICANNs Accountability Efforts Text

456 views • 29 slides

Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1

COMPANY NAME Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1 2 Your text Your text Replace your text here! Replace your text here! Replace your text here! Replace your text here! Replace

364 views • 12 slides

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program Senior Manager #ICANN51 Agenda Text Text Introduction Sarmad Hussain Need, Limitations and Mechanisms for the Root Zone LGR Marc

817 views • 65 slides

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update Wednesday, 15 October 2014 #ICANN51 Agenda Text Text Learn More about Compliance Metrics Audit Program Update Registrar Related Update

847 views • 57 slides

Text Text #ICANN50 Contractual Compliance Text Text GNSO Council Meeting Wednesday, Jun 25

Text Text #ICANN50 Contractual Compliance Text Text GNSO Council Meeting Wednesday, Jun 25 2014 #ICANN50 Objective Text Text To provide an update to the GNSO council on the Contractual Compliance efforts regarding 20130516-1 Address

675 views • 56 slides

God Rescues Daniel from the Lions Daniel 6 Here is some test text Here is some test text Here

Here is some test text Here is some test text Here is some test text God Rescues Daniel from the Lions Daniel 6 Here is some test text Here is some test text Here is some test text 1. Dedication to the Lord in prayer Here is some test text

574 views • 46 slides

5. Text CHAPTER HIGHLIGHTS Text tradition. Codes for computer text. C d f t t t

10/12/2016 CHAPTER 5. Text CHAPTER HIGHLIGHTS Text tradition. Codes for computer text. C d f t t t Font technologies. Multimedia text. Guidelines for use of text in multimedia. 2 1 10/12/2016 POWERS OF TEXT

597 views • 13 slides

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap Heap Data Data Text Text Text Program A Program B Physical Memory Stack Stack Stack Heap Stack Kernel Heap Heap Data Heap Data

1.17k views • 62 slides

Business Proposal Infographic Style Your Text Here Your Text Here Your Text Here Your Text

Business Proposal Infographic Style Your Text Here Your Text Here Your Text Here Your Text Here Your Text Here You can simply You can simply You can simply You can simply You can simply impress your impress your impress

392 views • 23 slides

How to Stay Faithful in Exile Daniel 1 Here is some test text Here is some test text Here is

Here is some test text Here is some test text Here is some test text How to Stay Faithful in Exile Daniel 1 Here is some test text Here is some test text Here is some test text 1. Remember your true identity Here is some test text Here is

661 views • 37 slides

Nehemiah Prays Nehemiah 1-2 Here is some test text Here is some test text Here is some test

Here is some test text Here is some test text Here is some test text Nehemiah Prays Nehemiah 1-2 Here is some test text Here is some test text Here is some test text 1. Nehemiah prays out of a burden for his people Here is some test text

503 views • 35 slides

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made was in the early 1940-1950s. Also the most popular video game back then was Cathode Ray Tube. Video Game Research. Video Games are sometimes

419 views • 11 slides

Bond Task Force Draft Bond Task Force Recommendations Tuesday, February 27 , 2018 Bond Task

Bond Task Force Draft Bond Task Force Recommendations Tuesday, February 27 , 2018 Bond Task Force Bond Task Force Background Bond Task Force Background: Why the Task Force was formed Ferndale School District putting a facilities bond

574 views • 25 slides

Lattice Cryptography: Introduction and Open Problems Daniele Micciancio Department of Computer

Lattice Cryptography: Introduction and Open Problems Daniele Micciancio Department of Computer Science and Engineering University of California, San Diego August 2015 Daniele Micciancio (UCSD) Lattice Cryptography: Introduction and Open

1.21k views • 91 slides

Centaur Verification Approach Jared Davis, Warren Hunt, Jr., Anna Slobodova, Sol Swords Bob

Centaur Verification Approach Jared Davis, Warren Hunt, Jr., Anna Slobodova, Sol Swords Bob Boyer, Gary Byers, Matt Kaufmann, Robert Krug November, 2010 Computer Sciences Department Centaur Technology, Inc. University of Texas 7600-C N.

576 views • 32 slides

Publishing date: 19/ 01/ 2015 Document title: 4b - Oil and gas UK slides ACER meeting 5 Dec 2014

Publishing date: 19/ 01/ 2015 Document title: 4b - Oil and gas UK slides ACER meeting 5 Dec 2014 We appreciate your feedback Please click on the icon to take a 5 online survey and provide your feedback about this document UK gas industry

348 views • 15 slides

Ordering Metro Lines by Block Crossings Martin Fink Lehrstuhl f ur Informatik I Universit

Ordering Metro Lines by Block Crossings Martin Fink Lehrstuhl f ur Informatik I Universit at W urzburg Joint work with Sergey Pupyrev 1 /18 Metro Maps Vienna 2 /18 Metro Maps Paris 3 /18 Metro Maps Metro Lines 4

1.2k views • 119 slides

Model Checking as A Reachability Problem Moshe Y. Vardi Rice University Engines of Progress:

Model Checking as A Reachability Problem Moshe Y. Vardi Rice University Engines of Progress: Semiconductor Technology Gordon Moore (co-founder of Intel) predicted in 1965 that the transistor density of semiconductor chips would double

486 views • 38 slides

Dominance as a New Trusted Computing Primitive for the IoT Meng Xu (Georgia Tech) Manuel Huber

1 Dominance as a New Trusted Computing Primitive for the IoT Meng Xu (Georgia Tech) Manuel Huber (Fraunhofer AISEC) Zhichuang Sun (Northeastern University) Paul England (Microsoft Research) Marcus Peinado (Microsoft Research) Sangho Lee

4.28k views • 49 slides

Basics of Complexity Complexity = resources time space ink gates energy

Basics of Complexity Complexity = resources time space ink gates energy Complexity is a function Complexity = f (input size) Value depends on: problem encoding adj. list vs. adj matrix model of

326 views • 17 slides

Mental Health Adult Pre-Charge Diversion Program Agenda Why Pre-Charge Diversion? Item 1 Item 1

Mental Health Adult Pre-Charge Diversion Program Agenda Why Pre-Charge Diversion? Item 1 Item 1 Program Development Item 2 Item 2 Program Description & Criteria Item 3 Item 3 Service Process Item 4 Item 4 Outcomes & Future

217 views • 18 slides