DVDNet: Deep Blind Video Decaptioning with 3D-2D Gated Convolutions



SLIDE 1

DVDNet Deep Blind Video Decaptioning with 3D-2D Gated Convolutions

Dahun Kim*, Sanghyun Woo*, Joonyoung Lee, In So Kweon

2018 ChaLearn Looking at People Challenge

  • Track 2. Video Decaptioning
SLIDE 2

Our Problem

Need to consider two important points:

  • 1. Video : Sequence of frames
  • 2. Blind : No inpainting mask

Remove text overlays in video

SLIDE 3

Model Overview

Two important points :

  • Video : Sequence of frames
  • Blind : No inpainting mask
  • 3D-2D U-net
  • Residual learning

+ Gated convolution

[Figure: input → 3D gated-CNN encoder → 2D gated-CNN decoder → prediction, with skip connections]

SLIDE 4

Vanilla 2D U-Net*

Two important points :

  • Video : Sequence of frames
  • Blind : No inpainting mask

Frame-by-frame operation

  • Captures spatial context
  • Misses scene dynamics

[Figure: input → 2D CNN encoder → 2D CNN decoder → prediction, with skip connections]

* Ronneberger, O. et al. “U-Net: Convolutional networks for biomedical image segmentation.” MICCAI 2015.

SLIDE 5

Input : Multiple frames

Scene dynamics

  • Aggregate hints from spatio-temporal neighborhoods

  • Object movements
  • Subtitle changes

SLIDE 6

Vanilla 3D U-Net*

Multiple frame prediction

[Figure: input frames → 3D CNN encoder → 3D CNN decoder → multi-frame prediction, with skip connections]

  • Hard problem
  • Computationally heavy
  • Non-uniform quality across predicted frames

* Çiçek, Ö. et al. “3D U-Net: Learning dense volumetric segmentation from sparse annotation.” MICCAI 2016.

SLIDE 7

Output : Single frame

Focus on a single frame

  • Aggregate hints from lagging and leading frames.

[Figure: lagging frames + center frame + leading frames → 3D-2D U-Net → single output frame]

  • Easy problem
  • Light-weight
  • Temporal view range
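This single-center-frame formulation implies a sliding temporal window at inference time. A hypothetical driver (function and model names are assumptions, not the authors' code) maps each window of T frames to one restored center frame, replicating the first/last frame at video boundaries:

```python
# Hypothetical sliding-window driver: the model restores the center frame of
# each T-frame window; boundary frames are handled by replication padding.
def decaption_video(frames, model, T=5):
    half = T // 2
    # replicate boundary frames so every position has a full T-frame window
    padded = frames[:1] * half + frames + frames[-1:] * half
    return [model(padded[i:i + T]) for i in range(len(frames))]

# toy stand-in "model": just returns the center frame of the window unchanged
restored = decaption_video(list(range(7)), model=lambda w: w[len(w) // 2], T=5)
```

With the identity stand-in model, every frame maps to itself, which checks that the window indexing is centered correctly.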
SLIDE 8

3D-2D U-Net architecture

Focus on a single frame


  • 3D convolutions flatten the encoder features into one frame.
  • Skip features are flattened the same way to match the decoder shape and concatenated.

[Figure: input → 3D gated-CNN encoder → 2D gated-CNN decoder → prediction, with skip connections]
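A minimal PyTorch sketch of the flattening idea (layer sizes, channel counts, and names are assumptions, not the paper's architecture): the 3D encoder's final kernel spans the full temporal extent, collapsing T frames to one, after which a 2D decoder produces the single restored frame.

```python
import torch
import torch.nn as nn

class Tiny3D2DUNet(nn.Module):
    """Toy 3D-encoder / 2D-decoder hybrid (illustrative, not DVDNet itself)."""
    def __init__(self, ch=8, T=5):
        super().__init__()
        self.enc = nn.Sequential(
            # spatial stride 2, temporal stride 1: (B,3,T,H,W) -> (B,ch,T,H/2,W/2)
            nn.Conv3d(3, ch, kernel_size=3, stride=(1, 2, 2), padding=1),
            nn.ReLU(),
            # temporal kernel spans all T frames: collapses time axis to 1
            nn.Conv3d(ch, ch, kernel_size=(T, 3, 3), padding=(0, 1, 1)),
            nn.ReLU(),
        )
        self.dec = nn.Sequential(
            # 2D decoder upsamples the single flattened frame back to full size
            nn.ConvTranspose2d(ch, ch, kernel_size=4, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(ch, 3, kernel_size=3, padding=1),
        )

    def forward(self, clip):          # clip: (B, 3, T, H, W)
        feat = self.enc(clip)         # (B, ch, 1, H/2, W/2)
        feat2d = feat.squeeze(2)      # drop the collapsed time axis
        return self.dec(feat2d)       # (B, 3, H, W): one restored frame

clip = torch.randn(1, 3, 5, 32, 32)
out = Tiny3D2DUNet().forward(clip)
```

The full-span temporal kernel is one simple way to "flatten" the time axis; strided temporal convolutions stacked over several layers would achieve the same shape match for the skip connections.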

SLIDE 9

Residual Learning

Two important points :

  • Video : Sequence of frames
  • Blind : No inpainting mask
  • Residual learning
  • Not touching good pixels
  • Focus on the corrupted regions

[Figure: input → 3D gated-CNN encoder → 2D gated-CNN decoder → prediction, with skip connections]

→ Implicitly learns the inpainting mask
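The residual formulation can be stated in one line: the network predicts only a correction, which is added back onto the input center frame, so wherever the predicted residual is near zero the original (uncorrupted) pixels pass through untouched. A minimal sketch (function name is an assumption):

```python
import torch

def residual_decaption(center_frame, predicted_residual):
    """Residual learning: output = input + predicted correction, clamped to
    valid intensities. A zero residual leaves good pixels exactly as they were,
    so the network only needs to act on the corrupted (captioned) regions."""
    return (center_frame + predicted_residual).clamp(0.0, 1.0)

frame = torch.rand(3, 4, 4)                     # intensities in [0, 1)
restored = residual_decaption(frame, torch.zeros_like(frame))
```

Because the network must output near-zero residuals on clean pixels, it is pushed to localize the corrupted regions, which is why the slide says it implicitly learns the inpainting mask.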

SLIDE 10

Gated Convolution*

(Convolution + Attention)

[Figure: input feature → two parallel convolutions; one passes through a sigmoid and gates the other]

  • Gating values in [0, 1]
  • Acts as soft attention over the features

* Yu, J. et al. “Free-form image inpainting with gated convolution”. arXiv preprint arXiv:1806.03589.
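The gating mechanism from Yu et al. is compact enough to sketch directly: two parallel convolutions over the same input, with one branch squashed through a sigmoid and used to softly mask the other. Class and parameter names below are illustrative assumptions.

```python
import torch
import torch.nn as nn

class GatedConv2d(nn.Module):
    """Gated convolution: feature branch modulated by a learned sigmoid gate
    in [0, 1] at every channel and spatial location (soft attention)."""
    def __init__(self, in_ch, out_ch, k=3):
        super().__init__()
        self.feature = nn.Conv2d(in_ch, out_ch, k, padding=k // 2)
        self.gate = nn.Conv2d(in_ch, out_ch, k, padding=k // 2)

    def forward(self, x):
        # element-wise product: gate ~ 0 suppresses, gate ~ 1 passes through
        return self.feature(x) * torch.sigmoid(self.gate(x))

x = torch.randn(1, 3, 8, 8)
y = GatedConv2d(3, 6).forward(x)
```

In the blind setting this matters because there is no mask input: the gate branch can learn to down-weight features from caption-covered pixels on its own. The same construction extends to `nn.Conv3d` for the encoder.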

SLIDE 11

Loss Function

L1 + gradient L1 + SSIM loss
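A sketch of the first two terms (weights and function names are assumptions; the SSIM term from the slide is omitted here for brevity): pixel-wise L1 plus an L1 penalty on horizontal and vertical finite-difference image gradients, which encourages sharp edges in the restoration.

```python
import torch
import torch.nn.functional as F

def decaption_loss(pred, target, w_grad=1.0):
    """Pixel L1 + gradient L1 (SSIM term from the slide not included).
    Weights are illustrative, not the authors' values."""
    l1 = F.l1_loss(pred, target)
    # finite-difference gradients along width (dx) and height (dy)
    dx = lambda t: t[..., :, 1:] - t[..., :, :-1]
    dy = lambda t: t[..., 1:, :] - t[..., :-1, :]
    grad_l1 = F.l1_loss(dx(pred), dx(target)) + F.l1_loss(dy(pred), dy(target))
    return l1 + w_grad * grad_l1

t = torch.rand(1, 3, 8, 8)
loss = decaption_loss(t, t)   # identical inputs give exactly zero loss
```

The gradient term penalizes edge mismatches that plain L1 averages away, while SSIM (e.g. `skimage.metrics.structural_similarity` as a reference implementation) adds a structural/perceptual component.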

SLIDE 12

Quantitative Results

SLIDE 13

Qualitative Results

SLIDE 14

DVDNet Deep Blind Video Decaptioning with 3D-2D Gated Convolutions

Dahun Kim*, Sanghyun Woo*, Joonyoung Lee, In So Kweon

2018 ChaLearn Looking at People Challenge

  • Track 2. Video Decaptioning