Dense Optical Flow Prediction from a Static Image Jacob Walker, - - PowerPoint PPT Presentation

dense optical flow prediction from a static image
SMART_READER_LITE
LIVE PREVIEW

Dense Optical Flow Prediction from a Static Image Jacob Walker, - - PowerPoint PPT Presentation

Dense Optical Flow Prediction from a Static Image Jacob Walker, Abhinav Gupta, and Martial Hebert Aysun Koak Introduction Static images contain action and motion information There are several approaches Common one is


slide-1
SLIDE 1

Dense Optical Flow Prediction from a Static Image

Jacob Walker, Abhinav Gupta, and Martial Hebert

Aysun Koçak

slide-2
SLIDE 2

Introduction

  • Static images contain action and motion

information

  • There are several approaches
  • Common one is agent-centric

Activity forecasting, ECCV, 2012 Patch to the future: Unsupervised visual prediction, CVPR, 2014

slide-3
SLIDE 3

Introduction

  • Two disadvantages
  • motion is modeled as a trajectory
  • shown to perform in restrictive domains
  • This paper proposes a generalized framework
  • single or multiple agent
  • indoor or outdoor environment
slide-4
SLIDE 4

Related Work

  • Non-parametric methods
  • data-driven
  • do not make any assumptions about the underlying

scene

A data-driven approach for event predictjon, ECCV, 2010

slide-5
SLIDE 5

Related Work

  • Parametric methods
  • domain-specific approaches
  • assumptions on what are the active elements

A hierarchical representatjon for future actjon predictjon, ECCV 2014

slide-6
SLIDE 6

Related Work

  • Hybrid methods

Patch to the Future: Unsupervised Visual Prediction, CVPR 2014

slide-7
SLIDE 7

The Proposed Method

  • Predict motion of each and every pixel in terms of optical

flow

  • CNN model for motion prediction
  • Agent-free
  • Makes almost no assumptions about the underlying

scene

  • Also makes long-range prediction
slide-8
SLIDE 8

The Proposed Method

  • Learn a mapping between the input RGB image and the
  • utput space
slide-9
SLIDE 9

Training Framework

  • 1. Extract Optical Flow from Video Frames
  • UCF-101 is action recognition data set consists of

13320 videos from 101 action categories

  • HMDB-51 has 6849 videos from 51 action categories
  • Model trained with over 350,000 frames from the

UCF-101 and over 150,000 frames from the HMDB- 51

  • Labelled with DeepFlow
  • Data augmentation
slide-10
SLIDE 10

Training Framework

  • 2. Assign Optical Flow Vectors to Clusters
  • Regression as Classification
  • motion estimation can be posed as a regression

problem

  • but it has a drawback
  • so reformulate as classification
  • quantize optical flow vectors into 40 clusters by k-means
slide-11
SLIDE 11

Training Framework

  • 3. Train Convolutional Neural Network for a

Pixel Classification Problem

  • Loss function
slide-12
SLIDE 12

Experiments

  • Test on
  • UCF-101
  • HMDB-51
  • KTH contains 600 videos from 6 actions
  • 3-fold cross-validation
slide-13
SLIDE 13

Experiments

  • Metrics
  • Direction similarity
  • Orientation similarity
  • End-Point-Error
slide-14
SLIDE 14

Experiments

  • Metrics
  • Top 5
  • Top 10
slide-15
SLIDE 15

Experiments

  • UCF Dataset
slide-16
SLIDE 16

Experiments

  • HMDB Dataset
slide-17
SLIDE 17
slide-18
SLIDE 18
slide-19
SLIDE 19
slide-20
SLIDE 20

Experiments

slide-21
SLIDE 21

Multi-Frame Prediction

slide-22
SLIDE 22

Multi-Frame Prediction

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

slide-23
SLIDE 23
slide-24
SLIDE 24

Thank You