CS 4495 Computer Vision: Activity Recognition
Aaron Bobick, School of Interactive Computing
(PowerPoint presentation transcript)

SLIDE 1

Activity Recognition 1 CS 4495 Computer Vision – A. Bobick

Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision Activity Recognition

SLIDE 2

Administrivia

  • PS6 – should be working on it! Due Sunday Nov 24th.
  • Exam: Tues November 26th.
  • Short answer and multiple choice (mostly short answer)
  • Study guide is posted in calendar.
  • PS7 – we hope to have out by 11/26. Will be a straightforward implementation of Motion History Images.

SLIDE 3

Video

  • A video is a sequence of frames captured over time
  • Now our image data is a function of space (x, y) and time (t)

SLIDE 4

Video as an “Image Stack”

  • Can look at video data as a spatio-temporal volume
  • If camera is stationary, each line through time corresponds to a single ray in space

[Figure: video volume; time axis t, intensity 0–255]

Alyosha Efros, CMU
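The "image stack" view is easy to sketch in numpy (toy data; a minimal illustration, not from the lecture):

```python
import numpy as np

# Toy video: T frames of H x W grayscale, stacked into a (T, H, W) volume.
T, H, W = 10, 4, 5
video = np.zeros((T, H, W), dtype=np.uint8)
for t in range(T):
    video[t, :, :] = t * 10          # frame t has constant intensity 10*t

# With a stationary camera, fixing (y, x) and varying t gives the
# intensity of a single ray in space over time.
ray = video[:, 2, 3]                 # temporal line through pixel (2, 3)
print(ray)                           # [0 10 20 ... 90]
```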

SLIDE 5

Aside: Epipolar Plane (“EPI”) images

SLIDE 6

Aside: Epipolar Plane (“EPI”) images

SLIDE 7

EPI images and activity

SLIDE 8

EPI images and activity

SLIDE 9

Processing video: object detection

  • If the goal of “activity recognition” is to recognize the activity of the objects…
  • … you (may) have to find the objects….
SLIDE 10

Slide credit: Birgi Tamersoy

Background subtraction

SLIDE 11

Background subtraction

  • Simple techniques can do ok with static camera
  • …But hard to do perfectly
  • Widely used:
  • Traffic monitoring (counting vehicles, detecting & tracking vehicles, pedestrians)

  • Human action recognition (run, walk, jump, squat),
  • Human-computer interaction
  • Object tracking
SLIDE 12

Simple approach: background subtraction

Slide credit: Birgi Tamersoy
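A minimal sketch of this simple approach, assuming a grayscale frame and a known background image (the threshold value is illustrative):

```python
import numpy as np

def background_subtract(frame, background, thresh=25):
    """Foreground mask: per pixel, |I_t - B| > Th."""
    diff = np.abs(frame.astype(np.int16) - background.astype(np.int16))
    return diff > thresh

background = np.full((4, 4), 100, dtype=np.uint8)
frame = background.copy()
frame[1:3, 1:3] = 200                # a bright "object" enters
mask = background_subtract(frame, background)
print(mask.sum())                    # 4 foreground pixels
```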

SLIDE 13

Frame differencing

Slide credit: Birgi Tamersoy

SLIDE 14

Frame differencing

Slide credit: Birgi Tamersoy
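Frame differencing compares consecutive frames instead of a stored background; a minimal numpy sketch with toy values:

```python
import numpy as np

def frame_difference(frame_t, frame_prev, thresh=25):
    """Motion mask from consecutive frames: |I_t - I_{t-1}| > Th."""
    diff = np.abs(frame_t.astype(np.int16) - frame_prev.astype(np.int16))
    return diff > thresh

prev = np.full((4, 4), 50, dtype=np.uint8)
curr = prev.copy()
curr[0, 0] = 200                     # one pixel changed between frames
mask = frame_difference(curr, prev)
print(mask.sum())                    # 1
```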

SLIDE 15

Mean filtering

Slide credit: Birgi Tamersoy

SLIDE 16

Frame differences

  • vs. background subtraction
  • Toyama et al. 1999
SLIDE 17

Median Filtering

Slide credit: Birgi Tamersoy
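A sketch of the median background model, assuming a small buffer of recent frames (sizes are illustrative):

```python
import numpy as np

# Median filtering: the background estimate at each pixel is the median
# of that pixel's values over the last N frames; an object that keeps
# moving occupies any given pixel a minority of the time, so it is rejected.
N, H, W = 5, 3, 3
frames = np.full((N, H, W), 80, dtype=np.uint8)
frames[2, 1, 1] = 255                # transient object in one frame only
background = np.median(frames, axis=0).astype(np.uint8)
print(background[1, 1])              # 80 -- the outlier frame is rejected
```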

SLIDE 18

Average/Median Image

Alyosha Efros, CMU

SLIDE 19

Background Subtraction

[Figure: frame minus background equals the foreground object]

Alyosha Efros, CMU

SLIDE 20

Pros and cons

Advantages:

  • Extremely easy to implement and use!
  • All pretty fast.
  • Corresponding background models need not be constant; they change over time.

Disadvantages:

  • Accuracy of frame differencing depends on object speed and frame rate
  • Median background model: relatively high memory requirements.
  • Setting the global threshold Th…

When will this basic approach fail?

Slide credit: Birgi Tamersoy

SLIDE 21

Background mixture models

  • Adaptive Background Mixture Models for Real-Time Tracking, Chris Stauffer & W.E.L. Grimson

Idea: model each background pixel with a mixture of Gaussians; update its parameters over time.
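A heavily simplified sketch of the adaptive idea, using a single Gaussian per pixel rather than the paper's mixture of K Gaussians (the class name, learning rate, and 2.5σ test here are illustrative assumptions, not the paper's exact procedure):

```python
import numpy as np

class SingleGaussianBackground:
    """One Gaussian per pixel, updated with learning rate alpha."""
    def __init__(self, first_frame, alpha=0.05):
        self.mean = first_frame.astype(np.float64)
        self.var = np.full(first_frame.shape, 100.0)
        self.alpha = alpha

    def apply(self, frame):
        frame = frame.astype(np.float64)
        d2 = (frame - self.mean) ** 2
        foreground = d2 > 6.25 * self.var       # more than 2.5 sigma away
        bg = ~foreground
        # Update the model only where the pixel matched the background.
        self.mean[bg] += self.alpha * (frame - self.mean)[bg]
        self.var[bg] += self.alpha * (d2 - self.var)[bg]
        return foreground

model = SingleGaussianBackground(np.full((3, 3), 100, dtype=np.uint8))
frame = np.full((3, 3), 100, dtype=np.uint8)
frame[0, 0] = 250                               # outlier pixel
fg = model.apply(frame)
print(fg[0, 0], fg[1, 1])                       # True False
```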

SLIDE 22

Background subtraction with depth

How can we select foreground pixels based on depth information?
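One plausible answer, sketched under the assumption of a per-pixel background depth map and a sensor that reports 0 for invalid pixels (the margin value is illustrative):

```python
import numpy as np

def depth_foreground(depth, background_depth, margin=100):
    """A pixel is foreground if it is significantly *closer* to the
    camera than the stored background depth (depths in mm)."""
    valid = depth > 0                 # depth sensors report 0 where invalid
    return valid & (depth < background_depth - margin)

background_depth = np.full((3, 3), 3000, dtype=np.int32)   # wall at 3 m
depth = background_depth.copy()
depth[1, 1] = 1500                                         # person at 1.5 m
depth[0, 0] = 0                                            # invalid reading
mask = depth_foreground(depth, background_depth)
print(mask[1, 1], mask[0, 0])                              # True False
```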

SLIDE 23

Human activity in video

No universal terminology, but approximately:

  • “Event”: a single instant-in-time detection.
  • “Actions” or “Movements”: atomic motion patterns – often gesture-like; single clear-cut trajectory; single nameable behavior (e.g., sit, wave arms)
  • “Activity”: series or composition of actions (e.g., interactions between people)

Adapted from Venu Govindaraju and A.Bobick

SLIDE 24

Surveillance

http://users.isr.ist.utl.pt/~etienne/mypubs/Auvinetal06PETS.pdf

SLIDE 25

Human activity in video: basic approaches

  • Model-based action recognition:
  • Use human body tracking and pose estimation techniques, relate to action descriptions (or learn)
  • Major challenge: accurate tracks in spite of occlusion, ambiguity, low resolution
  • Model-based activity recognition:
  • Given some lower level detection of actions (or events), recognize the activity by comparing to some structural representation of the activity
  • Needs to handle uncertainty.
  • Activity as motion, space-time appearance patterns
  • Describe overall patterns, but no explicit body tracking
  • Typically learn a classifier
  • Recently: “activity recognition” from a static image
  • Imagine a picture of a person holding a flute. What are they doing?

SLIDE 26

Motion and perceptual organization

  • Even “impoverished” motion data can evoke a strong percept

SLIDE 27

Motion and perceptual organization

  • Even “impoverished” motion data can evoke a strong percept

SLIDE 28

Example

  • Even “impoverished” motion data can evoke a strong percept (video from Davis & Bobick)

SLIDE 29

Motion energy images

  • Spatial accumulation of motion.
  • Collapse over specific time window.
  • Motion measurement method not critical (e.g. motion differencing).

SLIDE 30

Motion history images

  • Motion history images are a different function of the temporal volume.
  • Pixel operator is replacement decay:

    if moving:   Iτ(x,y,t) = τ
    otherwise:   Iτ(x,y,t) = max(Iτ(x,y,t−1) − 1, 0)

  • Trivial to construct Iτ−k(x,y,t) from Iτ(x,y,t), so can process multiple time window lengths without more search.
  • MEI is thresholded MHI

[Figure: MHI examples, “Moved t−1” and “Moved t−15”]
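The replacement-decay operator above can be written directly (τ and the toy motion mask are illustrative):

```python
import numpy as np

def update_mhi(mhi, motion_mask, tau):
    """Replacement-decay update of a motion history image:
    decay every pixel by 1 (floor 0), then set moving pixels to tau."""
    mhi = np.maximum(mhi - 1, 0)
    mhi[motion_mask] = tau
    return mhi

tau = 15
mhi = np.zeros((3, 3), dtype=np.int32)
mask = np.zeros((3, 3), dtype=bool)
mask[0, 0] = True
mhi = update_mhi(mhi, mask, tau)      # pixel (0,0) just moved -> tau
mask[:] = False
mhi = update_mhi(mhi, mask, tau)      # one step of decay
print(mhi[0, 0])                      # 14

# The MEI is just the thresholded MHI:
mei = mhi > 0
```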

SLIDE 31

Temporal-templates

  • MEI + MHI = Temporal template

[Figure: motion history image and motion energy image]

SLIDE 32

Aerobics examples

SLIDE 33

Davis & Bobick 1999: The Representation and Recognition of Action Using Temporal Templates

Motion Energy Images

SLIDE 34

How to recognize these images?

  • These are gray scale blob-like images.
  • 100 years of computer vision for recognizing gray blobs (for small values of a hundred).
  • Old style computer vision:
    1. compute some summarization statistics of the pattern
    2. construct generative model
    3. recognize based upon those statistics

SLIDE 35

Image moments

Moments summarize a shape, given image I(x,y):

    M_ij = Σ_x Σ_y x^i y^j I(x,y)

Central moments are translation invariant:

    μ_pq = Σ_x Σ_y (x − x̄)^p (y − ȳ)^q I(x,y)

where x̄ = M10 / M00 and ȳ = M01 / M00.

SLIDE 36

Hu moments

  • Set of 7 moments
  • Apply to Motion History Image for a global space-time “shape” descriptor
  • Translation, rotation, and scale invariant

    [h1, h2, h3, h4, h5, h6, h7]

SLIDE 37

Hu moments (in terms of the normalized central moments η_pq):

    h1 = η20 + η02
    h2 = (η20 − η02)² + 4 η11²
    h3 = (η30 − 3η12)² + (3η21 − η03)²
    h4 = (η30 + η12)² + (η21 + η03)²
    h5 = (η30 − 3η12)(η30 + η12)[(η30 + η12)² − 3(η21 + η03)²] + (3η21 − η03)(η21 + η03)[3(η30 + η12)² − (η21 + η03)²]
    h6 = (η20 − η02)[(η30 + η12)² − (η21 + η03)²] + 4 η11 (η30 + η12)(η21 + η03)

SLIDE 38

Hu moments (continued):

    h7 = (3η21 − η03)(η30 + η12)[(η30 + η12)² − 3(η21 + η03)²] − (η30 − 3η12)(η21 + η03)[3(η30 + η12)² − (η21 + η03)²]
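A sketch of the first two Hu moments built from normalized central moments (pure numpy for illustration; OpenCV's cv2.HuMoments computes all seven). The translated blob demonstrates translation invariance; rotation and scale invariance hold up to discretization effects:

```python
import numpy as np

def central_moment(I, p, q):
    y, x = np.mgrid[0:I.shape[0], 0:I.shape[1]]
    m00 = I.sum()
    xbar = (x * I).sum() / m00
    ybar = (y * I).sum() / m00
    return np.sum(((x - xbar) ** p) * ((y - ybar) ** q) * I)

def eta(I, p, q):
    """Normalized central moment: eta_pq = mu_pq / mu_00^(1+(p+q)/2)."""
    return central_moment(I, p, q) / central_moment(I, 0, 0) ** (1 + (p + q) / 2)

def hu_first_two(I):
    h1 = eta(I, 2, 0) + eta(I, 0, 2)
    h2 = (eta(I, 2, 0) - eta(I, 0, 2)) ** 2 + 4 * eta(I, 1, 1) ** 2
    return h1, h2

I = np.zeros((20, 20))
I[4:8, 4:8] = 1.0                        # blob
J = np.zeros((20, 20))
J[10:14, 9:13] = 1.0                     # identical blob, translated
print(np.allclose(hu_first_two(I), hu_first_two(J)))   # True
```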

SLIDE 39

Build a classifier

  • Generative or Discriminative?
  • Generative – builds model of each class; compare all
  • Discriminative – builds model of the boundary between classes
  • How would you build decent generative models of each class of action?
  • Use a Gaussian in Hu-moment feature space
  • Compare likelihoods p(data | model of action i)
  • If have priors, use them by Bayes rule
  • Otherwise just use likelihood.
  • Or use NN? (Problem Set!)
  • More on classification on Dec 3

    p(model_i | data) ∝ p(data | model_i) p(model_i)
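A sketch of the generative approach: one Gaussian per action class over Hu-moment features, classifying by maximum likelihood since no priors are assumed (the class labels, toy feature values, and diagonal covariance are illustrative, not from the lecture's data):

```python
import numpy as np

class GaussianClassifier:
    """One diagonal Gaussian per class; predict the max-likelihood class."""
    def fit(self, features_by_class):
        self.models = {}
        for label, X in features_by_class.items():
            X = np.asarray(X, dtype=np.float64)
            self.models[label] = (X.mean(axis=0), X.var(axis=0) + 1e-6)
        return self

    def log_likelihood(self, x, label):
        mean, var = self.models[label]
        return float(np.sum(-0.5 * np.log(2 * np.pi * var)
                            - 0.5 * (x - mean) ** 2 / var))

    def predict(self, x):
        # No priors available: just compare likelihoods p(data | model_i).
        return max(self.models, key=lambda lbl: self.log_likelihood(x, lbl))

train = {
    "wave": [[0.2, 0.01], [0.22, 0.012], [0.19, 0.011]],
    "sit":  [[0.5, 0.05], [0.52, 0.048], [0.49, 0.051]],
}
clf = GaussianClassifier().fit(train)
print(clf.predict(np.array([0.21, 0.011])))    # wave
```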

SLIDE 40

Recognizing temporal templates

  • For MEI and MHI compute global properties (e.g. Hu moments). Treat both as grayscale images.
  • Collect statistics on distribution of those properties over people for each movement.

  • At run time, construct MEIs and MHIs backwards in time.
  • Recognize movements as soon as they complete.
  • Linear time scaling.
  • Compute range of τ using the min and max of training data.
  • Simple recursive formulation so very fast.
  • Filter implementation obvious so biologically “relevant”.
  • Best reference is PAMI 2001, Bobick and Davis
SLIDE 41

Virtual PAT (Personal Aerobics Trainer)

  • Uses MHI recognition
  • Portable IR background subtraction system (CAPTECH ’98)

SLIDE 42

The KidsRoom

  • A narrative, interactive children’s playspace.
  • Demonstrates computer vision “action” recognition.
  • Sometimes, possible because the machine knows the context.
  • A kinder, gentler C3I interface
  • Ported to the Millennium Dome, London, 2001
  • Summary and critique in Presence, August 1999.

SLIDE 43

Recognizing Movement in the KidsRoom

  • First teach the kids, then observe.
  • Temporal templates “plus” (details in the paper).
  • Monsters always do something, but only speak it when sure.

SLIDE 44

So far…

  • Background subtraction:
  • Essential low-level processing tool to segment moving objects from a static camera’s video

  • Action recognition:
  • Increasing attention to actions as motion and appearance patterns
  • For instrumented/constrained environments, relatively simple techniques allow effective gesture or action recognition

SLIDE 45

A little philosophy…

SLIDE 46

What is the goal of a representation of activity/behaviors?

  • Recognition implies representation
  • Representations can talk about what events *ARE*:
  • Definitional – but sometimes not “real” because primitives not grounded
  • Permits specification of reasoning mechanism
  • Context can be made explicit (but is not usually)
  • Hard to learn
  • Representations can talk about what events *LOOK LIKE*:
  • Sometimes learnable, always well defined primitives
  • Typically not guaranteed to be complete
  • Have no explanatory power
  • Often leverages (i.e., is wholly dependent upon) context – makes it learnable from specific data

SLIDE 47

Data-driven vs Knowledge-taught

[Diagram: representations arranged from data-driven/statistical to knowledge-based/structural, and from movement to activity, in order of increasing temporal and relational complexity: MHIs, PHMMs, SCFGs, P-Nets, Action BNs, PNF, Event N-grams, Suffix Trees]

SLIDE 48

Skip to P-Nets?

SLIDE 49

Data-driven vs Knowledge-taught

[Diagram repeated from slide 47]

SLIDE 50

Structure and Statistics

  • Grammar-based representation and parsing
  • Highly expressive for activity description
  • Easy to build higher level activity from reused low level vocabulary.
  • P-Net (Propagation nets) – really stochastic Petri nets
  • Specify the structure – with some annotation can learn detectors and triggering probabilities
  • Statistics of events
  • Low level events are statistically sequenced – too hard to learn a full model.
  • N-grams or suffix trees

SLIDE 51

"Higher-level" Activities: Known structure, uncertain elements

  • Many activities are composed of a priori defined sequences of primitive elements.
  • Dancing, conducting, pitching, stealing a car from a parking lot.
  • The states are not hidden.
  • The activities can be described by a set of grammar-like rules; often ad hoc approaches are taken.

  • But, the sequences are uncertain:
  • Uncertain performance of elements
  • Uncertain observation of elements
SLIDE 52

The basic idea and approach

Idea: split the problem into:

  • Low-level primitives with uncertain feature detection (individual elements might be HMMs)
  • High-level description found by parsing the input stream of uncertain primitives.

Approach:

  • Extend Stochastic Context Free Grammars to handle perceptually relevant uncertainty.

SLIDE 53

Stochastic CFGs

  • Traditional SCFGs have probabilities associated with the production rules. Traditional parsing yields the most likely parse given a known set of input symbols.

    PIECE -> BAR PIECE [0.5] | BAR [0.5]
    BAR   -> TWO [0.5] | THREE [0.5]
    THREE -> down3 right3 up3 [1.0]
    TWO   -> down2 up2 [1.0]

  • Thanks to Andreas Stolcke’s prior work on parsing SCFGs using an efficient Earley parser.
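The parse probability of a string under the toy grammar above can be checked by brute-force enumeration over spans (a teaching sketch, not Stolcke's Earley parser):

```python
# Rule probabilities as on the slide.
RULES = {
    "PIECE": [(("BAR", "PIECE"), 0.5), (("BAR",), 0.5)],
    "BAR":   [(("TWO",), 0.5), (("THREE",), 0.5)],
    "THREE": [(("down3", "right3", "up3"), 1.0)],
    "TWO":   [(("down2", "up2"), 1.0)],
}

def inside(symbol, tokens):
    """P(symbol =>* tokens): total probability over all derivations."""
    tokens = tuple(tokens)
    if symbol not in RULES:                      # terminal symbol
        return 1.0 if tokens == (symbol,) else 0.0
    return sum(p * splits(rhs, tokens) for rhs, p in RULES[symbol])

def splits(rhs, tokens):
    """Sum over ways to split tokens among the symbols of rhs."""
    if not rhs:
        return 1.0 if not tokens else 0.0
    head, rest = rhs[0], rhs[1:]
    total = 0.0
    for i in range(len(tokens) + 1):
        p_head = inside(head, tokens[:i])
        if p_head:           # prune zero branches (also guarantees termination)
            total += p_head * splits(rest, tokens[i:])
    return total

piece = ["down2", "up2", "down3", "right3", "up3"]   # a TWO then a THREE
print(inside("PIECE", piece))                        # 0.0625
```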

SLIDE 54

Extending SCFGs (Ivanov and Bobick, PAMI)

  • Within the parser we handle:
  • Uncertainty about input symbols
  • Input is multi-valued string (vector of likelihoods)
  • Deletion, substitution, and insertion errors
  • Introduce error rules
  • Individually recognized primitives typically temporally inconsistent
  • Introduce penalty for overlap.
  • Spatial and temporal consistency enforced.
  • Need to define when a symbol has been generated. We have low-level primitives or even HMMs.
  • How do we learn production probabilities? (Not many examples.) Make sure the parser is not too sensitive to them.

SLIDE 55

Video Sample

SLIDE 56

Event Grammar and Parsing

  • Tracker generates events: ENTER, LOST, FOUND, EXIT, STOP. Tracks have properties (e.g. size) and trajectories.
  • Tracker assigns class to each event, though only probabilistically.
  • Parser parses single stream that contains interleaved events: (CAR-ENTER, CAR-STOP, PERSON-FOUND, CAR-EXIT, PERSON-EXIT)
  • Parser enforces spatial and temporal consistency for each object class and interactions (e.g. to be a PICK-UP, the PERSON-FOUND event must be close to CAR-STOP)
  • Spatial and temporal consistency eliminates symbolic ambiguity.

SLIDE 57

Advantages of SCFGs

  • What grammar can do (simplified):

    CAR_PASS   -> CAR_ENTER CAR_EXIT
                | CAR_ENTER CAR_HIDDEN CAR_EXIT
    CAR_HIDDEN -> CAR_LOST CAR_FOUND
                | CAR_LOST CAR_FOUND CAR_HIDDEN

  • Skip allows concurrency (and junk):

    PERSON_LOST -> person_lost | SKIP person_lost

  • Concurrent parse:

    Events:  ce pe cl cf cs px pl cx
    PICKUP -> ce pe cl cf cs px pl cx
    P_PASS -> ce pe cl cf cs px pl cx

SLIDE 58

Parsing System

SLIDE 59

Parse 1: Person-pass-through

SLIDE 60

Parse 2: Drive-in

SLIDE 61

Parse 3: Car-pass-through

SLIDE 62

Parse 4: Drop-off

SLIDE 63

Advantages of STCFG approach

  • Structure and components of activities defined a priori and are the right levels of annotation to recover (compare to HMMs).
  • FSM vs CFG is not the point. Rather explicit representation of structural elements and uncertainties.
  • Often many (enough) examples of each primitive to support training, but not of higher level activity.
  • Allows for integration of heterogeneous primitive detectors; only assumes likelihood generation.
  • More robust than ad-hoc rule based techniques: handles errors through probability.
  • No notion of causality, or anything other than (multi-stream) sequencing.

SLIDE 64

Advantages of STCFG approach (content identical to slide 63)

SLIDE 65

P-Nets (Propagation Networks) (Shi and Bobick, ’04 and ’06)

  • Nodes represent activation intervals
  • Active vs. inactive: token propagation
  • More than one node can be active at a time!
  • Links represent partial order as well as logical constraint
  • Duration model on each link and node:
  • Explicit model on length of activation
  • Explicit model on length between successive intervals
  • Observation model on each node

SLIDE 66

Conceptual Schema

  • Logical relation
  • Autonomous assumption: logic constraints only exist at start/end points of intervals
  • Conditional probability function can represent any logical function

Examples of logic constraint

SLIDE 67

Propagation Net – Computing

  • Computational Schema: a DBN-style rollout to compute the corresponding conceptual schema

SLIDE 68

Experiment: Glucose Project

  • Task: monitor a user calibrating a glucose meter and point out operating errors as feedback.
  • Constructed 16 node P-Net as representation
  • 3 subjects with total of 21 perfect sequences, 10 missing_1_step sequences and 10 missing_6_steps sequences

SLIDE 69

D-Condensation

    Initiate 1 particle at dummy starting node
    Repeat
        For each particle
            generate all possible consequent states
            calculate the probability for each state
        End
        Select n particles to survive
    Until the final time step is reached
    Output the path represented by the particle with highest probability
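The loop above can be sketched as a beam search over a toy node graph (the nodes, transitions, and observation scores here are made up for illustration):

```python
# Toy 4-node chain with per-step observation likelihoods.
NEXT = {"start": ["A"], "A": ["A", "B"], "B": ["B", "end"], "end": []}
OBS = {
    0: {"A": 0.9, "B": 0.1},
    1: {"A": 0.2, "B": 0.8},
    2: {"B": 0.3, "end": 0.7},
}

def d_condensation(n_particles=3, T=3):
    particles = [(["start"], 1.0)]            # one particle at the dummy start node
    for t in range(T):
        expanded = []
        for path, p in particles:
            for nxt in NEXT[path[-1]]:        # all possible consequent states
                expanded.append((path + [nxt], p * OBS[t].get(nxt, 1e-6)))
        # Select the n best particles to survive.
        particles = sorted(expanded, key=lambda pp: pp[1], reverse=True)[:n_particles]
    # Output the path of the highest-probability particle.
    return max(particles, key=lambda pp: pp[1])[0]

print(d_condensation())                       # ['start', 'A', 'B', 'end']
```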

SLIDE 70

Experiment: Glucose Meter Calibration

SLIDE 71

Experiment: Classification Performance

SLIDE 72

Experiment: Label individual frames

[Figure: labeling individual nodes; labels on Node J: Insert]

SLIDE 73

Now finish with most recent work…