Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Aaron Bobick School of Interactive Computing
CS 4495 Computer Vision Activity Recognition Aaron Bobick School - - PowerPoint PPT Presentation
Activity Recognition 1 CS 4495 Computer Vision A. Bobick CS 4495 Computer Vision Activity Recognition Aaron Bobick School of Interactive Computing Activity Recognition 1 CS 4495 Computer Vision A. Bobick Administrivia PS6
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Aaron Bobick School of Interactive Computing
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
forward implementation of Motion History Images
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
(x, y) and time (t)
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
single ray in space
t
255
time
Alyosha Efros, CMU
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
activity of the objects…
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
pedestrians),
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Alyosha Efros, CMU
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Alyosha Efros, CMU
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Advantages:
they change over time. Disadvantages:
and frame rate
requirements.
Slide credit: Birgi Tamersoy
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Idea: model each background pixel with a mixture of Gaussians; update its parameters over time.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Adapted from Venu Govindaraju and A.Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
http://users.isr.ist.utl.pt/~etienne/mypubs/Auvinetal06PETS.pdf
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
action descriptions (or learn)
resolution
activity by comparing to some structural representation of the activity
What are they doing?
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
percept
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
percept
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
percept Video from Davis & Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
differencing).
Time
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
function of temporal volume.
if moving Iτ (x,y,t) = τ
Iτ(x,y,t) = max(Iτ(x,y,t-1)-1 ,0)
Iτ(x,y,t) so can process multiple time window lengths without more search.
Moved t-1 Moved t-15
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
motion history image motion energy image
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Davis & Bobick 1999: The Representation and Recognition of Action Using Temporal Templates
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
(for small values of a hundred).
1.
compute some summarization statistics of the pattern
2.
construct generative model
3.
recognize based upon those statistics.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Moments summarize a shape given image I(x,y) Central moments are translation invariant:
i j ij x y
p q pq x y
10 01 00 00
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
“shape” descriptor
7 6 5 4 3 2 1
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
1
2
3
4
5
6
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
7
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
(model | data) p(data | model ) p(model )
i i i
p ∝
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Treat both as grayscale images.
each movement.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
‘98)
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
children’s playspace.
“action” recognition.
the machine knows the context.
London, 2001
Presence, August 1999.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
“plus” (but in paper).
something, but only speak it when sure.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
static camera’s video
techniques allow effective gesture or action recognition
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
What is the goal of a representation of activity/behaviors?
grounded
learnable from specific data
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Data-driven Knowledge Statistical Structural
Movement Activity
MHI’s PHMM’s SCFG’s P-Net’s Action BN’s PNF Event N-grams Suffix Trees
Temporal and relational complexity
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Data-driven Knowledge Statistical Structural
Movement Activity
MHI’s PHMM’s SCFG’s P-Net’s Action BN’s PNF Event N-grams Suffix Trees
Temporal and relational complexity
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
and triggering probabilities
model.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
"Higher-level" Activities: Known structure, uncertain elements
sequences of primitive elements.
rules; often ad hoc approaches taken.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
(individual elements might be HMMs)
uncertain primitives.
perceptually relevant uncertainty.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
production rules. Traditional parsing yields most likely parse given a known set of input symbols.
priori work on parsing SCFGs using efficient Earley parser.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
have some level primitives or even HMMs.
examples.) Make sure not too sensitive to them.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
probabilistically.
events: (CAR-ENTER, CAR-STOP, PERSON-FOUND, CAR-EXIT, PERSON-EXIT)
PERSON-FOUND event must be close to CAR-STOP)
ambiguity.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
CAR_PASS -> CAR_ENTER CAR_EXIT | CAR_ENTER CAR_HIDDEN CAR_EXIT CAR_HIDDEN -> CAR_LOST CAR_FOUND | CAR_LOST CAR_FOUND CAR_HIDDEN
PERSON_LOST -> person_lost | SKIP person_lost
Events: ce pe cl cf cs px pl cx PICKUP -> ce pe cl cf cs px pl cx P_PASS -> ce pe cl cf cs px pl cx
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
and are the right levels of annotation to recover (compare to HMMs).
representation of structural elements and uncertainties.
support training, but not of higher level activity.
detectors; only assumes likelihood generation.
errors through probability.
stream) sequencing.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
priori and are the right levels of annotation to recover (compare to HMMs).
representation of structural elements and uncertainties.
support training, but not of higher level activity.
detectors; only assumes likelihood generation.
handles errors through probability.
stream) sequencing.
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
intervals
propagation
active at a time!
well logical constraint
and node:
activation
between successive intervals
node
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
points of any intervals
Examples of logic constraint
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
A DBN style rollout to compute corresponding conceptual schema
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
point out operating error as feedback.
missing_1_step sequences and 10 missing_6_steps sequences
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Initiate 1 particle at dummy starting node Repeat
For each particle generate all possible consequent states calculate the probability for each states End Select n particles to survive
Until the final time steps is reached Output the path represented by the particle with highest probability
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick
Labeling individual nodes Labels on Node J: Insert
Activity Recognition 1 CS 4495 Computer Vision – A. Bobick