1
EE 6882 Statistical Methods for Video Indexing and Analysis
Fall 2003
- Prof. Shih-Fu Chang
EE 6882 Statistical Methods for Video Indexing and Analysis Fall - - PowerPoint PPT Presentation
EE 6882 Statistical Methods for Video Indexing and Analysis Fall 2003 Prof. Shih-Fu Chang http://www.ee.columbia.edu/~sfchang Lecture 1 (9/3/03) 1 Research Problems in Video Indexing and Analysis Object detection and recognition (e.g.,
1
2 EE6882-Chang
Object detection and recognition
Structure parsing
Event detection
Search and retrieval
Synthesis
3 EE6882-Chang
shot story anchor shot
4 EE6882-Chang
Emerging mature tools and promising
Increasing computing resources More challenging, interesting problems Increasing benchmark data
5 EE6882-Chang
Learn insights of different tools and
Understand match between tools and
Get some experience on tools publicly
Related hard-core courses, see web site
6 EE6882-Chang
Problems
Image/video classification Interactive image retrieval Video structure parsing Multimedia data mining
Techniques
Bayesian, factor graph, graphical model HMM and variations SVM Hierarchical Mixture
7 EE6882-Chang
(From Jain, Duin, and Mao, SPR Review, ’99)
8 EE6882-Chang
PCA Fischer Analysis MDS Kernel PCA (Jain et al 99)
9 EE6882-Chang
There are no universally optimal classifiers! Statistical structures of problems and models
Generation vs. discrimination Feature representation and selection Amount of training/test data Performance estimation and comparison Online vs. offline User supervision/feedback
10 EE6882-Chang
Rule of thumb -- # of training patterns per class / # of features > 10
11 EE6882-Chang
A few examples from paper list
12 EE6882-Chang
(Valaiya et al)
13 EE6882-Chang
Feature independence MAP Classification VQ as distribution estimator
14 EE6882-Chang
(Naphade et al)
15 EE6882-Chang
Extract > 45K selective efficient features by multi-scale filtering
Classifier combination and sample re-weighting (Tieu and Viola)
16 EE6882-Chang
User selected examples 20 retrieval results Negative images in the training set close to decision boundary Images in the testing set close to the decision boundary
Real-time evaluation of 20 features over millions of images
17 EE6882-Chang
?
time {video, audio}
a static face? motion energy changes? change from music to speech? speech segment? {cue words}j appear {cue words}i appear
k
k
1 k
1 k
(Hsu and Chang)
k
18 EE6882-Chang
(Duygulu et al)
19 EE6882-Chang
time
top-level states running pitching break bottom-level states
bench close up batter audience field bird view pitcher 1st base
Learning Multi-Level Markovian Temporal Dependence
Baseball Example
(Xie et al)
20 EE6882-Chang
Reading seminar 2 papers reviewed and demonstrated each week
Each student assigned one paper
Everyone writes comments before and after class
Term project at the end of course (12/10/03)
21 EE6882-Chang
Each paper allocated 60 mins total Discuss paper and plan demos with me
Prepare copies of slide handouts before
Computer demo of the reviewed
22 EE6882-Chang
Review
Background review and examples Problem addressed and main ideas Insights about why it works Limitation, generality, and repeatability Alternatives and comparisons
Demo
Software and data available and repeatable? Reconstruct the method and try on toy data set?
(from some publicly available generic toolkit)
Analysis of results (not just accuracy numbers, offer
explanations and verifiable theories about observations)
Demo code archived on class site and shared with others
23 EE6882-Chang
Familiarity with
Image processing or computer vision Statistical pattern recognition or machine learning Computer programming (e.g., Matlab)
Background assessment given in the first
video representation, features, and statistical
24 EE6882-Chang
25% paper review,
Auditing permitted only
for non-students with active, continuous class participation
25 EE6882-Chang
How to read/present/write a research
Software links on web site to
Image/video data and features from
26 EE6882-Chang
Available on the web site Next 2 lectures (need volunteers)
Image classification (9/10, work with me
Bayesian Methods (Vailaya, Jain, and Zhang) Factor Graph (Naphade and Huang)
Boosting (9/24)
Freund & Schapire, Tieu and Viola
27 EE6882-Chang
Everyone learns insights and experience
Accumulate tools and reports