1
Visual Recognition Spring 2016
Introductions
- Instructor:
- Prof. Kristen Grauman
- TA:
Introductions Instructor : Prof. Kristen Grauman TA : Kai-Yang - - PDF document
Visual Recognition Spring 2016 Introductions Instructor : Prof. Kristen Grauman TA : Kai-Yang Chiang 1 Today Course overview Requirements, logistics What is computer vision? Done? 2 Computer Vision Automatic
Real-time stereo Structure from motion
NASA Mars Rover
Tracking
Demirdjian et al. Snavely et al. Wang et al.
sky water Ferris wheel amusement park Cedar Point 12 E tree tree tree carousel deck people waiting in line ride ride ride umbrellas pedestrians maxair bench tree Lake Erie people sitting on ride
Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions…
The Wicked Twister
Ph.D. thesis, MIT Department of Electrical Engineering, 1963.
Personal photo albums Surveillance and security Movies, news, sports Medical and scientific images Slide credit; L. Lazebnik
Setting camera focus via face detection Camera waits for everyone to smile to take a photo [Canon]
http://www.darpa.mil/grandchallenge/galler y .asp
Kooaba, Bay & Quack et al. Y eh et al., MIT Belhumeur et al.
Snav ely et al. Simon & Seitz
Siv ic & Zisserman Lee & Grauman Wang et al.
Objects Actions Categories
Gammeter et al.
Human joystick, NewsBreaker Live Assistive technology systems Camera Mouse, Boston College Microsoft Kinect
slide credit: Fei-Fei, Fergus & Torralba
Video credit: Rob Fergus and Antonio Torralba
Video credit: Rob Fergus and Antonio Torralba
slide credit: Fei-Fei, Fergus & Torralba
COIL Roberts 1963
1996 1963 …
INRIA Pedestrians UIUC Cars MIT-CMU Faces
2000
1996 1963 …
Caltech-256 Caltech-101 MSRC 21 Objects
2000 2005
1996 1963 …
Faces in the Wild 80M Tiny Images Birds-200 PASCAL VOC ImageNet
2000 2005 2007 2008 2013
1996 1963 …
https://pdollar.wordpress.com/2015/01/21/image-captioning/
KITTI dataset – Andreas Geiger et al.
WhittleSearch – Adriana Kovashka et al.
Activities of Daily Living – Hamed Pirsiavash et al.
– Experiment with different types of (mini) training/testing data sets – Evaluate sensitivity to important parameter settings – Show (on a small scale) an example to analyze a strength/weakness of the approach
– Instance recognition – Category recognition – Mid-level representations – Object detection
– Great outdoors – Social signals – Noticing and remembering – Low-supervision learning – 3d scenes and objects – Recognition in action – Attributes and parts – Language and vision
– Instance recognition – Category recognition – Mid-level representations – Object detection
– Great outdoors – Social signals – Noticing and remembering – Low-supervision learning – 3d scenes and objects – Recognition in action – Attributes and parts – Language and vision