1
Honors Machine Vision
Jan 17, 2017
Kristen Grauman, University of Texas at Austin
Introductions
- Instructor:
- Prof. Kristen Grauman
- TA:
Introductions Instructor : Prof. Kristen Grauman TA : Dongguang - - PDF document
Honors Machine Vision Jan 17, 2017 Kristen Grauman, University of Texas at Austin Introductions Instructor : Prof. Kristen Grauman TA : Dongguang You 1 Today Course overview Requirements, logistics What is computer vision?
Kristen Grauman, University of Texas at Austin
Real-time stereo Structure from motion
NASA Mars Rover
Tracking
Demirdjian et al. Snavely et al. Wang et al.
sky water Ferris wheel amusement park Cedar Point 12 E tree tree tree carousel deck people waiting in line ride ride ride umbrellas pedestrians maxair bench tree Lake Erie people sitting on ride
Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions…
The Wicked Twister
Ph.D. thesis, MIT Department of Electrical Engineering, 1963.
Personal photo albums Surveillance and security Movies, news, sports Medical and scientific images Slide credit; L. Lazebnik
Setting camera focus via face detection Camera waits for everyone to smile to take a photo [Canon]
kooaba Situated search Yeh et al., MIT MSR Lincoln Google Goggles
Human joystick, NewsBreaker Live Assistive technology systems Camera Mouse, Boston College Microsoft Kinect
Image guided surgery MIT AI Vision Group fMRI data Golland et al.
The Matrix What Dreams May Come
Mocap for Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz
Navigation, driver safety Monitoring pool
(Poseidon)
Surveillance Pedestrian detection MERL, Viola et al.
slide credit: Fei-Fei, Fergus & Torralba
slide credit: Fei-Fei, Fergus & Torralba
COIL Roberts 1963
1996 1963 …
INRIA Pedestrians INRIA Pedestrians UIUC Cars UIUC Cars MIT-CMU Faces MIT-CMU Faces INRIA Pedestrians UIUC Cars MIT-CMU Faces
2000
1996 1963 …
Caltech-256 Caltech-256 Caltech-101 Caltech-101 MSRC 21 Objects MSRC 21 Objects Caltech-256 Caltech-101 MSRC 21 Objects
2000 2005
1996 1963 …
Faces in the Wild Faces in the Wild 80M Tiny Images 80M Tiny Images Birds-200 Birds-200 PASCAL VOC PASCAL VOC ImageNet ImageNet Faces in the Wild 80M Tiny Images Birds-200 PASCAL VOC PASCAL VOC PASCAL VOC ImageNet
2000 2005 2007 2008 2013
1996 1963 …
https://pdollar.wordpress.com/2015/01/21/image-captioning/
KITTI dataset – Andreas Geiger et al.
WhittleSearch – Adriana Kovashka et al.
Activities of Daily Living – Hamed Pirsiavash et al.
[fig from Shi et al]
Hartley and Zisserman Lowe
Fei-Fei Li
im[176][201] has value 164 im[194][203] has value 37 width 520 j=1 500 height i=1
R G B
Image from Fei-Fei Li