Lecture 1 -
- Fei-Fei Li & Andrej Karpathy
Lecture 1: Introduction
5-‑Jan-‑15 ¡ 1 ¡
Lecture 1: Introduction 1 5-Jan-15 Lecture 1 - Fei-Fei Li - - PowerPoint PPT Presentation
Lecture 1: Introduction 1 5-Jan-15 Lecture 1 - Fei-Fei Li & Andrej Karpathy Welcome to CS231n 2 5-Jan-15 Lecture 1 - Fei-Fei Li & Andrej Karpathy Biology Biology
Lecture 1 -
5-‑Jan-‑15 ¡ 1 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 2 ¡
Lecture 1 -
Computer Computer Vision ision
Neuroscience Machine learning Speech, NLP
Information retrieval
Mathematics Mathematics Computer Computer Science Science Biology Biology Engineering Engineering Physics Physics
Robotics Cognitive sciences
Psychology Psychology
graphics, algorithms, theory,…
Image processing 5-‑Jan-‑15 ¡ 3 ¡
systems, architecture, …
Lecture 1 -
Computer Computer Vision ision
Neuroscience Machine learning Speech, NLP
Information retrieval
Mathematics Mathematics Computer Computer Science Science Biology Biology Engineering Engineering Physics Physics
Robotics Cognitive sciences
Psychology Psychology
graphics, algorithms, theory,…
Image processing 5-‑Jan-‑15 ¡ 4 ¡
systems, architecture, …
Lecture 1 -
– Undergraduate introductory class
– Core computer vision class for seniors, masters, and PhDs – Topics include image processing, cameras, 3D reconstruction, segmentation, object recognition, scene understanding
CS231n (this term, Pr m, Prof. Fei-Fei Li & Andr
ej Karpathy Karpathy) )
– Neural network (aka “deep lear Neural network (aka “deep learning”) class on image ning”) class on image classification classification
Computer Vision
– Project-based advanced vision class to prepare students for CV research
– Computer vision and computational photography for mobile platform (e.g. Android)
in computer vision
5-‑Jan-‑15 ¡ 5 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 6 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 7 ¡
543million years, B.C.
Lecture 1 -
5-‑Jan-‑15 ¡ 8 ¡
Leonardo da Vinci 16th Century, A.D.
Lecture 1 -
5-‑Jan-‑15 ¡ 9 ¡
Hubel & Wiesel, 1959
Lecture 1 -
5-‑Jan-‑15 ¡ 10 ¡
Larry Roberts, 1963
Lecture 1 -
5-‑Jan-‑15 ¡ 11 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 12 ¡
David Marr, 1970s
Lecture 1 -
5-‑Jan-‑15 ¡ 13 ¡
Stages of Visual Representation, David Marr, 1970s
Lecture 1 -
5-‑Jan-‑15 ¡ 14 ¡
Brooks & Binford, 1979 Fischler and Elschlager, 1973
Lecture 1 -
5-‑Jan-‑15 ¡ 15 ¡
David Lowe, 1987
Lecture 1 -
5-‑Jan-‑15 ¡ 16 ¡
(Shi & Malik, 1997)
Lecture 1 -
5-‑Jan-‑15 ¡ 17 ¡
Face Detection, Viola & Jones, 2001
Lecture 1 -
5-‑Jan-‑15 ¡ 18 ¡
“SIFT” & Object Recognition, David Lowe, 1999
Lecture 1 -
5-‑Jan-‑15 ¡ 19 ¡
Spatial Pyramid Matching, Lazebnik, Schmid & Ponce, 2006
Lecture 1 -
5-‑Jan-‑15 ¡ 20 ¡
Histogram of Gradients (HoG) Dalal & Triggs, 2005 Deformable Part Model Felzenswalb, McAllester, Ramanan, 2009
Lecture 1 -
5-‑Jan-‑15 ¡ 21 ¡
PASCAL V ASCAL Visual Object Challenge isual Object Challenge (20 object categories) (20 object categories)
[Everingham et al. 2006-2012]
2009 2010 2011 2012 0.4 0.5 0.6 0.7 0.8 0.9 1
Average Precision Challenge Year
all aeroplane bicycle bird boat bottle bus car cat chair cow diningtable dog horse motorbike person pottedplant sheep sofa train tvmonitor
Lecture 1 -
5-‑Jan-‑15 ¡ 22 ¡
www.image-‑net.org ¡
Deng, Dong, Socher, Li, Li, & Fei-Fei, 2009
¡
¡
Lecture 1 -
5-‑Jan-‑15 ¡ 23 ¡
Output: Output: Scale T-shirt Steel drum Drumstick Mud turtle
Steel ¡drum ¡
Output: Output: Scale T-shirt Giant panda Drumstick Mud turtle
The Image Classification Challenge: 1,000 object classes 1,431,167 images
Russakovsky et al. arXiv, 2014
Lecture 1 -
5-‑Jan-‑15 ¡ 24 ¡
Steel ¡drum ¡
The Image Classification Challenge: 1,000 object classes 1,431,167 images
0.28 ¡ 0.26 ¡ 0.16 ¡ 0.12 ¡ 0.07 ¡
Russakovsky et al. arXiv, 2014
Lecture 1 -
5-‑Jan-‑15 ¡ 25 ¡
Lecture 1 -
CS231n focuses on one of the most important problems of visual recognition – image classification
5-‑Jan-‑15 ¡ 26 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 27 ¡
Lecture 1 -
There is a number of visual recognition problems that are related to image classification, such as
5-‑Jan-‑15 ¡ 28 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 29 ¡
Lecture 1 -
Convolutional Neural Network (CNN) has become an important tool for object recognition
5-‑Jan-‑15 ¡ 30 ¡
Lecture 1 -
ConvoluHon ¡ Pooling ¡ SoMmax ¡ Other ¡
GoogLeNet VGG MSRA SuperVision
[Krizhevsky NIPS 2012]
Year 2012 ear 2012 Year 2014 ear 2014 Year 2010 ear 2010
Dense ¡grid ¡descriptor: ¡ HOG, ¡LBP ¡ Coding: ¡local ¡coordinate, ¡ super-‑vector ¡ Pooling, ¡SPM ¡ Linear ¡SVM ¡
NEC-UIUC
[Lin CVPR 2011] [Szegedy arxiv 2014] [Simonyan arxiv 2014] [He arxiv 2014]
5-‑Jan-‑15 ¡ 31 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 32 ¡
Lecture 1 -
LeCun et al. Krizhevsky et al. # of transistors # of pixels used in training # of transistors # of pixels used in training
107 1014 106 109
GPUs 5-‑Jan-‑15 ¡ 33 ¡
Lecture 1 -
The quest for visual intelligence goes far beyond object recognition…
5-‑Jan-‑15 ¡ 34 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 35 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 36 ¡
Lecture 1 -
Some kind of game or fight. Two groups of two men? The foregound pair looked like one was getting a fist in the face. Outdoors seemed like because i have an impression of grass and maybe lines on the grass? That would be why I think perhaps a game, rough game though, more like rugby than football because they pairs weren't in pads and helmets, though I did get the impression of similar clothing. maybe some trees? in the background. (Subject: SM)
PT = 500ms PT = 500ms
Fei-Fei, Iyer, Koch, Perona, JoV, 2007
5-‑Jan-‑15 ¡ 37 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 38 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 39 ¡
Lecture 1 -
– Justin Johnson, Ph.D. candidate, CS – Yuke Zhu, master candidate, CS – TBA
– cs231n-winter1415-staf cs231n-winter1415-staff@lists.stanfor f@lists.stanford.edu d.edu – Piazza Piazza – Twitter witter
5-‑Jan-‑15 ¡ 40 ¡
Lecture 1 -
– Milestone: 5% – Final write-up: 35% – Bonus points for exceptional poster presentation
– 7 free late days – use them in your ways – Afterwards, 25% off per day late – Not accepted after 3 late days per PS – Does not apply to Final Course Project
– Read the student code book, understand what is ‘collaboration’ and what is ‘academic infraction’
5-‑Jan-‑15 ¡ 41 ¡
Lecture 1 -
– All class assignments will be in Python (and use numpy), but some of the deep learning libraries we may look at later in the class are written in C++. – A Python tutorial available on course website
Learning)
– We will be formulating cost functions, taking derivatives and performing optimization with gradient descent.
5-‑Jan-‑15 ¡ 42 ¡
Lecture 1 -
http://vision.stanford.edu/teaching/cs231n/index.html
5-‑Jan-‑15 ¡ 43 ¡
Lecture 1 -
5-‑Jan-‑15 ¡ 44 ¡
architecture in the cat's visual cortex." The Journal of physiology 160.1 (1962): 106. [PDF]
Institute of Technology, 1963. [PDF]
105-113. [PDF]
IEEE Transactions on Computers 22.1 (1973): 67-92. [PDF]
Intelligence, 31, 3 (1987), pp. 355-395. [PDF]
Machine Intelligence, IEEE Transactions on 22.8 (2000): 888-905. [PDF]
Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on. Vol. 1. IEEE, 2001. [PDF]
Computer Vision 60.2 (2004): 91-110. [PDF]
matching for recognizing natural scene categories." Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on. Vol. 2. IEEE, 2006. [PDF]
Lecture 1 -
5-‑Jan-‑15 ¡ 45 ¡
Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. Vol. 1. IEEE,
deformable part model." Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008 [PDF]
Computer Vision 88.2 (2010): 303-338. [PDF]
Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009. [PDF]
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE, 2011. [PDF]
convolutional neural networks." Advances in neural information processing systems. 2012. [PDF]
[PDF]
recognition." arXiv preprint arXiv:1409.1556 (2014). [PDF]
arXiv preprint arXiv:1406.4729 (2014). [PDF]
IEEE 86.11 (1998): 2278-2324. [PDF]