CS 4803 / 7643: Deep Learning Website: - - PowerPoint PPT Presentation

cs 4803 7643 deep learning
SMART_READER_LITE
LIVE PREVIEW

CS 4803 / 7643: Deep Learning Website: - - PowerPoint PPT Presentation

CS 4803 / 7643: Deep Learning Website: www.cc.gatech.edu/classes/AY2019/cs7643_fall/ Piazza: piazza.com/gatech/fall2018/cs48037643 Canvas: gatech.instructure.com/courses/28059 Gradescope: gradescope.com/courses/22096 Dhruv Batra School of


slide-1
SLIDE 1

CS 4803 / 7643: Deep Learning

Dhruv Batra School of Interactive Computing Georgia Tech

Website: www.cc.gatech.edu/classes/AY2019/cs7643_fall/ Piazza: piazza.com/gatech/fall2018/cs48037643 Canvas: gatech.instructure.com/courses/28059 Gradescope: gradescope.com/courses/22096

slide-2
SLIDE 2

Outline

  • What is Deep Learning, the field, about?

– Highlight of some recent projects from my lab

  • What is this class about?
  • What to expect?

– Logistics

  • FAQ

(C) Dhruv Batra 2

slide-3
SLIDE 3

Outline

  • What is Deep Learning, the field, about?

– Highlight of some recent projects from my lab

  • What is this class about?
  • What to expect?

– Logistics

  • FAQ

(C) Dhruv Batra 3

slide-4
SLIDE 4

What is Deep Learning? Some of the most exciting developments in Machine Learning, Vision, NLP, Speech, Robotics & AI in general in the last 5 years!

(C) Dhruv Batra 4

slide-5
SLIDE 5

Proxy for public interest

(C) Dhruv Batra 5

slide-6
SLIDE 6

1000 object classes 1.4M/50k/100k images

Person Dalmatian

http://image-net.org/challenges/LSVRC/{2010,…,2015}

(C) Dhruv Batra 6

ImageNet Large Scale Visual Recognition Challenge (ILSVRC)

Image Classification

slide-7
SLIDE 7

Image Classification

(C) Dhruv Batra 7

slide-8
SLIDE 8

(C) Dhruv Batra 8

https://qz.com/1034972/the-data-that-changed-the-direction-of-ai-research-and-possibly-the-world/

slide-9
SLIDE 9

AlphaGo vs Lee Sedol

(C) Dhruv Batra 9

slide-10
SLIDE 10

Tasks are getting bolder

(C) Dhruv Batra 10 A group of young people playing a game of Frisbee

Antol et al., 2015 Vinyals et al., 2015 Das et al., 2017

slide-11
SLIDE 11

Visual Question Answering (VQA)

(C) Dhruv Batra 12

slide-12
SLIDE 12

Visual Dialog

[CVPR ‘17]

Abhishek Das (Georgia Tech) Satwik Kottur (CMU) Avi Singh (UC Berkeley) Khushi Gupta (CMU) Deshraj Yadav (Virginia Tech) José Moura (CMU) Devi Parikh (Georgia Tech / FAIR) Dhruv Batra (Georgia Tech / FAIR)

slide-13
SLIDE 13

(C) Dhruv Batra 16

slide-14
SLIDE 14

(C) Dhruv Batra 17

slide-15
SLIDE 15

(C) Dhruv Batra 18

A man and a woman are holding umbrellas

slide-16
SLIDE 16

(C) Dhruv Batra 19

A man and a woman are holding umbrellas What color is his umbrella?

slide-17
SLIDE 17

(C) Dhruv Batra 20

man his

slide-18
SLIDE 18

(C) Dhruv Batra 21

umbrella

slide-19
SLIDE 19

(C) Dhruv Batra 22

A man and a woman are holding umbrellas What color is his umbrella? His umbrella is black

slide-20
SLIDE 20

(C) Dhruv Batra 23

A man and a woman are holding umbrellas What color is his umbrella? His umbrella is black What about hers?

slide-21
SLIDE 21

(C) Dhruv Batra 24

woman her

slide-22
SLIDE 22

(C) Dhruv Batra 25

umbrella umbrella hers

slide-23
SLIDE 23

(C) Dhruv Batra 26

A man and a woman are holding umbrellas What color is his umbrella? His umbrella is black What about hers? Hers is multi-colored

slide-24
SLIDE 24

(C) Dhruv Batra 27

A man and a woman are holding umbrellas What color is his umbrella? His umbrella is black What about hers? Hers is multi-colored How many other people are in the image?

slide-25
SLIDE 25

(C) Dhruv Batra 28

man and a woman

  • ther people
slide-26
SLIDE 26

(C) Dhruv Batra 29

A man and a woman are holding umbrellas What color is his umbrella? His umbrella is black What about hers? Hers is multi-colored How many other people are in the image? I think 3. They are occluded

slide-27
SLIDE 27

(C) Dhruv Batra 30

A man and a woman are holding umbrellas What color is his umbrella? His umbrella is black What about hers? Hers is multi-colored How many other people are in the image? I think 3. They are occluded How many are men?

slide-28
SLIDE 28

(C) Dhruv Batra 31

man and a woman

  • ther people

3 How many are men?

slide-29
SLIDE 29

Live demo at vqa.cloudcv.org. demo.visualdialog.org

slide-30
SLIDE 30

(C) Dhruv Batra 35

slide-31
SLIDE 31

(C) Dhruv Batra 36

slide-32
SLIDE 32

Embodied Question Answering

[CVPR ’18 Oral]

Abhishek Das (Georgia Tech) Samyak Datta (Georgia Tech) Devi Parikh (Georgia Tech/ FAIR) Dhruv Batra (Georgia Tech/ FAIR) Stefan Lee (Georgia Tech) Georgia Gkioxari (FAIR)

slide-33
SLIDE 33

(C) Dhruv Batra 38

slide-34
SLIDE 34

What is to the left of the shower? Cabinet

slide-35
SLIDE 35

What color is the car? – AI Challenges

  • Language Understanding

– What is the question asking?

  • Vision

– What does a ‘car’ look like?

  • Active Perception

– Agent must navigate by perception

  • Common sense

– Where are ‘cars’ generally located in the house?

  • Credit Assignment

– (forward, forward, turn-right, forward, . . . , turn-left, ‘red’)

(C) Dhruv Batra 40

slide-36
SLIDE 36

(C) Dhruv Batra 41

slide-37
SLIDE 37

So what is Deep (Machine) Learning?

  • Representation Learning
  • Neural Networks
  • Deep Unsupervised/Reinforcement/Structured/

<insert-qualifier-here> Learning

  • Simply: Deep Learning

(C) Dhruv Batra 43

slide-38
SLIDE 38

So what is Deep (Machine) Learning?

  • A few different ideas:
  • (Hierarchical) Compositionality

– Cascade of non-linear transformations – Multiple layers of representations

  • End-to-End Learning

– Learning (goal-driven) representations – Learning to feature extraction

  • Distributed Representations

– No single neuron “encodes” everything – Groups of neurons work together

(C) Dhruv Batra 44

slide-39
SLIDE 39

45

\ˈd ē p\

fixed learned

your favorite classifier hand-crafted features SIFT/HOG

“car” “+”

This burrito place is yummy and fun!

VISION SPEECH NLP

Traditional Machine Learning

fixed learned

your favorite classifier hand-crafted features MFCC

fixed learned

your favorite classifier hand-craCed features Bag-of-words Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-40
SLIDE 40

47

VISION SPEECH NLP pixels edge texton motif part

  • bject

sample spectral band formant motif phone word character NP/VP/.. clause sentence story word

Hierarchical Compositionality

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-41
SLIDE 41

(C) Dhruv Batra 48

Building A Complicated Function

Given a library of simple functions Compose into a complicate function

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-42
SLIDE 42

(C) Dhruv Batra 49

Building A Complicated Function

Given a library of simple functions Compose into a complicate function

Idea 1: Linear Combinations

  • Boosting
  • Kernels

f(x) = X

i

αigi(x)

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-43
SLIDE 43

(C) Dhruv Batra 50

Building A Complicated Function

Given a library of simple functions Compose into a complicate function

Idea 2: Compositions

  • Deep Learning
  • Grammar models
  • Scattering transforms…

f(x) = g1(g2(. . . (gn(x) . . .))

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-44
SLIDE 44

(C) Dhruv Batra 51

Building A Complicated Function

Given a library of simple functions Compose into a complicate function

Idea 2: Compositions

  • Deep Learning
  • Grammar models
  • Scattering transforms…

f(x) = log(cos(exp(sin3(x))))

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-45
SLIDE 45

Deep Learning = Hierarchical Compositionality

“car”

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-46
SLIDE 46

Trainable Classifier Low-Level Feature Mid-Level Feature High-Level Feature

Feature visualization of convolutional net trained on ImageNet from [Zeiler & Fergus 2013]

“car”

Deep Learning = Hierarchical Compositionality

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-47
SLIDE 47

So what is Deep (Machine) Learning?

  • A few different ideas:
  • (Hierarchical) Compositionality

– Cascade of non-linear transformations – Multiple layers of representations

  • End-to-End Learning

– Learning (goal-driven) representations – Learning to feature extraction

  • Distributed Representations

– No single neuron “encodes” everything – Groups of neurons work together

(C) Dhruv Batra 55

slide-48
SLIDE 48

56

\ˈd ē p\

fixed learned

your favorite classifier hand-crafted features SIFT/HOG

“car” “+”

This burrito place is yummy and fun!

VISION SPEECH NLP

Traditional Machine Learning

fixed learned

your favorite classifier hand-crafted features MFCC

fixed learned

your favorite classifier hand-craCed features Bag-of-words Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-49
SLIDE 49

Feature Engineering

(C) Dhruv Batra 57

SIFT Spin Images HoG Textons and many many more….

slide-50
SLIDE 50

fixed unsupervised supervised

classifier Mixture of Gaussians MFCC

\ˈd ē p\

fixed unsupervised supervised

classifier K-Means/ pooling SIFT/HOG

“car”

fixed unsupervised supervised

classifier n-grams Parse Tree Syntactic

“+”

This burrito place is yummy and fun!

VISION SPEECH NLP

Traditional Machine Learning (more accurately)

“Learned”

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

(C) Dhruv Batra 59

slide-51
SLIDE 51

fixed unsupervised supervised

classifier Mixture of Gaussians MFCC

\ˈd ē p\

fixed unsupervised supervised

classifier K-Means/ pooling SIFT/HOG

“car”

fixed unsupervised supervised

classifier n-grams Parse Tree Syntactic

“+”

This burrito place is yummy and fun!

VISION SPEECH NLP

Deep Learning = End-to-End Learning

“Learned”

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

(C) Dhruv Batra 60

slide-52
SLIDE 52
  • “Shallow” models
  • Deep models

Trainable Feature- Transform / Classifier Trainable Feature- Transform / Classifier Trainable Feature- Transform / Classifier Learned Internal Representations

“Shallow” vs Deep Learning

“Simple” Trainable Classifier hand-crafted Feature Extractor fixed learned

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-53
SLIDE 53

So what is Deep (Machine) Learning?

  • A few different ideas:
  • (Hierarchical) Compositionality

– Cascade of non-linear transformations – Multiple layers of representations

  • End-to-End Learning

– Learning (goal-driven) representations – Learning to feature extraction

  • Distributed Representations

– No single neuron “encodes” everything – Groups of neurons work together

(C) Dhruv Batra 63

slide-54
SLIDE 54

Distributed Representations Toy Example

  • Local vs Distributed

(C) Dhruv Batra 64

Slide Credit: Moontae Lee

slide-55
SLIDE 55

Distributed Representations Toy Example

  • Can we interpret each dimension?

(C) Dhruv Batra 65

Slide Credit: Moontae Lee

slide-56
SLIDE 56

Power of distributed representations!

(C) Dhruv Batra 66

Local Distributed

Slide Credit: Moontae Lee

slide-57
SLIDE 57

Power of distributed representations!

  • United States:Dollar :: Mexico:?

(C) Dhruv Batra 67

Slide Credit: Moontae Lee

slide-58
SLIDE 58

ThisPlusThat.me

(C) Dhruv Batra 68

Image Credit: http://insightdatascience.com/blog/thisplusthat_a_search_engine_that_lets_you_add_words_as_vectors.html

slide-59
SLIDE 59

So what is Deep (Machine) Learning?

  • A few different ideas:
  • (Hierarchical) Compositionality

– Cascade of non-linear transformations – Multiple layers of representations

  • End-to-End Learning

– Learning (goal-driven) representations – Learning to feature extraction

  • Distributed Representations

– No single neuron “encodes” everything – Groups of neurons work together

(C) Dhruv Batra 69

slide-60
SLIDE 60

Benefits of Deep/Representation Learning

  • (Usually) Better Performance

– “Because gradient descent is better than you” Yann LeCun

  • New domains without “experts”

– RGBD – Multi-spectral data – Gene-expression data – Unclear how to hand-engineer

(C) Dhruv Batra 70

slide-61
SLIDE 61

“Expert” intuitions can be misleading

  • “Every time I fire a linguist, the performance of our

speech recognition system goes up”

– Fred Jelinik, IBM ’98

(C) Dhruv Batra 71

slide-62
SLIDE 62

Benefits of Deep/Representation Learning

  • Modularity!
  • Plug and play architectures!

(C) Dhruv Batra 72

slide-63
SLIDE 63

Any DAG of differentialble modules is allowed!

Differentiable Computation Graph

Slide Credit: Marc'Aurelio Ranzato

(C) Dhruv Batra 73

slide-64
SLIDE 64

(C) Dhruv Batra 74

slide-65
SLIDE 65

Logistic Regression as a Cascade

(C) Dhruv Batra 75

Given a library of simple functions Compose into a complicate function

− log ✓ 1 1 + e−w|x ◆

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-66
SLIDE 66

Logistic Regression as a Cascade

(C) Dhruv Batra 76

Given a library of simple functions Compose into a complicate function

− log ✓ 1 1 + e−w|x ◆ w

|x

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-67
SLIDE 67

Key Computation: Forward-Prop

(C) Dhruv Batra 77

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-68
SLIDE 68

Key Computation: Back-Prop

(C) Dhruv Batra 78

Slide Credit: Marc'Aurelio Ranzato, Yann LeCun

slide-69
SLIDE 69

Any DAG of differentialble modules is allowed!

Differentiable Computation Graph

Slide Credit: Marc'Aurelio Ranzato

(C) Dhruv Batra 79

slide-70
SLIDE 70

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-71
SLIDE 71

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-72
SLIDE 72

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-73
SLIDE 73

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-74
SLIDE 74

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-75
SLIDE 75

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-76
SLIDE 76

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-77
SLIDE 77

Visual Dialog Model #1

Late Fusion Encoder

Slide Credit: Abhishek Das

slide-78
SLIDE 78

Problems with Deep Learning

  • Problem#1: Non-Convex! Non-Convex! Non-Convex!

– Depth>=3: most losses non-convex in parameters – Theoretically, all bets are off – Leads to stochasticity

  • different initializations à different local minima
  • Standard response #1

– “Yes, but all interesting learning problems are non-convex” – For example, human learning

  • Order matters à wave hands à non-convexity
  • Standard response #2

– “Yes, but it often works!”

(C) Dhruv Batra 88

slide-79
SLIDE 79

Problems with Deep Learning

  • Problem#2: Lack of interpretability

– Hard to track down what’s failing – Pipeline systems have “oracle” performances at each step – In end-to-end systems, it’s hard to know why things are not working

(C) Dhruv Batra 89

slide-80
SLIDE 80

Problems with Deep Learning

  • Problem#2: Lack of interpretability

(C) Dhruv Batra 90 End-to-End Pipeline [Fang et al. CVPR15] [Vinyals et al. CVPR15]

slide-81
SLIDE 81

Problems with Deep Learning

  • Problem#2: Lack of interpretability

– Hard to track down what’s failing – Pipeline systems have “oracle” performances at each step – In end-to-end systems, it’s hard to know why things are not working

  • Standard response #1

– Tricks of the trade: visualize features, add losses at different layers, pre-train to avoid degenerate initializations… – “We’re working on it”

  • Standard response #2

– “Yes, but it often works!”

(C) Dhruv Batra 91

slide-82
SLIDE 82

Problems with Deep Learning

  • Problem#3: Lack of easy reproducibility

– Direct consequence of stochasticity & non-convexity

  • Standard response #1

– It’s getting much better – Standard toolkits/libraries/frameworks now available – Caffe, Theano, (Py)Torch

  • Standard response #2

– “Yes, but it often works!”

(C) Dhruv Batra 92

slide-83
SLIDE 83

Yes it works, but how?

(C) Dhruv Batra 93

slide-84
SLIDE 84

Outline

  • What is Deep Learning, the field, about?

– Highlight of some recent projects from my lab

  • What is this class about?
  • What to expect?

– Logistics

  • FAQ

(C) Dhruv Batra 94

slide-85
SLIDE 85

Outline

  • What is Deep Learning, the field, about?

– Highlight of some recent projects from my lab

  • What is this class about?
  • What to expect?

– Logistics

  • FAQ

(C) Dhruv Batra 95

slide-86
SLIDE 86

What is this class about?

(C) Dhruv Batra 96

slide-87
SLIDE 87

What was F17 DL class about?

  • Firehose of arxiv

(C) Dhruv Batra 97

slide-88
SLIDE 88

Arxiv Fire Hose

(C) Dhruv Batra 98

PhD Student Deep Learning papers

slide-89
SLIDE 89

What was F17 DL class about?

  • Goal:

– After taking this class, you should be able to pick up the latest Arxiv paper, easily understand it, & implement it.

  • Target Audience:

– Junior/Senior PhD students who want to conduct research and publish in Deep Learning. (think ICLR/CVPR papers as outcomes)

(C) Dhruv Batra 99

slide-90
SLIDE 90

What is the F18 DL class about?

  • Introduction to Deep Learning
  • Goal:

– After finishing this class, you should be ready to get started

  • n your first DL research project.
  • CNNs
  • RNNs
  • Deep Reinforcement Learning
  • Generative Models (VAEs, GANs)
  • Target Audience:

– Senior undergrads, MS-ML, and new PhD students

(C) Dhruv Batra 100

slide-91
SLIDE 91

What this class is NOT

  • NOT the target audience:

– Advanced grad-students already working in ML/DL areas – People looking to understand latest and greatest cutting- edge research (e.g. GANs, AlphaGo, etc) – Undergraduate/Masters students looking to graduate with a DL class on their resume.

  • NOT the goal:

– Teaching a toolkit. “Intro to TensorFlow/PyTorch” – Intro to Machine Learning

(C) Dhruv Batra 101

slide-92
SLIDE 92

Caveat

  • This is an ADVANCED Machine Learning class

– This should NOT be your first introduction to ML – You will need a formal class; not just self-reading/coursera – If you took CS 7641/ISYE 6740/CSE 6740 @GT, you’re in the right place – If you took an equivalent class elsewhere, see list of topics taught in CS 7641 to be sure.

(C) Dhruv Batra 102

slide-93
SLIDE 93

Prerequisites

  • Intro Machine Learning

– Classifiers, regressors, loss functions, MLE, MAP

  • Linear Algebra

– Matrix multiplication, eigenvalues, positive semi-definiteness…

  • Calculus

– Multi-variate gradients, hessians, jacobians… (C) Dhruv Batra 103

slide-94
SLIDE 94

Prerequisites

  • Intro Machine Learning

– Classifiers, regressors, loss functions, MLE, MAP

  • Linear Algebra

– Matrix multiplication, eigenvalues, positive semi-definiteness…

  • Calculus

– Multi-variate gradients, hessians, jacobians… (C) Dhruv Batra 104

slide-95
SLIDE 95

Prerequisites

  • Intro Machine Learning

– Classifiers, regressors, loss functions, MLE, MAP

  • Linear Algebra

– Matrix multiplication, eigenvalues, positive semi-definiteness…

  • Calculus

– Multi-variate gradients, hessians, jacobians…

  • Programming!

– Homeworks will require Python, C++! – Libraries/Frameworks: PyTorch – HW0 (pure python), HW1 (python + PyTorch), HW2+3 (PyTorch) – Your language of choice for project

(C) Dhruv Batra 105

slide-96
SLIDE 96

Course Information

  • Instructor: Dhruv Batra

– dbatra@gatech – Location: 219 CCB

(C) Dhruv Batra 107

slide-97
SLIDE 97

Machine Learning & Perception Group

(C) Dhruv Batra

Dhruv Batra Assistant Professor

Stefan Lee Research Scientist

slide-98
SLIDE 98

TAs

(C) Dhruv Batra 109 Michael Cogswell 3rd year CS PhD student http://mcogswell.io/ Erik Wijmans 2nd year CS PhD student http://wijmans.xyz/ Nirbhay Modhe 2nd year CS PhD student https://nirbhayjm.gith ub.io/ Harsh Agrawal 1st year CS PhD student https://dexter1691.gi thub.io/

slide-99
SLIDE 99

TA: Michael Cogswell

  • PhD student working with Dhruv
  • Research work/interest:

– Deep Learning – applications to Computer Vision and AI

  • I also Fence (mainly foil)

(C) Dhruv Batra 110

slide-100
SLIDE 100

PhD student in CS Research Interests Scene Understanding Embodied Agents 3D Computer Vision

TA: Erik Wijmans

slide-101
SLIDE 101

2nd Year PhD Student Research Interests:

  • Visual Dialog
  • Bayesian Machine Learning
  • Generative Modeling

TA: Nirbhay Modhe

slide-102
SLIDE 102

TA: Harsh Agrawal

  • 1st year CS PhD student
  • Previously at Snapchat Research
  • Research at the intersection of

vision and language

113

Sorting jumbled story elements into coherent story

slide-103
SLIDE 103

Organization & Deliverables

  • 4 homeworks (80%)

– Mix of theory and implementation – First one goes out next week

  • Start early, Start early, Start early, Start early, Start early, Start early,

Start early, Start early, Start early, Start early

  • Final project (20%)

– Projects done in groups of 3-4

  • (Bonus) Class Participation (5%)

– Contribute to class discussions on Piazza – Ask questions, answer questions

(C) Dhruv Batra 114

slide-104
SLIDE 104

Late Days

  • “Free” Late Days

– 7 late days for the semester

  • Use for HWs
  • Cannot use for project related deadlines

– After free late days are used up:

  • 25% penalty for each late day

(C) Dhruv Batra 115

slide-105
SLIDE 105

HW0

  • Out today; due Sept 5 (09/05)

– Available on class webpage + Canvas

  • Grading

– <=80% means that you might not be prepared for the class

  • Topics

– PS: probability, calculus, convexity, proving things – HW: Implement training of a soft-max classifier via SGD

(C) Dhruv Batra 116

slide-106
SLIDE 106

Project

  • Goal

– Chance to try Deep Learning – Encouraged to apply to your research (computer vision, NLP, robotics,…) – Must be done this semester. – Can combine with other classes

  • get permission from both instructors; delineate different parts

– Extra credit for shooting for a publication

  • Main categories

– Application/Survey

  • Compare a bunch of existing algorithms on a new application domain of

your interest

– Formulation/Development

  • Formulate a new model or algorithm for a new or old problem

– Theory

  • Theoretically analyze an existing algorithm

(C) Dhruv Batra 117

slide-107
SLIDE 107

Computing

  • Major bottleneck

– GPUs

  • Options

– Your own / group / advisor’s resources – Google Cloud Credits

  • $50 credits to every registered student courtesy Google

– Minsky cluster in IC

(C) Dhruv Batra 118

slide-108
SLIDE 108

4803 vs 7643

  • Level differentiation
  • HWs

– Extra credit questions for 4803 students, necessary for 7643

  • Project

– Higher expectations from 7643

(C) Dhruv Batra 119

slide-109
SLIDE 109

Outline

  • What is Deep Learning, the field, about?

– Highlight of some recent projects from my lab

  • What is this class about?
  • What to expect?

– Logistics

  • FAQ

(C) Dhruv Batra 120

slide-110
SLIDE 110

Waitlist / Audit / Sit in

  • Waitlist

– Class is full. Size will not increase further. – Do HW0. Come to first few classes. – Hope people drop.

  • Audit or Pass/Fail

– We will give preference to people taking class for credit.

  • Sitting in

– Talk to instructor.

(C) Dhruv Batra 121

slide-111
SLIDE 111

Re-grading Policy

  • Homework assignments

– Within 1 week of receiving grades: see the TAs

  • This is an advanced grad class.

– The goal is understanding the material and making progress towards our research.

(C) Dhruv Batra 122

slide-112
SLIDE 112

Collaboration Policy

  • Collaboration

– Only on HWs and project (not allowed in HW0). – You may discuss the questions – Each student writes their own answers – Write on your homework anyone with whom you collaborate – Each student must write their own code for the programming part

  • Zero tolerance on plagiarism

– Neither ethical nor in your best interest – Always credit your sources – Don’t cheat. We will find out.

(C) Dhruv Batra 123

slide-113
SLIDE 113

Communication Channels

  • Primary means of communication -- Piazza

– No direct emails to Instructor unless private information – Instructor/TAs can provide answers to everyone on forum – Class participation credit for answering questions! – No posting answers. We will monitor.

  • Staff Mailing List

– cs4803-7643-f18-staff@googlegroups.com

  • Links:

– Website: www.cc.gatech.edu/classes/AY2019/cs7643_fall/ – Piazza: piazza.com/gatech/fall2018/cs48037643 – Canvas: gatech.instructure.com/courses/28059 – Gradescope: gradescope.com/courses/22096 (C) Dhruv Batra 124

slide-114
SLIDE 114

Todo

  • HW0

– Due Wed Sept 5 11:55pm

(C) Dhruv Batra 125

slide-115
SLIDE 115

Welcome

(C) Dhruv Batra 126