  1. CS 4803 / 7643: Deep Learning
     Topics: (Continued) Low-label ML Formulations
     Zsolt Kira, Georgia Tech

  2. Administrative
     • Projects!
       – Poster details out on Piazza
       – Note: No late days for anything project related!
       – Also note: Keep track of your GCP usage and costs! Set limits on spending.

  3. Meta-Learning for Few-Shot Recognition
     • Key idea: We want to learn from a few examples (called the support set) to make predictions on a query set for novel classes.
       – Assume: We have a larger labeled dataset for a different set of categories (the base classes).
     • How do we test this? The N-way, k-shot test:
       – k: number of examples per class in the support set
       – N: number of classes ("confusers") among which we have to choose the target class
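As a concrete illustration of the N-way, k-shot protocol, here is a minimal sketch of how one evaluation episode could be sampled. The `data_by_class` dictionary, helper name, and default sizes are assumptions for illustration, not something defined on the slides.

```python
import random

def sample_episode(data_by_class, n_way=5, k_shot=1, n_query=15):
    """Sample one N-way, k-shot episode from a dict {class_label: [examples]}."""
    classes = random.sample(list(data_by_class.keys()), n_way)
    support, query = [], []
    for episode_label, cls in enumerate(classes):
        examples = random.sample(data_by_class[cls], k_shot + n_query)
        # Labels are re-indexed 0..N-1 within the episode: the model must pick the
        # target class from among the N candidate ("confuser") classes.
        support += [(x, episode_label) for x in examples[:k_shot]]
        query += [(x, episode_label) for x in examples[k_shot:]]
    return support, query
```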

  4. Normal Approach
     • Do what we always do: fine-tuning
       – Train a classifier on the base classes
       – Freeze the features
       – Learn classifier weights for the new classes using the small amount of labeled data (during "inference" time!)
     A Closer Look at Few-shot Classification. Wei-Yu Chen, Yen-Cheng Liu, Zsolt Kira, Yu-Chiang Frank Wang, Jia-Bin Huang
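A minimal PyTorch-style sketch of this baseline, assuming a backbone already trained on the base classes; the function and tensor names are hypothetical.

```python
import torch
import torch.nn as nn

def finetune_baseline(backbone, support_x, support_y, n_way, steps=100, lr=1e-2):
    """Freeze pretrained features and fit a new linear head on the support set."""
    backbone.eval()                       # features stay frozen
    with torch.no_grad():
        feats = backbone(support_x)       # (n_way * k_shot, feat_dim)

    head = nn.Linear(feats.shape[1], n_way)
    opt = torch.optim.SGD(head.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(steps):                # note: this optimization happens at "inference" time
        opt.zero_grad()
        loss_fn(head(feats), support_y).backward()
        opt.step()
    return head                           # classify a query x via head(backbone(x))
```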

  5. Cons of the Normal Approach
     • Training on the base classes does not take the eventual task into account.
     • There is no notion that we will be performing a bunch of N-way tests.
     • Idea: simulate what we will see during test time.

  6. Meta-Training Approach
     • Set up a set of smaller tasks during training that simulate what we will be doing during testing.
       – Can optionally pre-train features on held-out base classes (not typical).
     • The testing stage is now the same, but with new classes.
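A rough sketch of such an episodic training loop, reusing the hypothetical `sample_episode` helper above; `meta_learner.episode_loss` is an assumed interface standing in for whichever method (Matching Networks, Prototypical Networks, MAML, ...) is being trained.

```python
def meta_train(meta_learner, base_data_by_class, optimizer, n_iters=10000):
    """Episodic training: every iteration draws a small N-way, k-shot task
    from the base classes so that training matches the test-time protocol."""
    for it in range(n_iters):
        support, query = sample_episode(base_data_by_class, n_way=5, k_shot=1)
        optimizer.zero_grad()
        # The meta-learner conditions on the support set and is scored on the query set.
        loss = meta_learner.episode_loss(support, query)
        loss.backward()
        optimizer.step()
```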

  7. Meta-Learning Approaches
     • Learning a model conditioned on the support set

  8. Meta-Learner
     • How to parametrize learning algorithms?
     • Two approaches to defining a meta-learner:
       – Take inspiration from a known learning algorithm
         • kNN / kernel machine: Matching Networks (Vinyals et al., 2016)
         • Gaussian classifier: Prototypical Networks (Snell et al., 2017)
         • Gradient descent: Meta-Learner LSTM (Ravi & Larochelle, 2017), MAML (Finn et al., 2017)
       – Derive it from a black-box neural network
         • MANN (Santoro et al., 2016)
         • SNAIL (Mishra et al., 2018)
     Slide Credit: Hugo Larochelle

  9. Meta-Learner (same content as slide 8; Slide Credit: Hugo Larochelle)

  10. Matching Networks (Slide Credit: Hugo Larochelle)
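A simplified sketch of the Matching Networks idea as a soft, attention-weighted nearest-neighbor classifier over the support set; the full-context embeddings of the original paper are omitted, and `embed` is an assumed feature extractor.

```python
import torch
import torch.nn.functional as F

def matching_net_predict(embed, support_x, support_y, query_x, n_way):
    """Label each query by a cosine-similarity-weighted vote over support labels."""
    s = F.normalize(embed(support_x), dim=1)       # (S, d) support embeddings
    q = F.normalize(embed(query_x), dim=1)         # (Q, d) query embeddings
    attn = F.softmax(q @ s.t(), dim=1)             # (Q, S) attention over the support set
    onehot = F.one_hot(support_y, n_way).float()   # (S, n_way)
    return attn @ onehot                           # (Q, n_way) class probabilities
```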

  11. Prototypical Networks (Slide Credit: Hugo Larochelle)

  12. Prototypical Networks (Slide Credit: Hugo Larochelle)
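A minimal sketch of the Prototypical Networks classification rule, again assuming a generic `embed` network: each class is summarized by the mean ("prototype") of its embedded support examples, and queries are scored by negative squared Euclidean distance to the prototypes, a Gaussian-classifier-like rule.

```python
import torch

def proto_net_logits(embed, support_x, support_y, query_x, n_way):
    """Nearest-prototype classification in embedding space."""
    s = embed(support_x)                                         # (S, d)
    q = embed(query_x)                                           # (Q, d)
    prototypes = torch.stack(
        [s[support_y == c].mean(dim=0) for c in range(n_way)])   # (n_way, d) class means
    logits = -torch.cdist(q, prototypes) ** 2                    # (Q, n_way)
    return logits                                                # train with cross-entropy on query labels
```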

  13. More Sophisticated Meta-Learning Approaches
     • Learn gradient descent:
       – parameter initialization and update rules
     • Learn just an initialization and use normal gradient descent (MAML)

  14. Meta-Learner LSTM (Slide Credit: Hugo Larochelle)

  15. Meta-Learner LSTM (Slide Credit: Hugo Larochelle)

  16. Meta-Learner LSTM (Slide Credit: Hugo Larochelle)

  17. Meta-Learner LSTM (Slide Credit: Hugo Larochelle)

  18. Meta-Learning Algorithm

  19. Meta-Learner LSTM (Slide Credit: Hugo Larochelle)
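As rough intuition for the Meta-Learner LSTM (Ravi & Larochelle, 2017), here is a heavily simplified, assumption-laden toy sketch rather than the published model: the learner's parameters play the role of the LSTM cell state, and learned input/forget gates generalize the update theta_t = f_t * theta_{t-1} + i_t * (-grad), which reduces to plain SGD when f_t = 1 and i_t equals the learning rate.

```python
import torch
import torch.nn as nn

class LSTMLikeUpdateRule(nn.Module):
    """Toy stand-in for the Meta-Learner LSTM update; the real method feeds preprocessed
    gradients and losses through an actual coordinate-wise LSTM."""
    def __init__(self):
        super().__init__()
        self.gate_net = nn.Linear(2, 2)   # per-parameter features (grad, theta) -> (i, f) gate logits

    def forward(self, theta, grad):
        feats = torch.stack([grad, theta], dim=-1)                       # (num_params, 2)
        i_gate, f_gate = torch.sigmoid(self.gate_net(feats)).unbind(dim=-1)
        return f_gate * theta + i_gate * (-grad)                         # "cell state" update = new parameters
```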

  20. Model-Agnostic Meta-Learning (MAML) (Slide Credit: Hugo Larochelle)

  21. Model-Agnostic Meta-Learning (MAML)

  22. Model-Agnostic Meta-Learning (MAML) (Slide Credit: Sergey Levine)

  23. Model-Agnostic Meta-Learning (MAML) (Slide Credit: Sergey Levine)
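A condensed sketch of one MAML meta-training step with a single inner adaptation step, assuming PyTorch >= 2.0 for `torch.func.functional_call`; all names are illustrative.

```python
import torch
from torch.func import functional_call

def maml_step(model, loss_fn, support, query, meta_opt, inner_lr=0.01):
    """Inner loop: adapt the shared initialization to the support set.
    Outer loop: update the initialization so the adapted weights do well on the query set."""
    (xs, ys), (xq, yq) = support, query

    names, params = zip(*model.named_parameters())
    inner_loss = loss_fn(model(xs), ys)
    grads = torch.autograd.grad(inner_loss, params, create_graph=True)
    fast = {n: p - inner_lr * g for n, p, g in zip(names, params, grads)}  # theta' = theta - alpha * grad

    # Evaluate the adapted ("fast") weights on the query set without mutating the model.
    query_logits = functional_call(model, fast, (xq,))
    outer_loss = loss_fn(query_logits, yq)

    meta_opt.zero_grad()
    outer_loss.backward()   # backpropagates through the inner update (second-order MAML)
    meta_opt.step()
    return outer_loss.item()
```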

  24. Comparison (Slide Credit: Sergey Levine)

  25. Meta-Learner (same content as slide 8; Slide Credit: Hugo Larochelle)

  26. Experiments (Slide Credit: Hugo Larochelle)

  27. Memory-Augmented Neural Network (Slide Credit: Hugo Larochelle)

  28. But beware
     A Closer Look at Few-shot Classification. Wei-Yu Chen, Yen-Cheng Liu, Zsolt Kira, Yu-Chiang Frank Wang, Jia-Bin Huang
     Slide Credit: Hugo Larochelle

  29. (untitled figure slide)

  30. Distribution Shift
     • What if there is a distribution shift (cross-domain)?
     • Lesson: Methods that are successful within-domain might be worse across domains!

  31. Distribution Shift

  32. Random Task Proposals

  33. Does it Work?

  34. Discussions
     • What is the right definition of distributions over problems?
       – varying number of classes / examples per class (meta-training vs. meta-testing)?
       – semantic differences between meta-training and meta-testing classes?
       – overlap between meta-training and meta-testing classes (see the recent "low-shot" literature)?
     • Move from static to interactive learning
       – How should this impact how we generate episodes?
       – Meta-active learning? (few successes so far)
     Slide Credit: Hugo Larochelle
