SLIDE 1

CS 330

Advanced Meta-Learning: Task Construction

SLIDE 2

Logistics

  • Homework 2 out, due Friday, October 16th
  • Project group form due Wednesday, October 7th (encouraged to do it early)
  • Project proposal due & presentations on October 14th

SLIDE 3

Question of the Day

How should tasks be defined for good meta-learning performance?

SLIDE 4

Plan for Today

Brief Recap of Meta-Learning & Task Construction

Memorization in Meta-Learning

  • When it arises
  • A potential solution

Meta-Learning without Tasks Provided

  • Unsupervised Meta-Learning
  • Meta-Learning from Unsegmented Task Stream (time permitting)


Goals for the end of lecture:

  • Understand when & how memorization in meta-learning may occur
  • Understand techniques for constructing tasks automatically

🚩 Disclaimer 🚩: These topics are at the bleeding edge of research.

SLIDE 5

Recap: Black-Box Meta-Learning

Key idea: parametrize the learner as a neural network.

[Figure: a black-box meta-learner f_θ takes the task training data D^tr_i, produces task parameters φ_i, and predicts y^ts from the test input x^ts]

+ expressive
− challenging optimization problem
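To make the recap concrete, here is a minimal black-box meta-learner sketch in PyTorch (an illustration under assumed names and dimensions, not the lecture's exact architecture): a set encoder averages per-example features of D^tr_i into task parameters φ_i, which condition the prediction for x^ts.

```python
import torch
import torch.nn as nn

class BlackBoxMetaLearner(nn.Module):
    """Hypothetical sketch of f_theta(D^tr_i, x^ts) with a permutation-invariant set encoder."""
    def __init__(self, x_dim, y_dim, phi_dim=64):
        super().__init__()
        # Encodes each (x, y) training pair; averaging yields the task summary phi_i.
        self.encoder = nn.Sequential(
            nn.Linear(x_dim + y_dim, 128), nn.ReLU(), nn.Linear(128, phi_dim))
        # Predicts y^ts from x^ts, conditioned on phi_i.
        self.predictor = nn.Sequential(
            nn.Linear(x_dim + phi_dim, 128), nn.ReLU(), nn.Linear(128, y_dim))

    def forward(self, x_tr, y_tr, x_ts):
        pairs = torch.cat([x_tr, y_tr], dim=-1)   # (K, x_dim + y_dim)
        phi = self.encoder(pairs).mean(dim=0)     # (phi_dim,): task parameters phi_i
        phi = phi.expand(x_ts.shape[0], -1)       # broadcast to each test input
        return self.predictor(torch.cat([x_ts, phi], dim=-1))
```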
SLIDE 6

Recap: Optimization-Based Meta-Learning

Key idea: embed optimization inside the inner learning process.

[Figure: the meta-learner adapts by gradient descent, φ_i = θ − α∇_θℒ(θ, D^tr_i), then predicts y^ts from x^ts]

+ structure of optimization embedded into meta-learner
− typically requires second-order optimization
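A minimal sketch of the inner adaptation step in the spirit of MAML (assumes PyTorch 2.x's torch.func.functional_call and a differentiable loss; names are illustrative):

```python
import torch

def inner_adapt(model, loss_fn, x_tr, y_tr, alpha=0.01):
    """One inner gradient step: phi_i = theta - alpha * grad_theta L(theta, D^tr_i)."""
    params = dict(model.named_parameters())
    preds = torch.func.functional_call(model, params, (x_tr,))
    loss = loss_fn(preds, y_tr)
    # create_graph=True retains second-order terms for the outer meta-gradient.
    grads = torch.autograd.grad(loss, tuple(params.values()), create_graph=True)
    return {name: p - alpha * g for (name, p), g in zip(params.items(), grads)}
```

The adapted parameters φ_i can then be evaluated on (x^ts, y^ts) with another functional_call, and the test loss backpropagated into θ; differentiating through the gradient step is what makes the method second-order.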

SLIDE 7

Recap: Non-Parametric Meta-Learning

Key idea: non-parametric learner (e.g. nearest neighbor to examples, prototypes) with a parametric embedding space / distance metric.

[Figure: embed D^tr_i and x^ts into the learned space; classify x^ts by comparing it to the training examples]

+ easy to optimize, computationally fast
− largely restricted to classification
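A prototypical-networks-style sketch of the comparison step (a hedged illustration; `embed` stands in for any learned embedding network):

```python
import torch

def proto_classify(embed, x_tr, y_tr, x_ts, n_way):
    """Classify test inputs by distance to class prototypes in embedding space."""
    z_tr, z_ts = embed(x_tr), embed(x_ts)
    # Prototype for each class: mean embedding of its training examples.
    protos = torch.stack([z_tr[y_tr == c].mean(dim=0) for c in range(n_way)])
    dists = torch.cdist(z_ts, protos)   # Euclidean distances to the prototypes
    return (-dists).softmax(dim=-1)     # nearer prototype -> higher class probability
```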

SLIDE 8

Recap: Task Construction Techniques

  • For N-way image classification: use labeled images from prior classes.
  • For adapting to regional differences: use labeled images from prior regions. (Rußwurm et al. Meta-Learning for Few-Shot Land Cover Classification. CVPR 2020 EarthVision Workshop)
  • For few-shot imitation learning: use demonstrations from prior tasks. (Yu et al. One-Shot Imitation Learning from Observing Humans. RSS 2018)

SLIDE 9

Plan for Today


Brief Recap of Meta-Learning & Task Construction

Memorization in Meta-Learning

  • When it arises
  • A potential solution

Meta-Learning without Tasks Provided

  • Unsupervised Meta-Learning
  • Meta-Learning from Unsegmented Task Stream (time permitting)
SLIDE 10

How we construct tasks for meta-learning.

[Figure: across tasks T_1, …, T_3, …, the same image classes receive different, randomly shuffled numeric labels]

Randomly assign class labels to image classes for each task. Algorithms must use the training data D^tr to infer the label ordering for the test input x^ts.

—> Tasks are mutually exclusive.
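A hedged sketch of this label-shuffling task sampler (names and data layout are made up for illustration): because each task re-maps the sampled classes to a fresh random label order, no single input-to-label function can solve every task.

```python
import random

def make_shuffled_task(images_by_class, n_way, k_shot, q_query):
    """Sample an N-way task with a random label order (mutually exclusive tasks)."""
    classes = random.sample(sorted(images_by_class), n_way)  # pick N image classes
    random.shuffle(classes)                                  # fresh label assignment per task
    d_tr, d_ts = [], []
    for label, c in enumerate(classes):
        examples = random.sample(images_by_class[c], k_shot + q_query)
        d_tr += [(x, label) for x in examples[:k_shot]]      # training data D^tr
        d_ts += [(x, label) for x in examples[k_shot:]]      # test data D^ts
    return d_tr, d_ts
```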

SLIDE 11

What if label order is consistent?

The network can simply learn to classify inputs, irrespective of D^tr.

[Figure: with a consistent label order, each image class is assigned the same numeric label in every task]

Tasks are non-mutually exclusive: a single function can solve all tasks.

SLIDE 12

The network can simply learn to classify inputs, irrespective of D^tr.

[Figure: the same issue arises for optimization-based meta-learning; the adapted parameters can ignore the inner update ∇_θℒ on D^tr]

SLIDE 13

What if label order is consistent?

[Figure: at meta-test time, a new task T_test provides training data D^tr and a test set]

For new image classes: can't make predictions w/o D^tr.

SLIDE 14

Is this a problem?

  • No: for image classification, we can just shuffle labels*
  • No, if we see the same image classes as in training (& don't need to adapt at meta-test time)
  • But yes, if we want to be able to adapt with data for new tasks.
SLIDE 15

Another example

If you tell the robot the task goal, the robot can ignore the trials.

[Figure: Meta-World meta-training tasks T_1, …, T_50 ("close drawer", "hammer", "stack", …) and a held-out meta-test task T_test: "close box"]

T Yu, D Quillen, Z He, R Julian, K Hausman, C Finn, S Levine. Meta-World. CoRL ‘19

SLIDE 16

Another example

Model can memorize the canonical orientations of the training objects.

Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. ICLR '20

SLIDE 17

Can we do something about it?

SLIDE 18

If tasks are mutually exclusive (e.g. due to label shuffling, hiding information): a single function cannot solve all tasks.

If tasks are non-mutually exclusive: a single function can solve all tasks, so there are multiple solutions to the meta-learning problem

y^ts = f_θ(D^tr_i, x^ts)

One solution: memorize the canonical pose info in θ & ignore D^tr_i.
Another solution: carry no info about canonical pose in θ; acquire it from D^tr_i.

This suggests a potential approach: control information flow. There is an entire spectrum of solutions based on how information flows.

Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. ICLR '20

SLIDE 19

An entire spectrum of solutions based on how information flows. If tasks are non-mutually exclusive, a single function can solve all tasks, so there are multiple solutions to the meta-learning problem

y^ts = f_θ(D^tr_i, x^ts)

One solution: memorize the canonical pose info in θ & ignore D^tr_i.
Another solution: carry no info about canonical pose in θ; acquire it from D^tr_i. One option: maximize I(ŷ^ts; D^tr | x^ts).

Meta-regularization: minimize the meta-training loss plus the information in θ:

ℒ(θ, D_meta-train) + β D_KL(q(θ; θ_μ, θ_σ) ∥ p(θ))

Places precedence on using information from D^tr over storing info in θ. Can combine with your favorite meta-learning algorithm.

Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. ICLR '20
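A minimal sketch of this objective, assuming a diagonal Gaussian weight distribution q(θ; θ_μ, θ_σ) and a standard normal prior p(θ) = 𝒩(0, I) (variable names are illustrative, not the paper's code):

```python
import torch

def meta_regularized_loss(meta_loss, theta_mu, theta_log_sigma, beta=1e-4):
    """Meta-training loss + beta * KL(q(theta; mu, sigma) || N(0, I))."""
    # Closed-form KL between a diagonal Gaussian and a standard normal prior.
    kl = 0.5 * torch.sum(
        theta_mu**2 + torch.exp(2 * theta_log_sigma) - 2 * theta_log_sigma - 1)
    return meta_loss + beta * kl
```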

SLIDE 20

Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. ICLR '20

(and it’s not just as simple as standard regularization)

[Figure: results on the pose prediction task and on "non-mutually-exclusive" Omniglot, i.e. Omniglot without label shuffling]

TAML: Jamal & Qi. Task-Agnostic Meta-Learning for Few-Shot Learning. CVPR ‘19

SLIDE 21

Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. ICLR '20

Does meta-regularization lead to better generalization?

Let P(θ) be an arbitrary distribution over θ that doesn't depend on the meta-training data (e.g. P(θ) = 𝒩(θ; 0, I)).

For MAML, with probability at least 1 − δ, for all θ_μ, θ_σ:

generalization error ≤ error on the meta-training set + meta-regularization term

With a Taylor expansion of the RHS + a particular value of β —> recover the MR-MAML objective.

Proof: draws heavily on Amit & Meir '18.

SLIDE 22

Summary of Memorization Problem

standard supervised learning:
  • standard overfitting: memorize the training datapoints (x_i, y_i) in your training dataset
  • standard regularization: regularize the hypothesis class (though not always for DNNs)

meta-learning:
  • meta-overfitting: memorize the training functions f_i corresponding to tasks in your meta-training dataset
  • meta-regularization: regularizes the description length of meta-parameters; controls information flow

SLIDE 23

Plan for Today

Brief Recap of Meta-Learning & Task Construction

Memorization in Meta-Learning

  • When it arises
  • A potential solution

Meta-Learning without Tasks

  • Unsupervised Meta-Learning
  • Meta-Learning from Unsegmented Task Stream (time permitting)


SLIDE 24

Where do tasks come from?

What if we only have unlabeled data? Few-shot meta-learning from:
  • unlabeled images
  • unlabeled text

Rußwurm et al. Meta-Learning for Few-Shot Land Cover Classification. 2020 (requires labeled data from other regions)

SLIDE 25

A general recipe for unsupervised meta-learning

Goal of unsupervised meta-learning methods: automatically construct tasks from unlabeled data.

Recipe: given unlabeled dataset(s) —> propose tasks —> run meta-learning.

Next: task construction from unlabeled image data, then task construction from unlabeled text data.

Question: What do you want the task set to look like? (answer in chat or raise hand)
  • 1. diverse (more likely to cover test tasks)
  • 2. structured (so that few-shot meta-learning is possible)

SLIDE 26

Can we meta-learn with only unlabeled images?

Pipeline: unsupervised learning (to get an embedding space) —> task construction: propose cluster discrimination tasks —> run meta-learning.

[Figure: cluster the embedded, unlabeled examples; cluster assignments (class 1 vs. class 2) define the labels of a classification task]

Result: a representation suitable for learning downstream tasks.

Hsu, Levine, Finn. Unsupervised Learning via Meta-Learning. ICLR '19
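A hedged sketch of the cluster-discrimination task proposal (CACTUs-style; function and variable names are assumptions, and k-means stands in for whatever clustering method is used):

```python
import numpy as np
from sklearn.cluster import KMeans

def propose_cluster_tasks(embeddings, n_clusters, n_way, k_shot, rng):
    """Cluster unlabeled embeddings; treat cluster identity as the class label."""
    cluster_ids = KMeans(n_clusters=n_clusters).fit_predict(embeddings)
    clusters = [np.flatnonzero(cluster_ids == c) for c in range(n_clusters)]
    clusters = [idx for idx in clusters if len(idx) >= k_shot]  # need enough examples
    chosen = rng.choice(len(clusters), size=n_way, replace=False)
    # Each selected cluster becomes one "class" of an N-way K-shot task.
    return [(clusters[c], label) for label, c in enumerate(chosen)]
```

Usage: pass e.g. rng = np.random.default_rng(0), and sample K-shot / Q-query examples from each returned index set to assemble D^tr_i and D^ts_i.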

SLIDE 27

Can we meta-learn with only unlabeled images?

A few options for each stage:
  • Unsupervised embedding: BiGAN (Donahue et al. '17), DeepCluster (Caron et al. '18)
  • Task construction: Clustering to Automatically Construct Tasks for Unsupervised Meta-Learning (CACTUs)
  • Meta-learning: MAML (Finn et al. '17), ProtoNets (Snell et al. '17)

miniImageNet 5-way 5-shot:

method                     accuracy
MAML with labels           62.13%
BiGAN kNN                  31.10%
BiGAN logistic             33.91%
BiGAN MLP + dropout        29.06%
BiGAN cluster matching     29.49%
BiGAN CACTUs MAML          51.28%
DeepCluster CACTUs MAML    53.97%

Same story for:
  • 4 different embedding methods
  • 4 datasets (Omniglot, CelebA, miniImageNet, MNIST)
  • 2 meta-learning methods*
  • Test tasks with larger datasets

*ProtoNets underperforms in some cases.

Hsu, Levine, Finn. Unsupervised Learning via Meta-Learning. ICLR '19

SLIDE 28

Can we use domain knowledge when constructing tasks?

Khodadadeh, Bölöni, Shah. Unsupervised Meta-Learning for Few-Shot Image Classification. NeurIPS ‘19

Task construction, for each task 𝒰_i:
  i. Randomly sample N images & assign labels 1, …, N —> store in D^tr_i
  ii. For each datapoint in D^tr_i, augment the image using domain knowledge —> store in D^ts_i

e.g. an image's label often won't change when you:
  • drop out some pixels
  • translate the image
  • reflect the image

[Figure: original images labeled 1, 2, 3 and their augmented counterparts, which keep labels 1, 2, 3]
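A hedged sketch of this augmentation-based construction (in the spirit of Khodadadeh et al.; `augment` is any label-preserving transform, and all names are illustrative):

```python
import random

def augmentation_task(unlabeled_images, n_way, augment):
    """N-way 1-shot task: originals form D^tr_i, augmented copies form D^ts_i."""
    sampled = random.sample(unlabeled_images, n_way)  # one image per pseudo-class
    d_tr = [(img, label) for label, img in enumerate(sampled)]
    d_ts = [(augment(img), label) for label, img in enumerate(sampled)]
    return d_tr, d_ts  # augment() should preserve the label (translate, flip, ...)
```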

SLIDE 29

Can we use domain knowledge when constructing tasks?

Khodadadeh, Bölöni, Shah. Unsupervised Meta-Learning for Few-Shot Image Classification. NeurIPS ‘19

How to augment in practice? (where we have good domain knowledge!)
  • Omniglot: translation & random pixel dropout
  • MiniImageNet: AutoAugment* (translation, rotation, shear)

*Cubuk et al. 2018

Results:
  • outstanding Omniglot performance
  • MiniImageNet: slightly underperforms CACTUs

SLIDE 30

Can we meta-learn with only unlabeled text?

Option A: Formulate it as a language modeling problem. Recall: GPT-3.

D^tr_i: a sequence of characters
D^ts_i: the following sequence of characters

Examples: spelling correction, simple math problems, translating between languages.

When might we not use this option?
  • harder to combine w/ optimization-based meta-learning
  • harder to apply to classification tasks (e.g. sentiment, political bias, etc.)

Brown, Mann, Ryder, Subbiah et al. Language Models are Few-Shot Learners. arXiv ‘20

SLIDE 31

Can we meta-learn with only unlabeled text?

Option B: Construct tasks by masking out words

Bansal, Jha, Munkhdalai, McCallum. Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks. EMNLP ‘20

For each task 𝒰_i:
  i. Sample a subset of N unique words & assign each a unique ID 1, …, N.
  ii. Sample K + Q sentences containing each word, masking the word out.
  iii. Construct D^tr_i and D^ts_i from the masked sentences & corresponding word IDs.

Task: classify the masked word.
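A hedged sketch of this masked-word task construction (SMLMT-style; `sentences_with` is a hypothetical index over the unlabeled corpus, and names are illustrative):

```python
import random

def masked_word_task(vocab, sentences_with, n_way, k_shot, q_query):
    """N-way task: classify which masked-out word a sentence contained."""
    words = random.sample(vocab, n_way)
    d_tr, d_ts = [], []
    for word_id, word in enumerate(words):
        sents = random.sample(sentences_with(word), k_shot + q_query)
        masked = [s.replace(word, "[MASK]") for s in sents]
        d_tr += [(s, word_id) for s in masked[:k_shot]]   # D^tr_i
        d_ts += [(s, word_id) for s in masked[k_shot:]]   # D^ts_i
    return d_tr, d_ts
```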

SLIDE 32

Bansal, Jha, Munkhdalai, McCallum. Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks. EMNLP ‘20

Entirely unsupervised pre-training:
  • BERT: standard self-supervised learning + fine-tuning
  • SMLMT: the proposed unsupervised meta-learning

Supervised or semi-supervised pre-training:
  • LEOPARD: optimization-based meta-learner (only on supervised tasks)
  • MT-BERT: multi-task learning + fine-tuning (on supervised tasks)
  • Hybrid-SMLMT: meta-learning on proposed tasks + supervised tasks

More results & analysis in the paper!

SLIDE 33

Plan for Today

Brief Recap of Meta-Learning & Task Construction

Memorization in Meta-Learning

  • When it arises
  • A potential solution

Meta-Learning without Tasks

  • Unsupervised Meta-Learning
  • Meta-Learning from Unsegmented Task Stream (time permitting)


SLIDE 34

What if we have a time series of labeled data?

  • predict energy demand
  • dynamics of a robot, car
  • transportation usage
  • stock market
  • video analytics
  • RL agent

Unsegmented, yet exhibits temporal structure. Can we segment the time series into tasks & meta-learn across tasks? How to segment?

Bayesian online changepoint detection (BOCPD) — Adams & MacKay '07
  • Assume the task switches with some probability at each time t.
  • Maintain a belief over task duration (run length), with a posterior for each duration.
  • Recursively update the belief using model performance.

BOCPD is differentiable! —> backprop through the belief update to meta-train the model (a sketch of the update follows below).

Harrison, Sharma, Finn, Pavone. Continuous Meta-Learning without Tasks. NeurIPS '20
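A hedged sketch of the BOCPD run-length update (simplified from Adams & MacKay '07; a constant hazard rate stands in for the task-switching probability):

```python
import numpy as np

def update_run_length_belief(belief, pred_likelihood, hazard=0.01):
    """One recursive BOCPD step over run-length (task duration) hypotheses.

    belief[r]          -- probability the current task has lasted r steps
    pred_likelihood[r] -- likelihood of the new datapoint under run length r
    """
    growth = belief * pred_likelihood * (1 - hazard)    # the task continues
    change = np.sum(belief * pred_likelihood) * hazard  # the task switches: r resets to 0
    new_belief = np.concatenate(([change], growth))
    return new_belief / new_belief.sum()                # normalized posterior
```

Because this update is composed of differentiable operations, the meta-learner can be trained by backpropagating through it, as the slide notes.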

SLIDE 35

Meta-Learning with Online Changepoint Analysis (MOCA)

Meta-training phase: given an unsegmented time series of offline data.
Meta-test phase: streaming online learning & prediction.

[Experiments: sinusoid regression with discrete shifts; a streaming variant of MiniImagenet]

Harrison, Sharma, Finn, Pavone. Continuous Meta-Learning without Tasks. NeurIPS ‘20

SLIDE 36

Plan for Today

Brief Recap of Meta-Learning & Task Construction

Memorization in Meta-Learning

  • When it arises
  • A potential solution

Meta-Learning without Tasks Provided

  • Unsupervised Meta-Learning
  • Meta-Learning from Unsegmented Task Stream (time permitting)


Goals for the end of lecture:

  • Understand when & how memorization in meta-learning may occur
  • Understand techniques for constructing tasks automatically

🚩 Disclaimer 🚩: These topics are at the bleeding edge of research.

SLIDE 37

Reminders


  • Homework 2 out, due Friday, October 16th
  • Project group form due Wednesday, October 7th (encouraged to do it early)
  • Project proposal due & presentations on October 14th