Deep Multi-Task and Meta-Learning (CS 330): Introductions



SLIDE 1

CS 330

Deep Multi-Task and Meta-Learning

SLIDE 2

Introductions

More TAs coming soon.

Chelsea Finn

Instructor

Karol Hausman

Co-Lecturer

Mason Swofford

TA

Dilip Arumugan

TA

Rafael Rafailov

TA

Albert Tung

TA

SLIDE 3

We’re here. Image source: https://covid-19archive.org/s/archive/item/19465

SLIDE 4

The Plan for CS330 in 2020

Live lectures on Zoom, as interactive as possible

  • Ask questions!
  • By raising your hand (preferred)
  • By entering the question in chat
  • Camera use encouraged when possible, but not at all required
  • Lectures from Karol, Matt Johnson, Jane Wang to mix things up
  • Project proposal spotlights, project presentations
  • Options for students in far-away timezones, conflicts, Zoom fatigue

Case studies of important & timely applications

  • Multi-objective learning in the YouTube recommendation system
  • Meta-learning for few-shot land cover classification
  • Few-shot learning from GPT-3

Assignments & Project

  • Short project spotlight presentations
  • Less time for project than typical (no end-of-term period)
  • Making fourth assignment optional

Rußwurm et al. Meta-Learning for Few-Shot Land Cover Classification. 2020. Brown et al. Language Models are Few-Shot Learners. 2020. Zhao et al. Recommending What Video to Watch Next. 2019.

SLIDE 5

First question: How are you doing?

(answer in chat)

SLIDE 6

The Plan for Today

  • 1. Course logistics
  • 2. Why study multi-task learning and meta-learning?
SLIDE 7

Course Logistics

SLIDE 8

Information & Resources

Course website: http://cs330.stanford.edu/
Piazza: Stanford, CS330
Staff mailing list: cs330-aut2021-staff@lists.stanford.edu
Office hours: check course website & Piazza; start on Wednesday.

SLIDE 9

Pre-Requisites and Enrollment

Pre-requisites: CS229 or equivalent; previous or concurrent RL knowledge highly recommended.

Lectures are recorded, and

  • will be internally released on Canvas after each lecture
  • will be edited & publicly released after the course
SLIDE 10

Assignment Infrastructure

Assignments will require training networks in TensorFlow (TF) in Colab notebooks. TF review section:

  • Rafael will hold a TF 2.0 review session on Thursday, September 17, 6 pm PT.
  • You should be able to understand the overview here:

https://www.tensorflow.org/guide/eager

  • If you don’t, go to the review session & ask questions!
SLIDE 11

Topics

1. Multi-task learning, transfer learning basics
2. Meta-learning algorithms (black-box approaches, optimization-based meta-learning, metric learning)
3. Advanced meta-learning topics (meta-overfitting, unsupervised meta-learning)
4. Hierarchical Bayesian models & meta-learning
5. Multi-task RL, goal-conditioned RL
6. Meta-reinforcement learning
7. Hierarchical RL
8. Lifelong learning
9. Open problems

Emphasis on deep learning techniques. Emphasis on the reinforcement learning domain (6 lectures).

SLIDE 12

Topics We Won’t Cover

Won’t cover AutoML topics:

  • architecture search
  • hyperparameter optimization
  • learning optimizers

Though many of the underlying techniques will be covered.

SLIDE 13

Assignments & Final Project

Homework 1: Multi-task data processing, black-box meta-learning
Homework 2: Gradient-based meta-learning & metric learning
Homework 3: Multi-task RL, goal relabeling
Homework 4 (optional): Meta-RL
Project: Research-level project of your choice. Form groups of 1-3 students; you’re encouraged to start early!
Grading: 45% homework (15% each), 55% project
Late days: 6 total across homeworks and project-related assignments, with a maximum of 2 late days per assignment.

HW4 either replaces one prior HW or part of the project grade (whichever is better for your grade).

SLIDE 14

Homework Today

  • 1. Sign up for Piazza
  • 2. Start forming final project groups if you want to work in a group
  • 3. Review this: https://www.tensorflow.org/guide/eager
SLIDE 15

The Plan for Today

  • 1. Course logistics
  • 2. Why study multi-task learning and meta-learning?
SLIDE 16

Some of My Research

(and why I care about multi-task learning and meta-learning)

SLIDE 17

Xie, Ebert, Levine, Finn, RSS ‘19

Why robots? Robots can teach us things about intelligence. Robots faced with the real world:

  • must generalize across tasks, objects, environments, etc.
  • need some common-sense understanding to do well
  • can’t take supervision for granted

Levine*, Finn*, Darrell, Abbeel. JMLR ’16. Yu*, Finn*, Xie, Dasari, Zhang, Abbeel, Levine. RSS ’18.

How can we enable agents to learn a breadth of skills in the real world?

SLIDE 18

Beginning of my PhD: the robot had its eyes closed.

Levine et al. ICRA ‘15

SLIDE 19

Levine*, Finn* et al. JMLR ‘16

SLIDE 20

Finn et al. ICRA ‘16

SLIDE 21

Learn one task in one environment, starting from scratch

Robot reinforcement learning: Finn et al. ’16, Yahya et al. ’17, Ghadirzadeh et al. ’17, Chebotar et al. ’17. Reinforcement learning: Atari, locomotion.

SLIDE 22

Behind the scenes… It’s not practical to collect a lot of data this way. Yevgen is doing more work than the robot!

SLIDE 23

Not just a problem with reinforcement learning & robotics.

Robot reinforcement learning: learn one task in one environment, starting from scratch, relying on detailed supervision and guidance (Finn et al. ’16); more diverse, yet still one task, from scratch, with detailed supervision (Yahya et al. ’17, Ghadirzadeh et al. ’17, Chebotar et al. ’17). Reinforcement learning: Atari, locomotion.

Specialists [single task]: machine translation, object detection, speech recognition.

SLIDE 24

Humans are generalists.

Source: https://youtu.be/8vNxjwt2AqY

SLIDE 25

vs.

Source: https://i.imgur.com/hJIVfZ5.jpg

SLIDE 26

Why should we care about multi-task & meta-learning?

…beyond the robots and general-purpose ML systems

SLIDE 28

Slide adapted from Sergey Levine

Standard computer vision: hand-designed features.
Modern computer vision: end-to-end training.

Krizhevsky et al. ‘12

Deep learning allows us to handle unstructured inputs (pixels, language, sensor readings, etc.) without hand-engineering features, with less domain knowledge

SLIDE 29

Deep learning for object classification (AlexNet; image source: Wikipedia). Deep learning for machine translation.

GNMT: Google’s neural machine translation (in 2016). PBMT: phrase-based machine translation. Human evaluation scores on a scale of 0 to 6.

Why deep multi-task and meta-learning?

SLIDE 30

What if you don’t have a large dataset?

medical imaging, robotics, personalized education, medicine, recommendations, translation for rare languages

Deep learning: large, diverse data (+ large models) → broad generalization.
Vaswani et al. ’18. Wu et al. ’16. Russakovsky et al. ’14.

Impractical to learn from scratch for each disease, each robot, each person, each language, each task.

SLIDE 31

What if your data has a long tail?

driving scenarios, words heard, objects encountered, interactions with people

[Long-tail plot: # of datapoints per scenario, from big data at the head to small data in the tail]

This setting breaks standard machine learning paradigms.

SLIDE 32

What if you need to quickly learn something new?

about a new person, for a new task, about a new environment, etc.

SLIDE 33

Training data: paintings by Cezanne and Braque. Test datapoint: by Braque or Cezanne?

SLIDE 34

What if you need to quickly learn something new?

about a new person, for a new task, about a new environment, etc.

How did you accomplish this?

by leveraging prior experience! “few-shot learning”

SLIDE 35

This is where elements of multi-task learning can come into play.

What if you don’t have a large dataset?
medical imaging, robotics, personalized education, medicine, recommendations, translation for rare languages

What if your data has a long tail?

What if you need to quickly learn something new?
about a new person, for a new task, about a new environment, etc.

What if you want a more general-purpose AI system?

Learning each task from scratch won’t cut it.

SLIDE 36

What is a task?

SLIDE 37

What is a task?

For now: a task is given by a dataset 𝒟, a loss function ℒ, and a model f_θ. Different tasks can vary based on:

  • different objects
  • different people
  • different objectives
  • different lighting conditions
  • different words
  • different languages

Not just different “tasks” in the colloquial sense.
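The informal notion above (a task as a dataset 𝒟 and a loss function ℒ used to train a model f_θ) can be sketched in plain Python. The class and names below are illustrative, not from the course materials:

```python
# Informal sketch of a "task": a dataset D plus a loss function L,
# used to train/evaluate a model f_theta. Illustrative names only.

def mse_loss(preds, targets):
    # Mean squared error over a batch of predictions.
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(preds)

class Task:
    def __init__(self, dataset, loss_fn):
        self.dataset = dataset  # D: list of (input, target) pairs
        self.loss_fn = loss_fn  # L

    def evaluate(self, model_fn):
        # Loss of model f_theta on this task's dataset.
        preds = [model_fn(x) for x, _ in self.dataset]
        targets = [y for _, y in self.dataset]
        return self.loss_fn(preds, targets)

# Two "different tasks": same inputs, different objectives.
double = Task([(x, 2 * x) for x in range(5)], mse_loss)
negate = Task([(x, -x) for x in range(5)], mse_loss)

f = lambda x: 2 * x  # a model that is perfect for one task but not the other
print(double.evaluate(f))  # 0.0
print(negate.evaluate(f))  # 54.0
```

Multi-task and meta-learning methods operate over a collection of such task objects rather than a single one.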

SLIDE 38

Critical Assumption

The bad news: different tasks need to share some structure. If this doesn’t hold, you are better off using single-task learning.

The good news: there are many tasks with shared structure! Even if the tasks are seemingly unrelated:

  • The laws of physics underlie real data.
  • People are all organisms with intentions.
  • The rules of English underlie English language data.
  • Languages all develop for similar purposes.

This leads to far greater structure than random tasks.

SLIDE 39

Informal Problem Definitions

The multi-task learning problem: learn all of the tasks more quickly or more proficiently than learning them independently.

The meta-learning problem: given data/experience on previous tasks, learn a new task more quickly and/or more proficiently.

We’ll define these more formally next time. This course covers anything that solves these problem statements.

SLIDE 40

Doesn’t multi-task learning reduce to single-task learning?

𝒟 = ∪ᵢ 𝒟ᵢ,   ℒ = Σᵢ ℒᵢ

Are we done with the course?

SLIDE 41

Doesn’t multi-task learning reduce to single-task learning? Yes, it can! Aggregating the data across tasks & learning a single model is one approach to multi-task learning. But we can often do better! Exploit the fact that we know the data is coming from different tasks.
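Aggregating data across tasks and summing the per-task losses, as described above, can be sketched in plain Python. The helper names here are hypothetical, not the course’s code:

```python
# One approach to multi-task learning: pool the data across tasks
# (D = union of the D_i) and train a single model on the summed loss
# (L = sum of the L_i). A sketch with hypothetical names.

def summed_loss(model_fn, tasks):
    # L(theta) = sum_i L_i(theta), each L_i evaluated on its own D_i.
    total = 0.0
    for examples, loss_fn in tasks:
        total += sum(loss_fn(model_fn(x), y) for x, y in examples)
    return total

sq_err = lambda pred, target: (pred - target) ** 2
task_1 = ([(0, 0), (1, 1)], sq_err)  # targets: identity
task_2 = ([(0, 1), (1, 2)], sq_err)  # targets: shifted by one

f = lambda x: x  # one shared model for all tasks
print(summed_loss(f, [task_1, task_2]))  # 2.0 (0.0 from task_1, 2.0 from task_2)
```

The per-task structure kept in `tasks` is exactly what richer multi-task methods exploit instead of optimizing only this pooled objective.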

SLIDE 42

Why now?

Why should we study deep multi-task & meta-learning now?

SLIDE 43

Bengio et al., 1992. Caruana, 1997. Thrun, 1998.

SLIDE 44

These algorithms are continuing to play a fundamental role in machine learning research.

Multi-domain learning for sim2real transfer: CAD2RL. Sadeghi & Levine, RSS 2017.
One-shot imitation learning from humans: DAML. Yu et al., RSS 2018.
Multilingual machine translation: NAACL 2019.
YouTube recommendations: RecSys 2019.
Text-to-Text Transformer: Raffel et al., JMLR 2020.

SLIDE 45

These algorithms are playing a fundamental, and increasing, role in machine learning research.

Interest level via Google search queries (graph sources: Google Scholar, Google Trends).

How Transferable Are Features in a Deep Neural Network? Yosinski et al. ’14.
Learning to Learn by Gradient Descent by Gradient Descent. Andrychowicz et al. ’16.
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. Finn et al. ’17.
An Overview of Multi-Task Learning in Deep Neural Networks. Ruder ’17.

SLIDE 46

Its success will be critical for the democratization of deep learning.

ImageNet: 1.2 million images and labels
WMT ’14 English-French: 40.8 million paired sentences
Switchboard Speech Dataset: 300 hours of labeled data
Kaggle’s Diabetic Retinopathy Detection dataset: 35K labeled images
Adaptive epilepsy treatment with RL (Guez et al. ’08): < 1 hour of data
Learning for robotic manipulation (Finn et al. ’16): < 15 min of data

SLIDE 47

But we still have many open questions and challenges!

SLIDE 48

Reminder: Homework Today

  • 1. Sign up for Piazza
  • 2. Start forming final project groups if you want to work in a group
  • 3. Review this: https://www.tensorflow.org/guide/eager

Next time (Weds): Multi-Task Learning & Transfer Learning Basics