SLIDE 1

MetaFun: Meta-Learning with Iterative Functional Updates

Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiorek, Yee Whye Teh

37th International Conference on Machine Learning

SLIDES 2-4

Supervised Meta-Learning

SLIDES 5-10

Encoder-Decoder Approaches to Supervised Meta-Learning

What is learning? What is meta-learning? (In encoder-decoder approaches like CNP [1].)

"A model of learning": the encoder-decoder network itself acts as a model of learning — producing predictions from a context is learning, and training this model across tasks is meta-learning.

SLIDES 11-12

Encoder-Decoder Approaches to Supervised Meta-Learning

Encoder: summarises the context into a representation.
Decoder: predicts conditioned on the representation.
Both are parameterised by neural networks.
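To make the encoder-decoder picture concrete, here is a minimal CNP-style sketch: the encoder embeds each context pair and mean-pools into a single representation r, and the decoder predicts at target inputs conditioned on r. The layer sizes, the use of mean pooling, and the tiny randomly initialised MLPs are illustrative assumptions, not the architecture from the paper.

import numpy as np

def mlp(sizes, rng):
    # Small randomly initialised MLP; returns a forward function (illustration only, no training).
    params = [(rng.standard_normal((m, n)) * 0.1, np.zeros(n)) for m, n in zip(sizes[:-1], sizes[1:])]
    def forward(h):
        for i, (W, b) in enumerate(params):
            h = h @ W + b
            if i < len(params) - 1:
                h = np.tanh(h)
        return h
    return forward

rng = np.random.default_rng(0)
dim_x, dim_y, dim_r = 1, 1, 32
encoder = mlp([dim_x + dim_y, 64, dim_r], rng)   # summarises one (x_i, y_i) pair
decoder = mlp([dim_x + dim_r, 64, dim_y], rng)   # predicts conditioned on the representation

def cnp_predict(x_context, y_context, x_target):
    # Encoder: pool per-point embeddings into one representation (order of points does not matter).
    r = encoder(np.concatenate([x_context, y_context], axis=-1)).mean(axis=0)
    # Decoder: predict at every target input conditioned on the same representation r.
    r_tiled = np.tile(r, (x_target.shape[0], 1))
    return decoder(np.concatenate([x_target, r_tiled], axis=-1))

xc, yc = rng.standard_normal((5, dim_x)), rng.standard_normal((5, dim_y))
print(cnp_predict(xc, yc, rng.standard_normal((3, dim_x))).shape)   # (3, 1)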

SLIDES 13-14

Incorporating Inductive Biases into Deep Learning Models

Example: an image classifier (input image → "Dog") uses convolutional structure as an inductive bias. What are good inductive biases for "a model of learning"?

SLIDES 15-16

MetaFun Overview

What is a better form of set representation? What are good inductive biases/structures for the encoder?

SLIDES 17-25

MetaFun Overview

Set representation: Euclidean space (a fixed-dimensional vector, as in [1][2][7]) vs. function space (e.g. a Hilbert space) — a functional representation.

Desiderata for the encoder:

  • Permutation invariance: a permutation of the data points should not change the set representation.

  • Flexible capacity: a fixed-dimensional representation can be limiting for large set sizes [4] and often leads to underfitting [3].

  • Within-context and context-target interaction: self-attention modules [6] or relation networks [9] can model interaction within the context, but not context-target interaction.
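For reference, the permutation-invariance requirement on an encoder E of a context set C = {(x_i, y_i)}_{i=1}^N, and the fixed-dimensional sum-pooling form used in [1][2][7], can be written as follows (standard definitions, e.g. [5][7]):

    E\big(\{(x_i, y_i)\}_{i=1}^{N}\big) = E\big(\{(x_{\pi(i)}, y_{\pi(i)})\}_{i=1}^{N}\big) \quad \text{for every permutation } \pi

    r = \rho\Big(\sum_{i=1}^{N} \phi(x_i, y_i)\Big) \in \mathbb{R}^{d}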

SLIDES 26-28

MetaFun Overview

Encoders with an iterative structure: learning to update a representation with feedback is easier than learning the representation directly. An iterative structure may be a good inductive bias for "the model of learning", since learning algorithms are often iterative (e.g. gradient descent).

SLIDE 29

MetaFun

SLIDES 30-32

MetaFun and Functional Gradient Descent

For supervised learning problems, the objective function often has the form of a sum of per-datapoint losses over the context. Solving it by iterative optimisation over the model parameters gives gradient descent; solving it by iterative optimisation directly over the predictor in function space gives functional gradient descent.
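The equations on these slides did not survive extraction; the standard forms they refer to are, for a context C = {(x_i, y_i)} with per-point loss \ell and step size \alpha:

    L = \sum_{(x_i, y_i) \in C} \ell\big(f(x_i), y_i\big)

Gradient descent on the parameters \theta of f_\theta:

    \theta^{(t+1)} = \theta^{(t)} - \alpha \sum_{i} \nabla_\theta\, \ell\big(f_{\theta^{(t)}}(x_i), y_i\big)

Functional gradient descent on f itself, e.g. in an RKHS with kernel k:

    f^{(t+1)}(\cdot) = f^{(t)}(\cdot) - \alpha \sum_{i} \frac{\partial \ell\big(f^{(t)}(x_i), y_i\big)}{\partial f^{(t)}(x_i)}\, k(\cdot, x_i)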

SLIDES 33-41

MetaFun and Functional Gradient Descent

Building up one MetaFun iteration by analogy with a functional gradient step:

  • Evaluate the functional representation at the context inputs.

  • Local update function: compute an update at each context point (the learned counterpart of the per-point loss gradient).

  • Functional pooling: combine the local updates into an update defined over the whole input space.
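The "?" on these build slides marked the parts of the functional gradient step that MetaFun replaces with learned components. Schematically (a reconstruction of the equations lost in extraction, following the steps listed above), with functional representation r^{(t)} at iteration t:

    u_i = u\big(x_i, y_i, r^{(t)}(x_i)\big)
    (local update function, a neural network, in place of the loss gradient)

    \Delta r^{(t)}(\cdot) = \sum_{i \in C} k(\cdot, x_i)\, u_i
    (functional pooling; k is a deep kernel or attention weights)

    r^{(t+1)}(\cdot) = r^{(t)}(\cdot) - \alpha\, \Delta r^{(t)}(\cdot)
    (apply the functional update)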

SLIDES 42-45

MetaFun

MetaFun iteration: local update function → functional pooling → apply functional updates. The representation after the last iteration is the final functional representation, which is passed to the decoder for prediction.

The functional representation with an iterative encoder satisfies all three desiderata:
Permutation invariance ✔  Flexible capacity ✔  Within-context and context-target interaction ✔
Both the within-context interaction and the interaction between context and target are considered when updating the representation at each iteration.
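A minimal numpy sketch of the iteration above, using an RBF kernel on learned features as the deep kernel and small randomly initialised MLPs for the learned components. Dimensions, the number of iterations, the zero initial representation, and these particular parameterisations are assumptions for illustration, not the configuration used in the paper.

import numpy as np

rng = np.random.default_rng(0)
dim_x, dim_y, dim_r, T, alpha = 1, 1, 16, 3, 0.5

def mlp(sizes):
    # Small randomly initialised MLP (illustration only, no training).
    params = [(rng.standard_normal((m, n)) * 0.1, np.zeros(n)) for m, n in zip(sizes[:-1], sizes[1:])]
    def forward(h):
        for i, (W, b) in enumerate(params):
            h = np.tanh(h @ W + b) if i < len(params) - 1 else h @ W + b
        return h
    return forward

feature   = mlp([dim_x, 32, 8])                       # phi(x): features for the deep kernel
update_fn = mlp([dim_x + dim_y + dim_r, 32, dim_r])   # u(x_i, y_i, r(x_i)): local update function
decoder   = mlp([dim_r, 32, dim_y])                   # maps the final r(x*) to a prediction

def kernel(xa, xb):
    # Deep kernel: RBF on the learned features phi(.).
    fa, fb = feature(xa), feature(xb)
    return np.exp(-((fa[:, None, :] - fb[None, :, :]) ** 2).sum(-1))

def metafun_predict(x_context, y_context, x_target):
    x_all = np.concatenate([x_context, x_target], axis=0)
    r_all = np.zeros((x_all.shape[0], dim_r))      # r^(0) evaluated at context and target inputs
    K = kernel(x_all, x_context)                   # pooling weights k(., x_i) for i in the context
    for _ in range(T):
        r_context = r_all[:len(x_context)]
        u = update_fn(np.concatenate([x_context, y_context, r_context], axis=-1))  # local updates
        r_all = r_all - alpha * (K @ u)            # functional pooling + apply functional update
    return decoder(r_all[len(x_context):])         # predict at the target inputs

xc, yc = rng.uniform(-5, 5, (10, dim_x)), rng.standard_normal((10, dim_y))
print(metafun_predict(xc, yc, np.linspace(-5, 5, 7).reshape(-1, dim_x)).shape)   # (7, 1)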

SLIDES 46-53

MetaFun for Classification

MetaFun iteration: local update function → functional pooling (deep kernels or attention modules) → apply functional updates.

Local update function:

  • Regression: an MLP on the concatenation of the inputs.

  • Classification: a specially structured update function (shown on the slide) that incorporates the label information into the network structure rather than concatenating the label to the inputs, and naturally integrates within-class and between-class interaction.
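One way to read "incorporate label information into the network structure" is to let the label route class-specific updates instead of being concatenated to the inputs. The sketch below is only an illustration of that idea under assumed shapes (one representation slot per class, two small update networks); it is not the paper's exact parameterisation.

import numpy as np

rng = np.random.default_rng(0)
num_classes, dim_r = 5, 8   # assumption: the representation keeps one dim_r slot per class

def mlp(sizes):
    # Small randomly initialised MLP (illustration only, no training).
    params = [(rng.standard_normal((m, n)) * 0.1, np.zeros(n)) for m, n in zip(sizes[:-1], sizes[1:])]
    def forward(h):
        for i, (W, b) in enumerate(params):
            h = np.tanh(h @ W + b) if i < len(params) - 1 else h @ W + b
        return h
    return forward

u_pos = mlp([dim_r, 32, dim_r])   # update for the slot of the point's own class
u_neg = mlp([dim_r, 32, dim_r])   # update for the slots of every other class

def local_update(r_i, y_i):
    # r_i: (num_classes, dim_r) representation evaluated at x_i; y_i: integer class label.
    onehot = np.eye(num_classes)[y_i][:, None]
    # The label selects which learned update each class slot receives, giving
    # within-class and between-class interaction without concatenating the label.
    return onehot * u_pos(r_i) + (1.0 - onehot) * u_neg(r_i)

print(local_update(np.zeros((num_classes, dim_r)), y_i=2).shape)   # (5, 8)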

SLIDES 54-58

MetaFun and Gradient-Based Meta-Learning

Comparison with Model-Agnostic Meta-Learning (MAML) [8]. During the meta-training phase, MAML finds a good initialisation from related tasks; at test time, it runs a few gradient descent steps from the learned initialisation on the context of a new task.

Viewed through the MetaFun iteration (local update function → functional pooling → apply functional updates):

  • MAML: local updates follow the gradient; per-point updates are combined by SumPooling (permutation-invariant).

  • MetaFun: the local update function is parameterised by neural networks; updates are combined by FunPooling.
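The correspondence drawn here can be written out. MAML's inner loop on a new task's context is a sum-pooled gradient update on the parameters; MetaFun's iteration is the analogous update on the functional representation, with both the local update and the pooling learned:

    \text{MAML:}\quad \theta^{(t+1)} = \theta^{(t)} - \alpha \sum_{i \in C} \nabla_\theta\, \ell\big(f_{\theta^{(t)}}(x_i), y_i\big)
    (local updates follow the gradient; the sum over the context is permutation-invariant SumPooling)

    \text{MetaFun:}\quad r^{(t+1)}(\cdot) = r^{(t)}(\cdot) - \alpha \sum_{i \in C} k(\cdot, x_i)\, u\big(x_i, y_i, r^{(t)}(x_i)\big)
    (the local update function u is parameterised by neural networks; the kernel or attention weights k give FunPooling)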

SLIDES 59-61

MetaFun and Gradient-Based Meta-Learning

1D sinusoid regression tasks. MetaFun: smooth updates that match the ground truth well across the whole period. MAML: non-smooth updates and weaker predictions, especially on the left side where there are no context points.
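For context, 1D sinusoid regression is the standard toy meta-learning benchmark from the MAML paper [8]: each task is a sinusoid with its own amplitude and phase, and the context is a handful of points from it. The sampler below follows the ranges in that paper; whether this deck used exactly the same ranges is an assumption.

import numpy as np

def sample_sinusoid_task(rng, num_context=5, num_target=50):
    # One task: y = A * sin(x - phase), with ranges as in the MAML paper [8].
    amplitude = rng.uniform(0.1, 5.0)
    phase = rng.uniform(0.0, np.pi)
    f = lambda x: amplitude * np.sin(x - phase)
    x_context = rng.uniform(-5.0, 5.0, (num_context, 1))
    x_target = np.linspace(-5.0, 5.0, num_target).reshape(-1, 1)
    return (x_context, f(x_context)), (x_target, f(x_target))

rng = np.random.default_rng(0)
(xc, yc), (xt, yt) = sample_sinusoid_task(rng)
print(xc.shape, yc.shape, xt.shape, yt.shape)   # (5, 1) (5, 1) (50, 1) (50, 1)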

SLIDES 62-64

Large-Scale Few-shot Classification

miniImageNet (without data augmentation)
Model                          1-shot           5-shot
LEO [9]                        61.76 ± 0.08%    77.59 ± 0.12%
MetaFun (deep kernel version)  61.16 ± 0.15%    78.20 ± 0.16%
MetaFun (attention version)    62.12 ± 0.30%    77.78 ± 0.12%

miniImageNet (with data augmentation)
Model                          1-shot           5-shot
LEO                            63.97 ± 0.20%    79.49 ± 0.70%
MetaOptNet-SVM [10]            64.09 ± 0.62%    80.00 ± 0.45%
MetaFun (deep kernel version)  63.39 ± 0.15%    80.81 ± 0.10%
MetaFun (attention version)    64.13 ± 0.13%    80.82 ± 0.17%

tieredImageNet (without data augmentation)
Model                          1-shot           5-shot
LEO                            66.33 ± 0.05%    81.44 ± 0.09%
MetaOptNet-SVM                 65.81 ± 0.74%    81.75 ± 0.58%
MetaFun (deep kernel version)  67.27 ± 0.20%    83.28 ± 0.12%
MetaFun (attention version)    67.72 ± 0.14%    82.81 ± 0.15%

We demonstrate that encoder-decoder-style meta-learning methods like conditional neural processes can also achieve state-of-the-art results on large-scale few-shot classification benchmarks. The key ingredients are the functional set representation and the iterative structure of the encoder.

SLIDE 65

Thank you!

jin.xu@stats.ox.ac.uk   @jinxu06   (code available here)

SLIDE 66

References

[1] Garnelo, Marta, et al. "Conditional Neural Processes." International Conference on Machine Learning. 2018.
[2] Garnelo, Marta, et al. "Neural Processes." arXiv preprint arXiv:1807.01622 (2018).
[3] Kim, Hyunjik, et al. "Attentive Neural Processes." International Conference on Learning Representations. 2019.
[4] Wagstaff, Edward, et al. "On the Limitations of Representing Functions on Sets." International Conference on Machine Learning. 2019.
[5] Bloem-Reddy, Benjamin, and Yee Whye Teh. "Probabilistic Symmetries and Invariant Neural Networks." Journal of Machine Learning Research 21(90):1-61, 2020.
[6] Lee, Juho, et al. "Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks." International Conference on Machine Learning. 2019.
[7] Zaheer, Manzil, et al. "Deep Sets." Advances in Neural Information Processing Systems. 2017.
[8] Finn, Chelsea, Pieter Abbeel, and Sergey Levine. "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks." International Conference on Machine Learning. 2017.
[9] Rusu, Andrei A., et al. "Meta-Learning with Latent Embedding Optimization." International Conference on Learning Representations. 2019.
[10] Lee, Kwonjoon, et al. "Meta-Learning with Differentiable Convex Optimization." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019.