CS 330 Paper Review Learning to learn distributions Why Learn - PowerPoint PPT Presentation

CS 330 Paper Review

Learning to learn distributions ● Why Learn distributions aka learn p(x)? ○ To generate data. But why generate data? Motivation ● Why learning to learn distributions? ○ For quick (few-shot) learning & generation of test tasks!

Learning Set-Up What is a Task? Given a support set of images generate an image that looks similar to the support set! To generate: Sample x’ ~ p(x | s ; θ) Problem Training Tasks: Statement Testing Tasks: Central Goal: Use the training tasks for learning* how to ‘quickly’ learn distributions so as to do Few Shot Image Generation on test tasks! *neural attention * meta learning

Pre-requisites: Autoregressive models ● Modelling Assumption: We are using parameterized functions to predict next pixel given all the previous pixels. ○ Sequential Ordering assumed to break joint distribution into product of marginals (chain rule) Method Overview

Pre-requisites: PixelCNN Loss (generated, target) Energy Distance as loss. Method Overview

Pre-requisites: Attention Method Overview

Baseline: Conditional PixelCNN (Gating) Challenge: “PixelRNNs, which use spatial LSTM layers instead of convolutional stacks, have previously been shown to outperform PixelCNNs as generative models” Explaination: “One potential advantage is that PixelRNNs contain multiplicative units (in the form of the LSTM gates), which may help it to model more complex interactions . To amend this we replaced the rectified linear units between the masked convolutions in Model Setup the original pixelCNN with the following gated activation unit” - C-PixelCNN Authors

Baseline: Conditional PixelCNN Key Idea: Given a high-level image description represented as a latent vector h, we model the conditional distribution p(x|h) of images suiting the description Model Setup Why not use a summary vector representing the support set h = f(s) f(s) is just a learned encoding of the support set!

Proposal 1: Attention PixelCNN (explicit conditioning with attention) Challenge: conditional PixelCNN works, the encoding f(s) was shared across all pixels . Key Idea: “ different points of generating the target image x, different aspects of the support images may become relevant .” -- Learning to learn distributions Authors Positional Features: Supporting images augmented with a channel encoding position within Attention the image normalized to [−1, 1] PixelCNN Loss(generated, target)

Proposal 2: Meta PixelCNN (implicit conditioning with gradient descent) Key Idea: The conditioning pathway (i.e. flow of information from supports s to the next pixel xt) introduces no additional parameters. Meta PixelCNN The features q are fed through a convolutional network g (parameters included in θ) producing a scalar, which is treated as the learned inner loss .

Tasks Character & Image Generation Task Difficulty Experiments Image Inversion

Datasets Stanford Online Product (SOP) Task Difficulty Experiments Omniglot ImageNet

Evaluation Metrics ● Qualitative and Quantitative ● Nats: a unit of information or entropy, based on natural Experiments logarithms and powers of e

1-shot Image Generation Conditional PixelCNN Attention PixelCNN Image Inversion Meta PixelCNN with ImageNet Attention PixelCNN’s attention head learns to move and copy in a right-to-left order while the output writes left-to-right.

Few-shot Character Generation Character Generation with Omniglot

Few-shot Image Generation Image Generation with SOP 2.14 nats/dim 2.15 nats/dim

Strengths: ● Attention is great for flipping images! (one-shot generation) ● Meta generative models can generate unseen characters. ● Inner loss function is learnable. Takeaways Weaknesses: ● Few shot image generation needs a new model. ● No analysis on inner loop gradient steps vs performance. ● Naive combination of meta learning and attention. ● Inconsistent experiments.

● Why Meta-PixelCNN is unable to perform well on one-shot generation (experiments on Imagenet Flipping)? ● Would multiple gradient steps in the inner loop of meta learning improve performance? Discussion & Future Work ● Sophisticated combination of attention & meta-learning? ○ Attentive Meta-Learning ● Learned Inner Loss: Since the loss function is learned and unconstrained, how are we guaranteed that it is actually emulating the loss on the task?

Ground Truth Discussion & Future Work

CS 330 Paper Review Learning to learn distributions Why Learn - PowerPoint PPT Presentation

CS 330 Paper Review Learning to learn distributions Why Learn distributions aka learn p(x)? To generate data. But why generate data? Motivation Why learning to learn distributions? For quick (few-shot) learning &

Pie iers 30-32 & Seawall Lot 330 Port Maritime Commerce Advisory Committee July 18, 2019

ELDER AND VULNERABLE ADULT REFERRAL AGENCY ACT RCW 18.330 Heidi S heldon Owner &

For immediate inquiries, please contact ProKure Klean at 330.273.0122 www.ProKureKlean.com, or

N -Marketing Director of SNP, Chairman of Newport Cypress Co Area: 330,966.9 km 2

1 Our About 330 million tons of plas4c were produced in 2016, of which about 37% was

July 2018 James Cooper-Jones - CEO CROPLOGIC LIMITED | ARBN 619 330 648 | NEW ZEALAND COMPANY

Pristina Capital Partners LLC 666 Godwin Avenue, Suite 330 Midland Park, NJ 07432 Tel: 201 670

DISTRIBUTED BY Diagnostics LIASYS MODELS LIASYS 330 - for POL with 10-40 patients/day LIASYS 450

West Plains and Airfields State Law RCW 35.13.177 & RCW 35A.14.330 -allows for the

Circular 330-AN/189 Civil/Military Cooperation in Air Traffic Management Gustavo De Leon ICAO-

Item #330: 2015 Resident Satisfaction Survey Findings for March 8, 2016 Performance &

Michigans largest landscape stone & building stone quarry . 330 Austin Road P.O. Box 335

United States Court of Appeals for the Federal Circuit 2009-1475 (Serial No. 10/330,732) IN RE

A R C H I T E C T U R E 330 WEST 38 STREET,#309 NEW YORK, NY 10018 TEL: 212 380 6114 E FAX:

Diagnostics Commercial & Residential www.vaporremoval.com (330) 915.2650 Tony McDonald

Circular 330-AN/189 Civil/Military Cooperation in Air Traffic Management Sven Halle ICAO- Paris

COLORIZATION USING KNET Jeffrey Lu and Kevin Liu 6.338/18.337 Fall 2017 MOTIVATION Image

Optimizing Pixel Predictors for Steganalysis Vojtch Holub and Jessica Fridrich Dept. of

Ms. Doane Monday, Feb 12 Rines Auditorium Corenna Behnke, PPD Nick Wilson, Maine

Investor Analyst Presentation Margaret Dawes 2 nd October 2014 Insurance Financial Planning

https://bankaust.com.au/responsible-banking/planet/environmental-policy/

The WiiMaze Shaun Salzberg Mack Lu Brian Ramos The Game Hardware Overview Actual maze,

CMOS Active Pixel Sensor for a Polarization-difference Camera Gregory J. Barlow SUNFEST

WCSP 2016 PPViBe: Privacy Preserving Background Extractor via Secret Sharing in Multiple Cloud

CS 330 Paper Review Learning to learn distributions Why Learn - PowerPoint PPT Presentation

CS 330 Paper Review Learning to learn distributions Why Learn distributions aka learn p(x)? To generate data. But why generate data? Motivation Why learning to learn distributions? For quick (few-shot) learning &

Pie iers 30-32 &amp; Seawall Lot 330 Port Maritime Commerce Advisory Committee July 18, 2019

ELDER AND VULNERABLE ADULT REFERRAL AGENCY ACT RCW 18.330 Heidi S heldon Owner &amp;

For immediate inquiries, please contact ProKure Klean at 330.273.0122 www.ProKureKlean.com, or

N -Marketing Director of SNP, Chairman of Newport Cypress Co Area: 330,966.9 km 2

1 Our About 330 million tons of plas4c were produced in 2016, of which about 37% was

July 2018 James Cooper-Jones - CEO CROPLOGIC LIMITED | ARBN 619 330 648 | NEW ZEALAND COMPANY

Pristina Capital Partners LLC 666 Godwin Avenue, Suite 330 Midland Park, NJ 07432 Tel: 201 670

DISTRIBUTED BY Diagnostics LIASYS MODELS LIASYS 330 - for POL with 10-40 patients/day LIASYS 450

West Plains and Airfields State Law RCW 35.13.177 &amp; RCW 35A.14.330 -allows for the

Circular 330-AN/189 Civil/Military Cooperation in Air Traffic Management Gustavo De Leon ICAO-

Item #330: 2015 Resident Satisfaction Survey Findings for March 8, 2016 Performance &amp;

Michigans largest landscape stone &amp; building stone quarry . 330 Austin Road P.O. Box 335

United States Court of Appeals for the Federal Circuit 2009-1475 (Serial No. 10/330,732) IN RE

A R C H I T E C T U R E 330 WEST 38 STREET,#309 NEW YORK, NY 10018 TEL: 212 380 6114 E FAX:

Diagnostics Commercial &amp; Residential www.vaporremoval.com (330) 915.2650 Tony McDonald

Circular 330-AN/189 Civil/Military Cooperation in Air Traffic Management Sven Halle ICAO- Paris

COLORIZATION USING KNET Jeffrey Lu and Kevin Liu 6.338/18.337 Fall 2017 MOTIVATION Image

Optimizing Pixel Predictors for Steganalysis Vojtch Holub and Jessica Fridrich Dept. of

Ms. Doane Monday, Feb 12 Rines Auditorium Corenna Behnke, PPD Nick Wilson, Maine

Investor Analyst Presentation Margaret Dawes 2 nd October 2014 Insurance Financial Planning

https://bankaust.com.au/responsible-banking/planet/environmental-policy/

The WiiMaze Shaun Salzberg Mack Lu Brian Ramos The Game Hardware Overview Actual maze,

CMOS Active Pixel Sensor for a Polarization-difference Camera Gregory J. Barlow SUNFEST

WCSP 2016 PPViBe: Privacy Preserving Background Extractor via Secret Sharing in Multiple Cloud

Pie iers 30-32 & Seawall Lot 330 Port Maritime Commerce Advisory Committee July 18, 2019

ELDER AND VULNERABLE ADULT REFERRAL AGENCY ACT RCW 18.330 Heidi S heldon Owner &

West Plains and Airfields State Law RCW 35.13.177 & RCW 35A.14.330 -allows for the

Item #330: 2015 Resident Satisfaction Survey Findings for March 8, 2016 Performance &

Michigans largest landscape stone & building stone quarry . 330 Austin Road P.O. Box 335

Diagnostics Commercial & Residential www.vaporremoval.com (330) 915.2650 Tony McDonald