HyperGAN:
Generating Diverse, Performant Neural Networks
Neale Ratzlaff, Fuxin Li Oregon State University 36th ICML 2019
Uncertainty
High predictive accuracy is not sufficient for many tasks. We want to know when our models are uncertain about the data.
Fixing Overconfidence
Given many models, each model behaves differently on outlier data. By averaging their predictions, we can detect anomalies.
[Figure: Model 1 … Model N each produce a prediction; the averaged prediction has low confidence on an outlier]
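The averaging step above can be sketched as follows; the `softmax` helper and the three-model example are illustrative, not from the slides:

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def ensemble_entropy(logits_per_model):
    """Average each model's softmax output, then score uncertainty
    as the entropy of the averaged prediction."""
    probs = [softmax(l) for l in logits_per_model]
    n, k = len(probs), len(probs[0])
    mean = [sum(p[c] for p in probs) / n for c in range(k)]
    return -sum(p * math.log(p) for p in mean if p > 0)

# Models agree on an in-distribution input: low entropy.
agree = ensemble_entropy([[5.0, 0.0, 0.0]] * 3)
# Models disagree on an outlier: the average is near-uniform, high entropy.
disagree = ensemble_entropy([[5.0, 0.0, 0.0], [0.0, 5.0, 0.0], [0.0, 0.0, 5.0]])
```

When the members disagree completely, the averaged prediction is uniform and the entropy is maximal, which is exactly the low-confidence signal used to flag outliers.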
Fixing Overconfidence
Variational inference gives a model posterior from which we can sample many models. Ensembles of models trained from random starts may also detect outliers.

[Figure: Model 1 … Model N; low confidence flags an outlier]
Regularization is too Restrictive
Learning with VI is restrictive: it cannot model the complex model posterior. Without regularization, our outputs mode-collapse, losing diversity.

[Figure: a Generator maps data to predictions; its weight distribution is too simple]
Implicit Model Distribution
We learn an implicit distribution over network parameters with a GAN. We can instantly generate any number of diverse, fully trained networks.

[Figure: GAN generates networks that map data to predictions]
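A minimal sketch of this idea, assuming a fixed linear "generator" and a toy 2x2 linear classifier (all names and shapes here are illustrative stand-ins, not the paper's architecture):

```python
import random

def sample_network(gen, z):
    """Map a noise vector z through the generator matrix to a flat
    weight vector, then reshape it into a 2x2 classifier weight matrix."""
    flat = [sum(g * zi for g, zi in zip(row, z)) for row in gen]
    return [flat[0:2], flat[2:4]]

def predict(weights, x):
    """Forward pass of the generated linear classifier."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

rng = random.Random(0)
# Stand-in for a trained generator: a random 4x4 linear map.
gen = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(4)]
# Sampling a new network is just one forward pass through the generator.
networks = [sample_network(gen, [rng.gauss(0, 1) for _ in range(4)]) for _ in range(3)]
outputs = [predict(w, [1.0, -1.0]) for w in networks]
```

The point of the sketch is the cost model: once the generator is trained, each additional ensemble member costs one cheap forward pass instead of a full training run.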
Implicit Model Distribution
With a GAN, we can sample many networks instantly. However, with just a Gaussian input, the generated networks tend to be similar.

[Figure: GAN generates networks that map data to predictions]
Mixer Network for Diverse Ensembles
We want to generate diverse ensembles without repeatedly training models. Our novel Mixer transforms the input noise to learn complex structure; the Mixer's outputs are used to generate diverse layer parameters.

[Figure: input noise → Mixer → GAN generators → target network parameters]
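A toy sketch of the Mixer idea, with assumed shapes: one shared noise vector is projected into one latent code per target layer, so the per-layer codes are correlated rather than independent Gaussians, and each code would feed that layer's generator.

```python
import random

def make_mixer(noise_dim, n_layers, code_dim, seed=0):
    """Build a random linear Mixer: one projection matrix per target layer
    (a stand-in for the learned Mixer network)."""
    rng = random.Random(seed)
    return [[[rng.gauss(0, 1) for _ in range(noise_dim)]
             for _ in range(code_dim)]
            for _ in range(n_layers)]

def mix(mixer, z):
    """Transform one shared noise vector z into per-layer latent codes."""
    return [[sum(m * zi for m, zi in zip(row, z)) for row in proj]
            for proj in mixer]

rng = random.Random(1)
mixer = make_mixer(noise_dim=8, n_layers=3, code_dim=4)
codes = mix(mixer, [rng.gauss(0, 1) for _ in range(8)])
```

Because all layer codes are functions of the same `z`, the generated layers can coordinate with each other, which a single Gaussian input fed independently to each generator would not allow.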
Generating Diverse Neural Networks
At every training step we sample a new batch of networks. The diversity given by the Mixer lets us find many different models that solve the target task.

[Figure: Mixer → Generators → (Conv, Conv, Linear) classifier → Prediction]
HyperGAN Training: Full Architecture
We prevent mode collapse by regularizing the Mixer with a Discriminator. We use the target loss to train HyperGAN.

[Figure: Discriminator D on the Mixer's outputs; Mixer → Generators → (Conv, Conv, Linear) classifier → Prediction]
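The combined objective can be sketched as below; the weighting `beta` and the discriminator scores are assumed placeholders, not values from the slides:

```python
import math

def hypergan_loss(task_loss, d_scores_on_codes, beta=1.0):
    """Sketch of the combined objective: the target-task loss on the
    generated networks, plus an adversarial term in which a discriminator D
    pushes the Mixer's latent codes toward the prior to prevent mode
    collapse. d_scores_on_codes are D's probabilities that each code
    came from the prior."""
    adv = -sum(math.log(s) for s in d_scores_on_codes) / len(d_scores_on_codes)
    return task_loss + beta * adv
```

When D is fully fooled (scores of 1.0) the adversarial term vanishes and only the target loss remains; low scores add a penalty that drives the Mixer's codes back toward the prior.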
Weight Diversity
HyperGAN learns diverse weight posteriors beyond simple Gaussians imposed by variational inference
Results - Classification
MNIST 5000: trained on a 5,000-example subset of MNIST. CIFAR-5: a restricted five-class subset of CIFAR-10.
Out of Distribution Experiments
Outlier detection on the CIFAR-10 and MNIST datasets: MNIST vs. notMNIST, and CIFAR (0-4) vs. CIFAR (5-9). Adversarial examples: FGSM and PGD. Our increased diversity allows us to outperform other methods.
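The detection decision itself can be sketched as an entropy threshold on the ensemble-averaged prediction; the threshold value here is an assumed placeholder:

```python
import math

def is_outlier(mean_probs, threshold=0.5):
    """Flag an input as out-of-distribution when the entropy of the
    ensemble-averaged prediction exceeds a chosen threshold."""
    entropy = -sum(p * math.log(p) for p in mean_probs if p > 0)
    return entropy > threshold

# A near-uniform average means the members disagree: likely an outlier.
# A confident average means the members agree: likely in-distribution.
```

In practice the threshold would be tuned on held-out data, or avoided entirely by reporting threshold-free metrics such as AUROC over the entropy scores.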
Conclusion
HyperGAN generates diverse models. It makes few assumptions about the output weight distribution. The method is straightforward and extensible. Come to our poster for more details!