ORNL is managed by UT-Battelle for the US Department of Energy
Adapting DL to New Data: An Evolutionary Algorithm for Optimizing Deep Networks
Steven R. Young Research Scientist Oak Ridge National Laboratory
Adapting DL to New Data: An Evolutionary Algorithm for Optimizing - - PowerPoint PPT Presentation
Adapting DL to New Data: An Evolutionary Algorithm for Optimizing Deep Networks Steven R. Young Research Scientist Oak Ridge National Laboratory ORNL is managed by UT-Battelle for the US Department of Energy Overview Deep Learning in
ORNL is managed by UT-Battelle for the US Department of Energy
Adapting DL to New Data: An Evolutionary Algorithm for Optimizing Deep Networks
Steven R. Young Research Scientist Oak Ridge National Laboratory
2 Adapting DL to New Data
Overview
3 Adapting DL to New Data
Deep Learning for Science Applications
Commercial Applications Science Applications
Object Recognition Face Recognition
State of the Art Results Characteristics
Challenging New Domains
Material Science High Energy Physics
Characteristics
4 Adapting DL to New Data
Problem: Adaptability Challenge
neural network that performs ideally with that data
parameters) for a particular data set ?
1. Pick some deep learning software (Caffe, Torch, Theano, etc) 2. Design a set of parameters that defines your deep learning network 3. Try it on your data 4. If it doesn’t work as well as you want, go back to step 2 and try again.
5 Adapting DL to New Data
The Challenge
Pooling Convolutional Input Output Fully Connected Pooling Convolutional Output Fully Connected Pooling Convolutional Learning Rate Batch Size Momentum Weight Decay
Deep Learning Toolbox
6 Adapting DL to New Data
Deep Learning Toolbox
The Challenge
Convolutional Input Convolutional Output Learning Rate Batch Size Momentum Weight Decay
7 Adapting DL to New Data
Hyper-parameter Selection
– Requires domain knowledge
– Exponential growth with high-dimensional hyper-parameter space – Doesn’t exploit low effective dimension for discovery
– By itself, not adaptive (no use of information from previous experiments)
8 Adapting DL to New Data
What can we do with Titan?
9 Adapting DL to New Data
MENNDL: Multi-node Evolutionary Neural Networks for Deep Learning
parameter space for deep learning
– Focus on Convolutional Neural Networks – Evolve only the topology with EA; typical SGD training process – Generally: Provide scalability and adaptability for many data sets and compute platforms
– Next generation, Summit, will have increased GPU capability
– Climate science, material science, physics, etc.
10 Adapting DL to New Data
Designing the Genetic Code
with sets of genes
– Fixed width set of genes corresponds to a layer
– Restrict layer types based on section
types
…
Population – Group of Networks Individual - Network Feature Layers Parameters
… …
Classification Layers
… …
11 Adapting DL to New Data
MENNDL: Communication
Genetic Algorithm Master Gene: Population Network Parameters Network 1 Parameters, Model Predictions Performance Metrics Network 2 Parameters, Model Predictions Performance Metrics Network N Parameters, Model Predictions Performance Metrics Fitness Metrics: Accuracy Worker (one per node)
MPI
12 Adapting DL to New Data
Hyper-parameter Values vs Performance
code that changes all possible parameters (e.g., # of layers, layer types, etc)
Accuracy
Evolved
13 Adapting DL to New Data
MINERvA
14 Adapting DL to New Data
MINERvA Vertex Segment Classification
Goal: Classify which segment the vertex is located in. Challenge: Events can have very different characteristics.
15 Adapting DL to New Data
Benefit of Parallelization
MINERvA dataset
12 hours 6 hours 2 hours
16 Adapting DL to New Data
Unusual layers
17 Adapting DL to New Data
Unusual Layers (limited training examples)
18 Adapting DL to New Data
Current Status
19 Adapting DL to New Data
Acknowledgements
Chicago)
Isele (University of Pennsylvania)
Derek Rose (ORNL)
20 Adapting DL to New Data
Questions