Distributed Deep Learning: Methods and Resources
Sergey Nikolenko
Chief Research Officer, Neuromation Researcher, Steklov Institute of Mathematics at St. Petersburg September 23, 2017, AI Ukraine, Kharkiv
Maxim Prasolov
CEO, Neuromation
Distributed Deep Learning: Methods and Resources Sergey Nikolenko - - PowerPoint PPT Presentation
Distributed Deep Learning: Methods and Resources Sergey Nikolenko Maxim Prasolov Chief Research Officer, Neuromation CEO, Neuromation Researcher, Steklov Institute of Mathematics at St. Petersburg September 23, 2017, AI Ukraine, Kharkiv
Distributed Deep Learning: Methods and Resources
Sergey Nikolenko
Chief Research Officer, Neuromation Researcher, Steklov Institute of Mathematics at St. Petersburg September 23, 2017, AI Ukraine, Kharkiv
Maxim Prasolov
CEO, Neuromation
Outline
for knowledge mining
(in part due to distributed computations)
What is a deep neural network
composition of simple functions implemented by artificial neurons
followed by nonlinearity, but can be anything as long as you can take derivatives
computational graph that computes the loss function for the model
Backpropagation
you take the gradient of the loss function w.r.t. weights
with backpropagation
gradient descent and all of its wonderful modifications, from Nesterov momentum to Adam
FEEDFORWARD NETWORKS CONVOLUTIONAL NETWORKS RECURRENT NETWORKS
Gradient descent is used for all kinds of neural networks
Distributed Deep Learning: The Problem
was the use of GPUs
and optimized for matrix computations
What Can Be Parallel
[pictures from (Black, Kokorin, 2016)]
Examples of data parallelism
then average the results
○ but how often? ○ and what do we do with advanced SGD variants?
○ much more interesting without synchronization ○ but the stale gradient problem
just read and write to shared memory, lock-free; whatever happens, happens
Examples of data parallelism
○ DP on a GPU cluster ○ communication through reduction trees
Model parallelism
○ DP: workers exchange weight updates ○ MP: workers exchange data updates
completely replaced by...
Distributed Learning in TensorFlow
First specify the structure of the cluster: Then assign (parts of) computational graph to workers and weights to parameter servers:
Example of Data Parallelism in TensorFlow
Interesting variations
the time (staleness) to updates
represent and send only u and v
wait-free backprop and hybrid communication
○ experience replay: store experience in replay memory and serve them for learning ○ target Q-network is separate from the Q-network which is learning now, updates are rare
Distributed reinforcement learning
Gorila from DeepMind: everything is parallel and asynchronous
Recap
Distributed deep learning works
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
Chris
NOT ENOUGH LABELED DATA FOR NEURAL NETWORK TRAINING
“BOTTLENECK” OF AUTOMATION OF EVERY INDUSTRY:
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
IMAGE RECOGNITION IN RETAIL
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
ABOUT 40 BLN IMAGES PER YEAR
TO AUTOMATE THE RETAIL INDUSTRY MUST BE RECOGNIZED ON THE SHELVES
IMAGE RECOGNITION IN RETAIL BY ECR RESEARCH:
OSA HP CONTRACTED NEUROMATION TO PRODUCE LABELED DATA AND TO RECOGNIZE
30% OF THE COST IS COMPUTATIONAL POWER.
MORE THAN 7 MLN EURO REVENUE
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
Chris LABELED PHOTOS ARE REQUIRED TO TRAIN IMAGE RECOGNITION MODELS
MORE THAN
WHERE CAN WE GET THIS HUGE AMOUNT OF LABELED DATA?
Chris
1 MAN = 8 HOURS x 50 IMAGES, x $0.2 PER IMAGE YEARS OF MECHANICAL WORK
1 BLN LABELED PHOTOS = $240 MLN
HAS BEEN MANUAL WORK TILL NOW
WE KNOW HOW TO GENERATE SYNTHETIC LABELED DATA FOR DEEP LEARNING
SYNTHETIC DATA:
A BREAKTHROUGH IN DEEP LEARNING BUT REQUIRES HUGE COMPUTATIONAL POWER TO RENDER DATA AND TRAIN NEURAL NETWORKS
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 per DAY $3-4 per HOUR
The AI industry is ready to pay miners for their computational resources more than they can ever get from mining Ether.
KNOWLEDGE MINING IS MORE PROFITABLE. DEEP LEARNING NEEDS YOUR COMPUTATION POWER!
WE CAN BRIDGE THIS GAP
GPU
x6
GPU
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
BLOCKCHAIN + DEEP LEARNING
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
NEUROMATION PLATFORM
TokenAI
will combine in one place all the components necessary to build deep learning solutions with synthetic data
THE UNIVERSAL MARKETPLACE OF NEURAL NETWORK DEVELOPMENT
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
massive amounts of transactions
politically independent
transparency is required to build trust
NEUROMATION PLATFORM will be extending Etherium with TokenAI.
Blockchain is the only technology that can realistically accomplish this. Extending Ethereum instead of building our own blockchain is an obvious first step.
Neuromation needs to quickly deploy a massive network of computation nodes (converted from crypto miners).
BITCOIN OR ETHER MINING AMAZON DEEP LEARNING
$7-8 USD per DAY $3-4 USD HOUR
DEEP LEARNING RESEARCH GRANTS
industry
WE ARE OPEN FOR COOPERATION
partnerships@neuromation.io
coming)
VAST APPLICATIONS OF SYNTHETIC DATA NEUROMATION LABS:
RETAIL AUTOMATION LAB PHARMA AND BIOTECH LAB synthetic data for: ENTERPRISE AUTOMATION LAB synthetic data for:
Eastern European Retail Market only
revenue
tumors and melanomas)
cameras)
self-driving cars, and industrial robots in virtual environments
supply-chain solutions
(live)
OUR TEAM:
Maxim Prasolov CEO Fedor Savchenko CTO Sergey Nikolenko Chief Research Officer Denis Popov VP of Engineering Constantine Goltsev Investor / Chairman Andrew Rabinovich Adviser Yuri Kundin ICO Compliance Adviser Aleksey Spizhevoi Researcher Esther Katz VP Communication Kiryl Truskovskyi Lead Researcher
OCTOBER 15th, 2017 Presale of TOKENAI starts NOVEMBER, 2017 Public sale starts UNKNOWN DATE Secret cap is reached, and token sale ends in 7 days Jan 1st, 2018 Token sale ends (if secret cap is not reached)
ICO ROADMAP
ICO DETAILS
Jurisdiction: Estonia, EU
* Neuromation is fully compliant with Estonian crowdfunding legislation
KNOWLEDGE MINING - A NEW ERA OF DISTRIBUTED COMPUTING
THANK YOU!
neuromation.io