SLIDE 1
Gonçalo Mordido, Matthijs Van Keirsbilck, Alexander Keller
- neural network quantization/sparsity
○ lower cost: compute, memory, power, bandwidth, ...
- quantization usually requires retraining
- idea: use importance sampling
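The slide only names importance sampling as the idea, so the following is a minimal sketch of one way it could work, not the authors' implementation. It assumes the weights are normalized by their L1 norm into a probability distribution, that indices are sampled in proportion to |w|, and that the signed hit count per weight serves as its integer value. The name mc_quantize and the parameters num_samples and seed are hypothetical.

```python
import numpy as np

def mc_quantize(weights, num_samples, seed=0):
    """Hypothetical sketch: quantize weights to signed integer hit counts.

    Assumption (not stated on the slide): sample indices with probability
    proportional to |w| and use the per-weight hit count as its integer value.
    """
    rng = np.random.default_rng(seed)
    w = np.asarray(weights, dtype=np.float64)
    flat = w.ravel()

    l1 = np.abs(flat).sum()
    if l1 == 0.0:
        # All-zero tensor: nothing to sample.
        return np.zeros(w.shape, dtype=np.int64), 0.0

    # Treat normalized magnitudes as a discrete probability distribution.
    probs = np.abs(flat) / l1

    # Draw indices in proportion to |w| and count hits per weight.
    idx = rng.choice(flat.size, size=num_samples, p=probs)
    counts = np.bincount(idx, minlength=flat.size)

    # Restore the sign; weights that were never hit become exactly zero (sparsity).
    q = (np.sign(flat).astype(np.int64) * counts).reshape(w.shape)

    # q * scale is an unbiased estimate of the original weights.
    scale = l1 / num_samples
    return q, scale

if __name__ == "__main__":
    w = np.array([[0.5, -0.25], [0.0, 0.25]])
    q, scale = mc_quantize(w, num_samples=1000)
    print(q)
    print(q * scale)
```

No retraining is involved in this sketch: a larger num_samples gives a finer approximation at the cost of larger integer values, while a smaller one leaves more weights at zero.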
Instant Quantization of Neural Networks using Monte Carlo Methods
EMC2 Workshop @ NeurIPS 2019
Gonçalo Mordido (Hasso Plattner Institute), Matthijs Van Keirsbilck (NVIDIA), Alexander Keller (NVIDIA)
Motivation and idea