Neuroscience The Study of Neurons and Brains 2 /32 DNN did this - - PowerPoint PPT Presentation

neuroscience
SMART_READER_LITE
LIVE PREVIEW

Neuroscience The Study of Neurons and Brains 2 /32 DNN did this - - PowerPoint PPT Presentation

Flexon: A Flexible Digital Neuron for Efficient Spiking Neural Network Simulations Dayeol Lee , Gwangmu Lee * , Dongup Kwon * , Sunghwa Lee * , Youngsok Kim * , and Jangwoo Kim * * Dept. of Electrical and Computer Engineering, Seoul National


slide-1
SLIDE 1

Flexon: A Flexible Digital Neuron

for Efficient Spiking Neural Network Simulations

Dayeol Lee†, Gwangmu Lee*, Dongup Kwon*, Sunghwa Lee*, Youngsok Kim*, and Jangwoo Kim*

*Dept. of Electrical and Computer Engineering, Seoul National University †Dept. of Electrical Engineering and Computer Sciences, University of California, Berkeley

slide-2
SLIDE 2

2/32

Neuroscience

The Study of Neurons and Brains Recognition

Object detection, Classification Morality, Social Value Parkinson's Disease, CJD, Dementia

Degeneration Consciousness Emotion

Sympathy, Happiness, Apathy

slide-3
SLIDE 3

3/32

Neuroscience

The Study of Neurons and Brains Recognition

Object detection, Classification Morality, Social Value Parkinson's Disease, CJD, Dementia

Degeneration Consciousness Emotion

Sympathy, Happiness, Apathy DNN did this

Unexplored yet

slide-4
SLIDE 4

4/32

Spike Gen. Spike Gen.

for each time-step:

Neuron Spike Generator Time-step 1 Neuron

Modeling a Brain: Spiking Neural Network

How to compute spiking neural network?

Synapse

slide-5
SLIDE 5

5/32

Spike Gen. Spike Gen.

for each time-step: 1) stimulus generation

Neuron Spike Generator Time-step 1 Neuron

Modeling a Brain: Spiking Neural Network

How to compute spiking neural network?

Synapse

slide-6
SLIDE 6

6/32

Spike Gen. Spike Gen.

for each time-step: 1) stimulus generation 2) neuron computation

Neuron Spike Generator Time-step 1 Neuron

Modeling a Brain: Spiking Neural Network

How to compute spiking neural network?

Synapse

slide-7
SLIDE 7

7/32

Spike Gen. Spike Gen.

Neuron Spike Generator Time-step 1 Neuron

Modeling a Brain: Spiking Neural Network

How to compute spiking neural network?

Synapse

for each time-step: 1) stimulus generation 2) neuron computation 3) synapse calculation …

slide-8
SLIDE 8

8/32

10 Representative Benchmarks on CPU/GPU

CPU: Intel Xeon E5-2630 v4 CPU (12-core, 2.2 GHz) / GPU: NVIDIA Titan X (Pascal) GPU

Where Does Time Go?

~50% of overheads coming from

Neuron Computation

slide-9
SLIDE 9

9/32 Membrane Voltage

(millivolt scale)

Threshold Resting Potential Time

(millisecond scale) Neuron

Biological Neuron

How Does a Neuron Behave?

slide-10
SLIDE 10

10/32 Membrane Voltage

(millivolt scale)

Threshold Resting Potential Time

(millisecond scale) Neuron

Biological Neuron

How Does a Neuron Behave?

slide-11
SLIDE 11

11/32 Membrane Voltage

(millivolt scale)

Threshold Resting Potential Time

(millisecond scale) Neuron

Biological Neuron

How Does a Neuron Behave?

slide-12
SLIDE 12

12/32 Membrane Voltage

(millivolt scale)

Threshold Resting Potential Time

(millisecond scale) Neuron

Biological Neuron

How Does a Neuron Behave?

slide-13
SLIDE 13

13/32

Axon Dendrite Soma

Biological Neuron

Time

(millisecond scale)

Membrane Voltage

(millivolt scale)

Spike initiation

Tons of variants exist, depending on their feature set.

Various Neuron Behaviors

Input spike accumulation Membrane decay Spike inhibition

We need to support various features for accurate brain simulations.

slide-14
SLIDE 14

14/32

Custom Hardware Framework Software Simulation

Solutions and Limitations

Flexibility Accuracy High Performance Low Energy

slide-15
SLIDE 15

15/32

Design Goals & Key Ideas High Performance Goal 1 High Flexibility Goal 2 Low cost Goal 3

Hardware-based Feature-driven Spatially-folded

slide-16
SLIDE 16

16/32

Neuron Feature #1: Input Spike Accumulation

Current-based Conductance-based

(Exponential-shaped)

Conductance-based

(Alpha function-shaped)

slide-17
SLIDE 17

17/32

Neuron Feature #1: Input Spike Accumulation

Current-based Conductance-based

(Exponential-shaped)

Conductance-based

(Alpha function-shaped)

slide-18
SLIDE 18

18/32

Neuron Feature #2: Spike Initiation

Quadratic

(with feature) (without feature)

Exponential

slide-19
SLIDE 19

19/32

Neuron Feature #3: Spike-triggered Current

Adaptation Sub-threshold Oscillation

(without) (with) (without) (with)

slide-20
SLIDE 20

20/32

Neuron Feature #4: Refractory Period

Absolute Relative

(without) (with) (without) (with)

slide-21
SLIDE 21

21/32

(without) (with) (without) (with)

Neuron Feature: Flexible Feature Support

Absolute Relative

“Feature-driven” Flexon

supports 11 major neuron models

(LLIF, SLIF, DSRM0, DLIF, QIF, EIF, Izhikevich, AdEx, …)

slide-22
SLIDE 22

22/32

1 10 100 1000 Geomean Nowotny et al. Izhikevich Geomean Vogels et al. Vogels-Abbott Potjans-Diesmann Muller et al. Destexhe-UpDown Destexhe-LTS Brunel Brette et al. GPU CPU

Speed-up

(Normalized to the baseline)

8 CPU + 2 GPU Representative Benchmarks

Flexon: TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM)

87.4x

Over CPU

8.19x

Over GPU

Evaluation (12x Feature-driven Design)

CPU

Intel Xeon E5-2630 v4 (12-core, 2.2 GHz)

GPU

NVIDIA Titan X (Pascal)

slide-23
SLIDE 23

23/32

1 10 100 1000 10000 100000 Geomean Nowotny et al. Izhikevich Geomean Vogels et al. Vogels-Abbott Potjans-Diesmann Muller et al. Destexhe-UpDown Destexhe-LTS Brunel Brette et al. GPU CPU

Energy Efficiency

(Normalized to the baseline)

8 CPU + 2 GPU Representative Benchmarks

Flexon: TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM)

6,186x

Over CPU

422x

Over GPU

Evaluation (12x Feature-driven Design)

CPU

Intel Xeon E5-2630 v4 (12-core, 2.2 GHz)

GPU

NVIDIA Titan X (Pascal)

slide-24
SLIDE 24

24/32

Intrinsic Space-inefficiency

(without) (with) (without) (with)

Absolute Relative

“Feature-driven” Flexon

supports 11 major neuron models

(LLIF, SLIF, DSRM0, DLIF, QIF, EIF, Izhikevich, AdEx, …)

Lots of redundant units

(multiplier, adder, …)

slide-25
SLIDE 25

25/32

Constructing Spatially-folded Flexon

Conductance-based

(Exponential)

Quadratic Adaptation

Spatially-folded design è reduce area

  • Remove redundant MAC operators
slide-26
SLIDE 26

26/32

Constructing Spatially-folded Flexon

Exponential Relative

(ADT: Adaptation)

  • Remove redundant MAC operators

Modifications from the baseline

  • 2-stage pipeline, multi-cycle implementation

Spatially-folded design è reduce area

slide-27
SLIDE 27

27/32

Constructing Spatially-folded Flexon “Spatially-folded” Flexon

supports various major neuron models

6x area saving

  • Remove redundant MAC operators

What we should change

  • 2-stage pipeline, multi-cycle implementation

Spatially-folded design è reduce area

slide-28
SLIDE 28

28/32

1 10 100 1000 Geomean Nowotny et al. Izhikevich Geomean Vogels et al. Vogels-Abbott Potjans-Diesmann Muller et al. Destexhe-UpDown Destexhe-LTS Brunel Brette et al. GPU CPU

Speed-up

(Normalized to the baseline)

8 CPU + 2 GPU Representative Benchmarks

Flexon: TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM)

122x

Over CPU

9.83x

Over GPU

Evaluation (72x Spatially-folded Design)

CPU

Intel Xeon E5-2630 v4 (12-core, 2.2 GHz)

GPU

NVIDIA Titan X (Pascal) Feature-driven Spatially-folded

slide-29
SLIDE 29

29/32

1 10 100 1000 10000 100000 Geomean Nowotny et al. Izhikevich Geomean Vogels et al. Vogels-Abbott Potjans-Diesmann Muller et al. Destexhe-UpDown Destexhe-LTS Brunel Brette et al. GPU CPU

Energy Efficiency

(Normalized to the baseline)

8 CPU + 2 GPU Representative Benchmarks

Flexon: TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM)

Evaluation (72x Spatially-folded Design)

CPU

Intel Xeon E5-2630 v4 (12-core, 2.2 GHz)

GPU

NVIDIA Titan X (Pascal)

5,413x

Over CPU

135x

Over GPU

Feature-driven Spatially-folded

slide-30
SLIDE 30

30/32

Baseline Flexon vs. Spatially-folded Flexon

  • Baseline “Feature-driven” Flexon

− Fast: 87.4x over CPUs, 8.19x over GPUs − Energy-efficient: 6,186x over CPUs, 422x over GPUs

  • “Spatially-folded” Flexon

− Fast: 122x over CPUs, 9.83x over GPUs − Energy-efficient: 5,413x over CPUs, 135x over GPUs

slide-31
SLIDE 31

31/32

Conclusion

  • Flexon is a flexible feature-driven digital neuron design,

capable of realizing various major neuron models.

− Flexible & power-efficient (6,186x over CPU)

  • Spatially-folded Flexon makes features share units,

reducing 6x circuit area.

− Flexible & fast when integrated (122x over CPU)

slide-32
SLIDE 32

32/32

Thank you for listening

Flexon

A Flexible Digital Neuron for Efficient Spiking Neural Network Simulations