Discrete event-based neural simulation using the SpiNNaker system - PowerPoint PPT Presentation

Discrete event-based neural simulation using the SpiNNaker system Andrew Brown Jeff Reeve University of Southampton University of Southampton adb@ecs.soton.ac.uk jsr@ecs.soton.ac.uk Kier Dugan Steve Furber University of Southampton University of Manchester kjd1v07@ecs.soton.ac.uk steve.furber@manchester.ac.uk CPA'15 Kent 1 24 August 2015

What is SpiNNaker? • It is not : Just another massive parallel machine 64k:32k D:I memory, 64k:32k D:I memory, 64k:32k D:I memory, • It is : 1,000,000 1,000,000 1,000,000 ARM 9, no floating point ARM 9, no floating point ARM 9, no floating point A large number of relatively small cores embedded in a powerful bespoke hardware communication fabric Bisection bandwidth Bisection bandwidth 250 Gb/s 250 Gb/s CPA'15 Kent 2 24 August 2015

Bio-inspiration: BIMPA • How can massively parallel computing resources accelerate our understanding of brain function? • How can our growing understanding of brain function point the way to more efficient parallel, fault-tolerant computation? CPA'15 Kent 3 24 August 2015

Outline • The SpiNNaker system • Configuration • Time models itself • Neural simulation CPA'15 Kent 4 24 August 2015

Machine architecture • 1 engine = • Triangular 256x256 toroid = mesh of nodes 65536 nodes • 1 node = 18 cores + comms + 128M SDRAM • 1 core = ARM9 + 64k DTCM + 32k ITCM CPA'15 Kent 5 24 August 2015

A Spinnaker node • 6 bi-directional comms links • Core farm • (1 monitor) • System... • NoC • RAM • Watchdogs • Off-die SDRAM CPA'15 Kent 6 24 August 2015

102 machine 18 cores CPA'15 Kent 7 24 August 2015

Physical construction 103 machine 48 nodes: 48 nodes x 18 cores = 864 cores CPA'15 Kent 8 24 August 2015

Physical construction 104 machine 24 boards: 24 boards x 48 nodes x 18 cores = 20736 cores CPA'15 Kent 9 24 August 2015

Physical construction 105 machine 5 racks: 5 racks x 24 boards x 48 nodes x 18 cores = 103680 cores CPA'15 Kent 10 24 August 2015

…and the machine yet to be assembled: 104 machine:20,736 cores, 1 rack, ~1900W 103 machine: 864 cores, 1 PCB, ~75W (24 PCBs, operation without aircon) 105 machine: 103,680 cores, 1 cabinet, ~9kW 106 machine: 1M cores, 10 cabinets, ~90kW CPA'15 Kent 11 24 August 2015

Scalable system ... ... arbitrary topology • We like tori • But the node topology is almost arbitrary CPA'15 Kent 12 24 August 2015

A conventional multi- processor program: MPI farm (or similar) Myranet (or similar) Problem: represented as a network of programs with a certain behaviour... ...compile, link... ...binary files ...embodied as data loaded into structures and instruction algorithms in code... memory... Interface presented to the application is a homogenous set of processes of arbitrary size; Messages addressed at process can talk to process by runtime from arbitrary messages under application process to arbitrary process software control CPA'15 Kent 14 24 August 2015

...and you might reasonably expect: • Blocking and non-blocking send/receive • Probing the queues • Broadcasting • Scatter-gather • Parallel I/O • Remote memory access • Dynamic process management CPA'15 Kent 15 24 August 2015

On SpiNNaker... • The problem ( C ircuit u nder S imulation) is defined as a graph • Torn into two components: – CuS topology • Embodied as hardware route tables in the nodes – Circuit device behaviour • Embodied as software event handlers running on cores CPA'15 Kent 16 24 August 2015

On SpiNNaker: ...abstract problem topology... ...problem topology loaded into firmware routing tables... Problem: represented as a network of devices with a certain behaviour... ...problem is split into two parts... ...compile, link... Messages launched at runtime take a path defined by ...binary files loaded into ...behaviour of each device the firmware core instruction memory... embodied as an interrupt router handler in code... The code says "send message" but has no control where the output message goes - the route tables in each node decide CPA'15 Kent 17 24 August 2015

OS, S/W environment • What you expect: • What each handler gets: – Read access to 72 bits of the – File I/O packet that woke it – Console output – Knowledge of incoming port (0..5) - not very useful – Memory management – I/O to its own memory map – Interactive debug – Ability to send packets – Libraries – Knowledge of local node and core identifier – The time – Coarse interval signal And that's all, folks CPA'15 Kent 18 24 August 2015

SpiNNaker configuration Offline configuration software maps neurons:cores (~1000:1) Maps each individual neuron to a SpiNNaker core Defines the router tables for each node Connectivity of neural topology is 1000 neurons per processor distributed throughout the system in the routing tables Defines the index structures necessary in each core to allow fast retrieval of neuron and synapse state Defines the packet handling code (interrupt handlers) CPA'15 Kent 19 24 August 2015

SpiNNaker configuration Biology Neurons communicate via spikes traveling along axons/dendrites Cores (and hence the neuron models resident within them) communicate via 72-bit hardware packets traveling through the routing structure, hopping from node to node as directed by the routing tables in each node SpiNNaker CPA'15 Kent 20 24 August 2015

Event handlers? Interrupts? • Packet arrives at a core: – Hardware invokes an interrupt handler • Tied to a neuron – Handler modifies neuron state • May/may not launch packets as a consequence • Handlers are tiny ; they execute ; they stop And that's all you have to play with CPA'15 Kent 21 24 August 2015

What exactly is a packet? – Hardware • Fixed bit length – Address event representation (AER) – Packets delivered from source neuron to target neuron • Source node address|source core address|source neuron address – Physical route embodied in route tables • Distributed CPA'15 Kent 22 24 August 2015

Time Biology: Axonal delay O (ms) – Neuron processing time O (ms) – fn(biological geometry) fn(biology & state(history)) CPA'15 Kent 24 24 August 2015

Time Axonal delay stored as parameter in synapse state local to neuron model Neuron-core mapping – fn(graph mapping software) Neuron-neuron wallclock delay maximum O (10us) – fn(graph mapping, traffic density & engine size) Node-node wallclock hop delay O (100ns) – fn(graph mapping & traffic density) SpiNNaker CPA'15 Kent 25 24 August 2015

There are different sorts of interrupts • Each core – Packet handling interrupt • Invoked by incoming packet • Each node – Biological clock tick handling interrupt – Clocks are not phase locked – Slow O (kHz) – '(Biological) time is passing' signal – Asserted on every core CPA'15 Kent 26 24 August 2015

Back to biology B A • A fires when it fires • Pulse propagates to B No synchronising clock • Arrives when it arrives Event driven • B integrates incoming pulse(s) Data push • Fires when it fires CPA'15 Kent 27 24 August 2015

Back to SpiNNaker In parallel with (and not synchronised to) this: Biological clock ticks B Triggers an interrupt with each tick A A fires when it fires Launches a packet to B Arrives O (us) later Triggers 'packet arrived' interrupt CPA'15 Kent 28 24 August 2015

A closer look at the interrupt handlers Packet arrival handler Clock tick handler Increment age of buffered packets; If any 'arrived' (age == Remove packet from synapse delay), assert router; onto neuron state Store in buffer in equations; synapse (age = 0) Integrate (one timestep) neuron state equations CPA'15 Kent 29 24 August 2015

Neural simulation sn Individual message s2 frequencies < real- time clock s1 Superposition of all inputs: exact timing = Σ s fn(neuron:core) i.e. independent of CuS (bad) BUT message latency << CuS time constants (so it doesn't matter) clock Change of neuron state derived locally, stored until next (biological) timestep Change of neuron state broadcast (or not) at next (biological) timestep CPA'15 Kent 30 24 August 2015

And this works because: • Biological wallclock time modelled locally at each node – (and thus each neuron modelled within it) • At each time tick – Inputs added if age suitable – Equations integrated – States updated • Wallclock packet transit delay is negligible and ignored • Biological delay captured in target synaptic model state • Differential equations controlling neuron model behaviour are not stiff – All time constants >> biological clock tick – Forward Euler / Runge/Kutta stable CPA'15 Kent 31 24 August 2015

Limitations • SpiNNaker designed to operate in real time – Simulation 'speed' a hard metric to interpret • Communication via hardware packets – 16 bits/node => 65536 nodes/machine – 4 bits/core => 16 cores/node – 10 bits/neuron => 1024 neurons/core • Hard limit of 1,073,741,825 neurons CPA'15 Kent 32 24 August 2015

Discrete event-based neural simulation using the SpiNNaker system - PowerPoint PPT Presentation

Discrete event-based neural simulation using the SpiNNaker system Andrew Brown Jeff Reeve University of Southampton University of Southampton adb@ecs.soton.ac.uk jsr@ecs.soton.ac.uk Kier Dugan Steve Furber University of Southampton

Stochastic Simulation Discrete simulation/event-by-event Bo Friis Nielsen Institute of

Stochastic Simulation Discrete simulation/event-by-event Bo Friis Nielsen Institute of

Simulation Discrete event systems Discrete event simulation 1 Consider systems with finitely

Discrete Event Simulation And Discrete Event Simulation And Evaluation Of A Firewall Aware

Chapter 3 General Principles in Simulation Banks, Carson, Nelson & Nicol Discrete-Event

Chapter 2 Simulation Examples Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Discrete-Event Systems and Generalized Semi-Markov Processes Discrete-Event Systems and

Discrete event simulation Prof.dr.ir. Alexander Verbraeck Professor, Faculty of TPM, TU Delft

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Unusual Suspects Unusual Suspects of of Amyotrophic Lateral Sclerosis (ALS) Amyotrophic

Neural Decoding Matthias Hennig School of Informatics, University of Edinburgh January 2019 0

nervous system: : Modelling tadpole lo locomotor behaviour in in response to sensory ry sig

Lecture 27: Neural Networks and Deep Learning Mark Hasegawa-Johnson April 6, 2020 License:

The Muscular System II PSK 4U NGDHS S. Kelly The Neuromuscular System Simply put: the

Electric Linear Slides EAS AR Equipped Electric Cylinders AR EAC DRL Hollow Rotary

Fields, Units, Magnetostatics European School on Magnetism Laurent Ranno

Electric Linear Slides Electric Cylinders AZ EAC AZ DRS2 Hollow Rotary Actuators AZ DG

Discrete event-based neural simulation using the SpiNNaker system - PowerPoint PPT Presentation

Discrete event-based neural simulation using the SpiNNaker system Andrew Brown Jeff Reeve University of Southampton University of Southampton adb@ecs.soton.ac.uk jsr@ecs.soton.ac.uk Kier Dugan Steve Furber University of Southampton

Stochastic Simulation Discrete simulation/event-by-event Bo Friis Nielsen Institute of

Stochastic Simulation Discrete simulation/event-by-event Bo Friis Nielsen Institute of

Simulation Discrete event systems Discrete event simulation 1 Consider systems with finitely

Discrete Event Simulation And Discrete Event Simulation And Evaluation Of A Firewall Aware

Chapter 3 General Principles in Simulation Banks, Carson, Nelson &amp; Nicol Discrete-Event

Chapter 2 Simulation Examples Banks, Carson, Nelson &amp; Nicol Discrete-Event System Simulation

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Discrete-Event Systems and Generalized Semi-Markov Processes Discrete-Event Systems and

Discrete event simulation Prof.dr.ir. Alexander Verbraeck Professor, Faculty of TPM, TU Delft

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation g y Dr.

Unusual Suspects Unusual Suspects of of Amyotrophic Lateral Sclerosis (ALS) Amyotrophic

Neural Decoding Matthias Hennig School of Informatics, University of Edinburgh January 2019 0

nervous system: : Modelling tadpole lo locomotor behaviour in in response to sensory ry sig

Lecture 27: Neural Networks and Deep Learning Mark Hasegawa-Johnson April 6, 2020 License:

The Muscular System II PSK 4U NGDHS S. Kelly The Neuromuscular System Simply put: the

Electric Linear Slides EAS AR Equipped Electric Cylinders AR EAC DRL Hollow Rotary

Fields, Units, Magnetostatics European School on Magnetism Laurent Ranno

Electric Linear Slides Electric Cylinders AZ EAC AZ DRS2 Hollow Rotary Actuators AZ DG

Chapter 3 General Principles in Simulation Banks, Carson, Nelson & Nicol Discrete-Event

Chapter 2 Simulation Examples Banks, Carson, Nelson & Nicol Discrete-Event System Simulation