Silicon Kernel Learning “Machines”

Gert Cauwenberghs, Johns Hopkins University, gert@jhu.edu
520.776 Learning on Silicon
http://bach.ece.jhu.edu/gert/courses/776


SLIDE 1

Silicon Kernel Learning “Machines”

Gert Cauwenberghs

Johns Hopkins University, gert@jhu.edu

http://bach.ece.jhu.edu/gert/courses/776

SLIDE 2

Silicon Kernel Learning “Machines” OUTLINE

  • Introduction

– Kernel Machines and array processing
– Template-based pattern recognition

  • Kerneltron

– Support vector machines: learning and generalization
– Modular vision systems
– CID/DRAM internally analog, externally digital array processor
– On-line SVM learning

  • Applications

– Example: real-time biosonar target identification

SLIDE 3

Massively Parallel Array Kernel “Machines”

  • “Neuromorphic”

– distributed representation
– local memory and adaptation
– sensory interface
– physical computation
– internally analog, externally digital

  • Scalable

throughput scales linearly with silicon area

  • Ultra Low-Power

factor 100 to 10,000 less energy than CPU or DSP

Example: VLSI Analog-to-digital vector quantizer (Cauwenberghs and Pedroni, 1997)

SLIDE 4

Acoustic Transient Processor (ATP)

with Tim Edwards and Fernando Pineda

[Figure: time-frequency template; input from cochlea; axes: time and frequency]

– Models time-frequency tuning of an auditory cortical cell (S. Shamma)
– Programmable template (matched filter) in time and frequency
– Operational primitives: correlate, shift and accumulate
– Algorithmic and architectural simplifications reduce complexity to one bit per cell, implemented essentially with a DRAM or SRAM at high density...

SLIDE 5

Acoustic Transient Processor (ATP) Cont’d...

Algorithmic and Architectural Simplifications (1)

– Channel differenced input, and binarized {-1,+1} template values, give essentially the same performance as infinite resolution templates.
– Correlate and shift operations commute, implemented with one shift register only.

SLIDE 6

Acoustic Transient Processor (ATP) Cont’d...

Algorithmic and Architectural Simplifications (2)

– Binary {-1,+1} template values can be replaced with {0,1} because of normalized inputs.
– Correlation operator reduces to a simple one-way (on/off) switching element per cell.
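The {-1,+1} to {0,1} substitution works because, for normalized inputs (Σ_n x_n constant), the two correlations differ only by a fixed offset. A minimal numerical check (all values synthetic):

```python
import numpy as np

rng = np.random.default_rng(0)
N = 16                      # frequency channels (arbitrary size)
x = rng.random(N)
x = x / x.sum()             # normalized input: sum(x) = 1
b = rng.choice([-1, 1], N)  # binary {-1,+1} template
w = (b + 1) // 2            # equivalent {0,1} template

# b.x = 2*w.x - sum(x) = 2*w.x - 1: same decision up to a constant offset
assert np.isclose(b @ x, 2 * (w @ x) - 1.0)
```

The on/off template thus preserves the matched-filter ranking of inputs while halving the switching circuitry per cell.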

SLIDE 7

Acoustic Transient Processor (ATP) Cont’d...

Algorithmic and Architectural Simplifications (3)

– Channel differencing can be performed in the correlator, rather than at the input. The cost seems like a factor of two in complexity. Not quite:
  • Analog input is positive, simplifying correlation to single-quadrant, implemented efficiently with current-mode switching circuitry.
  • Shift-and-accumulate is differential.

SLIDE 8

Acoustic Transient Processor (ATP)

Memory-Based Circuit Implementation

[Circuit diagram: shift-and-accumulate and correlation stages]

SLIDE 9

Acoustic Transient Processor (ATP)

with Tim Edwards and Fernando Pineda

– 2.2mm X 2.2mm in 1.2µm CMOS
– 64 time X 16 frequency bins
– 30 µW power at 5V

[Figures: “Can” template; calculated and measured “Can” and “Snap” responses; 64 time X 16 freq shift-accumulate correlation]

SLIDE 10

Generalization and Complexity

– Generalization is the key to supervised learning, for classification or regression.
– Statistical Learning Theory offers a principled approach to understanding and controlling generalization performance.
  • The complexity of the hypothesis class of functions determines generalization performance.
  • Support vector machines control complexity by maximizing the margin of the classified training data.

SLIDE 11

Kernel Machines

Feature-space map:  X = Φ(x),  X_i = Φ(x_i)

y = sign( Σ_{i∈S} α_i y_i Φ(x_i)·Φ(x) + b )

Inner products in feature space:  X_i · X = Φ(x_i)·Φ(x)

y = sign( Σ_{i∈S} α_i y_i X_i·X + b )

A kernel K(·,·) satisfying Mercer’s Condition implicitly computes the feature-space inner product:

K(x_i, x) = Φ(x_i)·Φ(x)

y = sign( Σ_{i∈S} α_i y_i K(x_i, x) + b )

Mercer, 1909; Aizerman et al., 1964
Boser, Guyon and Vapnik, 1992
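The kernel-expansion classifier on this slide takes only a few lines to sketch; the toy data, the linear kernel choice, and all names below are illustrative:

```python
import numpy as np

def svm_decision(x, support_x, alpha, y, b, kernel):
    """Kernel-machine classifier: sign(sum_{i in S} alpha_i y_i K(x_i, x) + b)."""
    s = sum(a * yi * kernel(xi, x) for a, yi, xi in zip(alpha, y, support_x))
    return np.sign(s + b)

# toy example with a linear kernel K(u, v) = u.v; data is made up
linear = lambda u, v: float(np.dot(u, v))
support_x = [np.array([1.0, 0.0]), np.array([-1.0, 0.0])]
alpha, y, b = [1.0, 1.0], [+1, -1], 0.0
print(svm_decision(np.array([0.5, 0.3]), support_x, alpha, y, b, linear))  # 1.0
```

Only the support vectors (the set S) enter the sum, which is what the Kerneltron array evaluates in parallel.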

SLIDE 12

Some Valid Kernels

Boser, Guyon and Vapnik, 1992

– Polynomial (Splines etc.):  K(x_i, x) = (1 + x_i·x)^ν
– Gaussian (Radial Basis Function Networks):  K(x_i, x) = exp( −‖x − x_i‖² / 2σ² )
– Sigmoid (Two-Layer Perceptron):  K(x_i, x) = tanh( L x_i·x + k ),  only for certain L, k

[Figure: two-layer network: inputs x_1, x_2, … feed kernel units weighted by α_1 y_1, α_2 y_2, …, combined through sign to produce y]
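The three kernels listed above are straightforward to express directly; the parameter values here (ν, σ, L, k) are arbitrary examples, not taken from the slides:

```python
import numpy as np

def poly_kernel(xi, x, nu=2):
    """Polynomial kernel K(x_i, x) = (1 + x_i.x)^nu."""
    return (1.0 + np.dot(xi, x)) ** nu

def gauss_kernel(xi, x, sigma=1.0):
    """Gaussian (RBF) kernel K(x_i, x) = exp(-||x - x_i||^2 / 2 sigma^2)."""
    return np.exp(-np.sum((x - xi) ** 2) / (2.0 * sigma ** 2))

def sigmoid_kernel(xi, x, L=0.5, k=-1.0):
    """Sigmoid kernel; satisfies Mercer's condition only for certain L, k."""
    return np.tanh(L * np.dot(xi, x) + k)

u = np.array([1.0, 2.0])
print(gauss_kernel(u, u))  # 1.0: Gaussian kernel of a point with itself
```

Any of these can be passed as the `kernel` argument of a generic decision function, since all three reduce to operations on the inner product or distance of the two arguments.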

SLIDE 13

Trainable Modular Vision Systems: The SVM Approach

Papageorgiou, Oren, Osuna and Poggio, 1998

– Strong mathematical foundations in Statistical Learning Theory (Vapnik, 1995)
– The training process selects a small fraction of prototype support vectors from the data set, located at the margin on both sides of the classification boundary (e.g., barely faces vs. barely non-faces)

SVM classification for pedestrian and face object detection
SLIDE 14

Trainable Modular Vision Systems: The SVM Approach

Papageorgiou, Oren, Osuna and Poggio, 1998

– The number of support vectors and their dimensions, in relation to the available data, determine the generalization performance
– Both training and run-time performance are severely limited by the computational complexity of evaluating kernel functions

ROC curve for various image representations and dimensions

SLIDE 15

Scalable Parallel SVM Architecture

– Full parallelism yields very large computational throughput
– Low-rate input and output encoding reduces bandwidth of the interface

SLIDE 16

The Kerneltron: Support Vector “Machine”

Genov and Cauwenberghs, 2001

[Chip: 512 X 128 CID/DRAM array, 128 ADCs]

  • 512 inputs, 128 support vectors
  • 3mm X 3mm in 0.5um CMOS
  • Fully parallel operation using “computational memories” in hybrid DRAM/CCD technology
  • Internally analog, externally digital
  • Low bit-rate, serial I/O interface
  • Supports functional extensions on SVM paradigm

SLIDE 17

Mixed-Signal Parallel Pipelined Architecture

– Externally digital processing and interfacing

  • Bit-serial input, and bit-parallel storage of matrix elements
  • Digital output is obtained by combining quantized partial products
SLIDE 18

CID/DRAM Cell and Analog Array Core

[Figure: measured linearity of parallel analog summation, with all “1” stored vs. all “0” stored; input shifted serially]

– Internally analog computing
  • Computational memory integrates DRAM with CID

SLIDE 19

Feedthrough and Leakage Compensation

in an extendable multi-chip architecture

Ideal row output:

Y_m^(i,j) = Σ_{n=0}^{N−1} w_mn^(i) x_n^(j)

With feedthrough/leakage fraction ε, the measured output picks up a weight-independent error term:

Ỹ_m^(i,j) = (1 + ε) Σ_{n=0}^{N−1} w_mn^(i) x_n^(j) + ε Σ_{n=0}^{N−1} (1 − w_mn^(i)) x_n^(j)
          = Σ_{n=0}^{N−1} w_mn^(i) x_n^(j) + ε Σ_{n=0}^{N−1} x_n^(j)

The common term ε Σ_n x_n^(j) is identical for all rows and can be subtracted.
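A numerical illustration of the compensation (ε and all data are synthetic): the feedthrough term depends only on the input, not on the stored weights, so measuring it once suffices for every row.

```python
import numpy as np

rng = np.random.default_rng(1)
N, eps = 64, 0.03            # eps: feedthrough/leakage fraction (assumed)
w = rng.integers(0, 2, N)    # stored binary weights w_mn
x = rng.random(N)            # input bit amplitudes x_n

Y_ideal = w @ x
Y_meas = (1 + eps) * (w @ x) + eps * ((1 - w) @ x)
# (1+eps) w.x + eps (1-w).x = w.x + eps*sum(x): weight-independent error
assert np.isclose(Y_meas, Y_ideal + eps * x.sum())

# compensate by subtracting the common term (e.g. from a reference row)
Y_comp = Y_meas - eps * x.sum()
assert np.isclose(Y_comp, Y_ideal)
```

This is why the scheme extends to multi-chip arrays: the correction is one shared quantity per input vector, not one per cell.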

SLIDE 20

Oversampled Input Coding/Quantization

  • Binary support vectors are stored in bit-parallel form
  • Digital inputs are oversampled (e.g. unary coded) and presented bit-serially

W_mn = Σ_{i=0}^{I−1} 2^(−i−1) w_mn^(i) ;  x_n = Σ_{j=0}^{J−1} 2^(−j−1) x_n^(j)   (data encoding)

Y_m = Σ_{n=0}^{N−1} W_mn x_n = Σ_{i=0}^{I−1} 2^(−i−1) Y_m^(i)   (digital accumulation)

where Y_m^(i) = Σ_{j=0}^{J−1} 2^(−j−1) Y_m^(i,j)   (analog delta-sigma accumulation)

and Y_m^(i,j) = Σ_{n=0}^{N−1} w_mn^(i) x_n^(j)   (analog charge-mode accumulation)
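The bit-plane decomposition of the inner product can be checked numerically; the example uses binary (rather than unary) input coding, and the sizes are arbitrary:

```python
import numpy as np

rng = np.random.default_rng(2)
N, I, J = 8, 4, 4
wb = rng.integers(0, 2, (I, N))   # weight bit-planes w^(i)
xb = rng.integers(0, 2, (J, N))   # input bit-planes x^(j)

# full-precision operands reassembled from their bit-planes
W = sum(2.0 ** (-i - 1) * wb[i] for i in range(I))
X = sum(2.0 ** (-j - 1) * xb[j] for j in range(J))

# binary-binary partial products, combined with binary weights
Yij = wb @ xb.T                   # Yij[i, j] = sum_n w^(i)_n x^(j)_n
Y = sum(2.0 ** (-i - j - 2) * Yij[i, j]
        for i in range(I) for j in range(J))
assert np.isclose(Y, W @ X)
```

Only the binary-binary partial products Y^(i,j) are computed in analog; the power-of-two recombination is exact digital arithmetic, which is what makes the scheme "internally analog, externally digital."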

SLIDE 21

Oversampling Architecture

– Oversampled input coding (e.g. unary)
– Delta-sigma modulated ADCs accumulate and quantize row outputs for all unary bit-planes of the input:

Q_m^(i) = Σ_{j=0}^{J−1} Y_m^(i,j) + e
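A behavioral sketch of the accumulate-and-quantize idea, not the chip's circuit: a first-order loop adds one partial sum per cycle and emits a bit whenever the accumulator crosses a reference v_max (an assumed parameter); the bit count plus the final residue recovers the total.

```python
import numpy as np

def delta_sigma_accumulate(partials, vmax=1.0):
    """First-order delta-sigma accumulation: one quantized bit per cycle.
    Assumes each partial sum lies in [0, vmax]."""
    acc, bits = 0.0, 0
    for p in partials:
        acc += p
        if acc >= vmax:        # 1-bit quantizer with feedback subtraction
            acc -= vmax
            bits += 1
    return bits, acc           # digital count, analog residue

rng = np.random.default_rng(3)
partials = rng.random(32)
bits, res = delta_sigma_accumulate(partials)
assert np.isclose(bits * 1.0 + res, partials.sum())
```

Truncating the residue leaves a quantization error below v_max, which oversampling spreads over many bit-planes.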

SLIDE 22

Kerneltron II

Genov, Cauwenberghs, Mulliken and Adil, 2002

  • 3mm x 3mm chip in 0.5µm CMOS
  • Contains 256 x 128 cells and 128 8-bit delta-sigma algorithmic ADCs
  • 6.6 GMACS throughput
  • 5.9 mW power dissipation
  • 8 bit full digital precision
  • Internally analog, externally digital
  • Modular; expandable
  • Low bit-rate serial I/O
SLIDE 23

Delta-Sigma Algorithmic ADC

[Figure: delta-sigma algorithmic ADC waveforms: residue voltage V_res, S/H voltage V_sh, and oversampled digital output Q_s]

8-bit resolution in 32 cycles

SLIDE 24

Signed Multiply-Accumulate Cell

[Figure: measured linearity of parallel analog summation in the XOR CID/DRAM cell configuration; all “01” pairs stored; “01” pairs vs. “10” pairs; input shifted serially]

SLIDE 25

Stochastic Architecture

  • ADC resolution requirements are relaxed by √N
  • Range of Y_m^(i,j) is reduced by √N

Y_m^(i,j) → Normal(µ, σ),  with µ ~ N and σ ~ √N

SLIDE 26

Stochastic Architecture

  • ADC resolution requirements are relaxed by √N
  • Range of Y_m^(i,j) is reduced by √N
  • Analog addition nonlinearity does not affect the precision of computation

Y_m^(i,j) → Normal(µ, σ),  with µ ~ N and σ ~ √N
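The √N statistics are easy to verify with random Bernoulli bit-planes (sizes arbitrary): the partial products concentrate around their mean, so the ADC only needs to resolve a spread of a few times √N rather than the full 0..N range.

```python
import numpy as np

rng = np.random.default_rng(4)
N, trials = 512, 5000
w = rng.integers(0, 2, (trials, N))   # Bernoulli(1/2) weight bits
x = rng.integers(0, 2, (trials, N))   # Bernoulli(1/2) input bits
Y = (w * x).sum(axis=1)               # partial products, each in 0..N

print(Y.mean() / N)              # ~0.25: the mean grows like N
print(Y.std() / np.sqrt(N))      # ~0.43: the spread grows only like sqrt(N)
```

Relative to the worst-case range N, the useful signal window shrinks by a factor of order √N, which is the resolution relaxation claimed above.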

SLIDE 27

Stochastic Input Encoding

U = Σ_{k=0}^{K−1} 2^(−k−1) u^(k) ;  X = (X + U) − U

u^(k) random Bernoulli ⇒ U random uniform

If (range of U) >> (range of X), binary coefficients of (X + U) are ~ Bernoulli

Y_m = W_m·(X + U) − W_m·U = W_m·X

W_m·U : Bernoulli → Normal ;  W_m·(X + U) : Bernoulli → Normal
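The identity X = (X + U) − U is what makes the dithering exact, independent of the statistics of U; a quick check with synthetic values:

```python
import numpy as np

rng = np.random.default_rng(5)
N = 128
x = 0.05 * rng.random(N)          # small-range input X
u = rng.random(N)                 # uniform dither U, range >> range(X)
w = rng.choice([-1.0, 1.0], N)    # signed template row W_m

# two dithered inner products recover the undithered one exactly
y_dither = w @ (x + u) - w @ u
assert np.isclose(y_dither, w @ x)
```

The dither only shapes the bit statistics seen by the analog array; it cancels exactly in the digital subtraction, so no accuracy is traded for the Normal concentration of the partial sums.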

SLIDE 28

Stochastic Encoding: Image Data

Undithered:  Y_m^(i,j) = W_m^(i) · X^(j)

Dithered:  Ỹ_m^(i,j) = W_m^(i) · (X + U)^(j)

[Images: random dither; all bit planes; MSB plane]

SLIDE 29

Stochastic Encoding: Experimental Results

[Figure: worst-case mismatch between measured and computed Y_m^(i,j)]

SLIDE 30

Sequential On-Line SVM Learning

with Shantanu Chakrabartty and Roman Genov

[Figure: margin z_c vs. α_c, with intercept z_c0 and slope Q_cc; z_c0 > 1: point correctly classified outside the margin, α_c stays 0]

α_c' = α_c + C g(z_c),  z_c − z_c0 ≈ Q_cc α_c

SLIDE 31

Sequential On-Line SVM Learning

[Figure: z_c0 < 1: α_c grows along slope Q_cc until the point becomes a margin/error support vector]

α_c' = α_c + C g(z_c),  z_c − z_c0 ≈ Q_cc α_c
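One reading of the sequential rule pictured on these two slides, sketched under the assumption that the presented point's margin z_c rises linearly in its own coefficient with slope Q_cc = K(x_c, x_c), and that α_c is clipped to [0, C]; the closed form below is illustrative, not the slides' exact update:

```python
def sequential_alpha(z_c0, Q_cc, C):
    """Grow alpha_c until the margin z_c = z_c0 + Q_cc * alpha_c reaches 1,
    clipping alpha_c to [0, C] (illustrative closed form)."""
    if z_c0 >= 1.0:
        return 0.0                       # correctly classified: not a SV
    return min(C, (1.0 - z_c0) / Q_cc)   # margin SV (< C) or error SV (= C)

assert sequential_alpha(1.5, 0.5, 10.0) == 0.0    # z_c0 > 1: stays zero
assert sequential_alpha(0.5, 0.5, 10.0) == 1.0    # margin support vector
assert sequential_alpha(-9.0, 0.5, 10.0) == 10.0  # clipped: error support vector
```

The three branches match the figure's annotations: points beyond the margin never become support vectors, while points inside it end up either on the margin (α_c < C) or as bounded error vectors (α_c = C).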

SLIDE 32

Effects of Sequential On-Line Learning and Finite Resolution

[Figure: matched filter responses on train and test data (−1 / +1 classes): batch training at floating-point resolution vs. on-line sequential training on Kerneltron II]
SLIDE 33

Biosonar Signal Processor and Classifier

– Constant-Q filterbank emulates a simplified cochlear model
– Kerneltron VLSI support vector machine (SVM) emulates a general class of neural network topologies for adaptive classification
– Fully programmable, scalable, expandable architecture
– Efficient parallel implementation with distributed memory

[Block diagram: sonar input (15-150kHz) → signal detection & extraction → Welch STFT & decimation (16 freq X 15 time = 240) → principal component analysis (240 → 30) → 2-layer neural network (30 X 22, 22 X 6) → feature/aspect fusion (Orincon); sonar input (15-150kHz) → continuous-time constant-Q filterbank (32 freq) → 32 (freq) X 32 (time) → Kerneltron VLSI SVM classifier (5 chips, 5 X 128) → digital postprocessor (Hopkins)]

Kerneltron: massively parallel SVM (Hopkins, 2001); 256 X 128 processors (adjustable)

with Tim Edwards, APL; and Dave Lemonds, Orincon

SLIDE 34

Real-Time Biosonar Signal Processor

– Analog continuous-time input interface at sonar speeds
  • 250kHz bandwidth
  • 32 frequency channels
– Digital programmable interface
  • In-situ programmable and reconfigurable analog architecture

SLIDE 35

Frontend Analog Signal Processing

– Continuous-time filters
  • Custom topology
  • Individually programmable Q and center frequencies
  • Linear or log spacing on frequency axis
– Energy, envelope or phase detection
  • Energy, synchronous
  • Zero-crossings, asynchronous

[Figure: filterbank output, frequency vs. continuous time]
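Log spacing of the center frequencies can be sketched as follows; the 32-channel count comes from the deck, while the 15-150 kHz range and the Q value are assumptions (the slides quote several frontend bandwidths):

```python
import numpy as np

n_ch, f_lo, f_hi, Q = 32, 15e3, 150e3, 8.0   # Q is an assumed value
# geometrically (log-) spaced center frequencies from f_lo to f_hi
fc = f_lo * (f_hi / f_lo) ** (np.arange(n_ch) / (n_ch - 1))
bw = fc / Q          # constant-Q: bandwidth scales with center frequency

print(fc[0], fc[-1])  # endpoints: 15 kHz and 150 kHz
```

Linear spacing would replace the geometric progression with `np.linspace(f_lo, f_hi, n_ch)`; the programmable filters allow either.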

SLIDE 36

Real-Time LFM2 Frontend Data

[Figures: real-time frontend output for training and test data]

SLIDE 37

LFM2 Kernel Computation

  • Inner-product based kernel is used for SVM classification
  • PCA is used to enhance features prior to SVM training
SLIDE 38

Measured vs. Simulated Performance

LFM2 mines/non-mines classification

– Hardware and simulated biosonar system perform close to the Welch STFT/NN classification benchmark.
  • Classification performance is mainly data-limited.
– Hardware runs in real time.
  • 2msec per ping.
  • 50 times faster than simulation on a 1.6GHz Pentium 4.

ROC curve obtained on test set by varying the SVM classification threshold

SLIDE 39

System Integration

[Chip: 512 X 128 CID/DRAM array, 128 ADCs]

PC Interface

  • classification output; digital sonar input
  • programming and configuration

Analog Sonar Input Frontend

  • 32 channels
  • 100Hz-200kHz
  • Smart A/D

Xilinx FPGA

  • I/O and dataflow control
  • reconfigurable computing

Kerneltron SVM

  • 10^10 - 10^12 MAC/s
  • internally analog
  • externally digital
SLIDE 40

Conclusions

  • Parallel charge-mode and current-mode VLSI technology offers efficient implementation of high-dimensional kernel machines.

– Computational throughput is a factor 100-10,000 higher than presently available from a high-end workstation or DSP, owing to a fine-grain distributed parallel architecture with bit-level integration of memory and processing functions.
– Ultra-low power dissipation and miniature size support real-time autonomous operation.

  • Applications include unattended adaptive recognition systems for vision and audition, such as video/audio surveillance and intelligent aids for the visually and hearing impaired.