Time-delay differential equations in machine learning Lyudmila - PowerPoint PPT Presentation

Time-delay differential equations in machine learning Lyudmila Grigoryeva 1 , Julie Henriques 2 , Laurent Larger 2 , Juan-Pablo Ortega 3 , 4 1 Universit¨ at Konstanz, Germany 2 Universit´ e Bourgogne Franche-Comt´ e, France 3 Centre National de la Recherche Scientifique (CNRS), France 4 Universit¨ at Sankt Gallen, Switzerland L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 1 / 54 TDDE in machine learning

Outline of the presentation L. Grigoryeva, J. Henriques, L. Larger, and J.-P. Ortega. Stochastic time series forecasting using time-delay reservoir computers: performance and universality. Neural Networks , 55:59–71, 2014. L. Grigoryeva, J. Henriques, L. Larger, and J.-P. Ortega. Optimal nonlinear information processing capacity in delay-based reservoir computers. Scientific Reports , 5(12858):1–11, 2015. L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega, 2016. Nonlinear memory capacity of parallel time-delay reservoir computers in the processing of multidimensional signals. To appear in Neural Computation . L. Grigoryeva, J. Henriques, J.-P. Ortega, 2015. Quantitative evaluation of the performance of discrete-time reservoir computers in the forecasting, filtering, and reconstruction of stochastic stationary signals. Preprint. L. Grigoryeva, J.-P. Ortega, 2016. Ridge regression with homoscedastic residuals: generalization error with estimated parameters. Preprint. L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 2 / 54 TDDE in machine learning

Outline of the presentation Outline Reservoir computing: brain-inspired machine learning paradigm 1 Time-Delay Reservoir (TDR) computers: 2 Physical implementation with opto- and electronic systems High-speed and excellent computational performance Architecture of TDR computers Preliminary empirical results: 3 Application of TDR to stochastic nonlinear time series forecasting (multivariate VEC-GARCH models) Parallel reservoir architectures and task-universality Theoretical results on optimal TDR architecture: 4 Unimodality versus bimodality; stability of the TDR VAR(1) model as the TDR approximating model Nonlinear capacity as a quantative measure of performance Further research 5 L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 3 / 54 TDDE in machine learning

Reservoir computing: brain-inspired machine learning paradigm Machine learning and brain-inspired neural networks Machine learning: construction and development of algorithms that can “learn” from the data and are able to adaptively make decisions. Neural networks: brain-inspired family of statistical models and algorithms that are repre- sented as the collection of interconnected neurons-nodes that have task-adaptive features. Proved to perform in estimation or approximation of functions that are generally unknown (pattern recognition, classification, forecasting). Figure 1: Conventional NN: the weights of the nodes and the activation function have to be chosen at the training stage depending on the task. Disadvantages: convoluted and sometimes ill-defined optimization algorithms for weights determining. L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 4 / 54 TDDE in machine learning

Reservoir computing: brain-inspired machine learning paradigm Reservoir computing: brain-inspired machine learning paradigm Fundamentally new approach to neural computing [Jae01, JH04, MNM02, VSDS07, LJ09]; defining features of RC: the fading-memory, separation, and approximation properties [LJ09] Modification of the traditional RNN in which the architecture and the neuron weights of the network are created in advance (for example randomly) and remain unchanged during the training stage The output signal is obtained in the RC with a linear readout layer that is trained using the teacher signal via a ridge (Tikhonov regularized) regression L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 5 / 54 TDDE in machine learning

Reservoir computing: brain-inspired machine learning paradigm Physical implementation: reservoir computing (RC) devices A major feature of the RC is the possibility of constructing physical realizations of reservoirs instead of simulating them (numerically) Chaotic dynamical systems can be used to construct reservoirs that exhibit the RC features: in [ASV + 11] using chaotic electronic oscillators or using optoelectronic devices like in [LSB + 12] Figure 3: Optoelectronic implementation of RC with a single nonlinear element subject to delayed feedback [LSB + 12] L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 6 / 54 TDDE in machine learning

Reservoir computing: brain-inspired machine learning paradigm Objectives address the reservoir design and working principle problems application of RC in the non-deterministic tasks: forecasting of stochastic time series L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 7 / 54 TDDE in machine learning

Construction of Time-Delay Reservoir (TDR) computers z 1 z 2 z T c c c A I (1) I (2) I ( T ) I 1 (1) I 2 (1) I N (1) I 1 (2) I 2 (2) I N (2) I 1 ( T ) I 2 ( T ) I N ( T ) B X 1(1) X 2(1) XN (1) X 1(2) X 2(2) XN (2) X 1( T ) X 2( T ) XN ( T ) W out W out W out C Figure 4: Diagram of architecture of the time-delay reservoir (TDR) and 3 modules of the reservoir computer (RC): the input layer A, the time-delay reservoir B, and the readout layer C. L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 8 / 54 TDDE in machine learning

Construction of Time-Delay Reservoir (TDR) computers Input module Construction of the input layer depends on the computational task of interest and involves the values of the input signal at a given t and the input mask ; consists of multiplexing the input signal over the delay period and forcing its mean to be zero. Consider multi-dimensional time series as the input signal: in this case z ( t ) ∈ R n and for each t define I ( t ) := C z ( t ) ∈ R N , where C ∈ M N , n is the input mask [GHLO14] L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 9 / 54 TDDE in machine learning

Construction of Time-Delay Reservoir (TDR) computers Construction of the time-delay reservoir (TDR) TDRs are based on the “interaction” of the discrete input signal z ( t ) ∈ R with the solution space of a TDDE of the form x ( t ) = − x ( t ) + f ( x ( t − τ ) , I ( t ) , θ ) , ˙ (1) where f is a nonlinear smooth function ( nonlinear kernel ), θ ∈ R K is the parameter vector, τ > 0 is the delay , x ( t ) ∈ R , and I ( t ) ∈ R is obtained via temporal multiplexing of the input signal z ( t ) over the delay period; x ∈ C 1 ([ − τ, 0] , R ) needs to be specified prior. The choice of nonlinear kernel f is determined by the physical implementation; consider two parametric sets of kernels: η ( x + γ I ) the Mackey-Glass [MG77]: f ( x , I , θ ) = 1+( x + γ I ) p , θ = ( η, γ, p ) the Ikeda [Ike79]: f ( x , I , θ ) = η sin 2 ( x + γ I + φ ), θ = ( η, γ, φ ) Used in the RC electronic [ASV + 11] and optoelectronic [LSB + 12] realizations. L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 10 / 54 TDDE in machine learning

Construction of Time-Delay Reservoir (TDR) computers Continuous time model of TDR Consider the regular sampling of solution x ( t ) of (1) during a given time-delay interval and define x i ( t ) the value of the i th neuron of the reservoir at time t τ as t ∈ Z , x i ( t ) := x ( t τ − ( N − i ) d ) , i ∈ { 1 , . . . , N } , where τ := dN , d the separation between neurons and we also say that x i ( t ) is the i th neuron value of the t th layer of the reservoir . L. Grigoryeva, J. Henriques, L. Larger, J.-P. Ortega ( Universit¨ at Konstanz, Germany, Universit´ e Bourgogne Franche-Comt´ e, France, Centre National 11 / 54 TDDE in machine learning

Time-delay differential equations in machine learning Lyudmila - PowerPoint PPT Presentation

Time-delay differential equations in machine learning Lyudmila Grigoryeva 1 , Julie Henriques 2 , Laurent Larger 2 , Juan-Pablo Ortega 3 , 4 1 Universit at Konstanz, Germany 2 Universit e Bourgogne Franche-Comt e, France 3 Centre National

Interconnect Gate delay Wire delay The delay in VLSI circuits have two components Gate delay (

Modelling with Differential Equations Modelling with Differential Equations Modelling with

Differential equations Programming of Differential Equations A differential equation (ODE)

Differential equations Programming of Differential Equations A differential equation (ODE)

1.3 Differential Equations as Mathematical Models a lesson for MATH F302 Differential Equations

Differential Equations Classification and Simulation November 25, 2008 M. Emmerich, LIACS 1

15. Partial differential equations; double integrals 15.1. Partial differential equations. Recall

Differential Equations x i +1 x i x i +1 = x i + x x Physics-based simulation

TOC 1 Introduction to Differential Equations 1.1 Preliminaries 1.2 Differential Equations; Basic

Chapter 2 Chapter 2 Systems Defined by Systems Defined by Differential or Difference

Notes about ordinary differential equations. Master BME, Math Level 2 October 10, 2019 1/33

Math 211 Math 211 Lecture #3 Solutions to Differential Equations August 29, 2003 2

Chapter 2 Chapter 2 Systems Defined by Systems Defined by Differential or Difference

Higher Order Linear Differential Equations Familiar stuff Example Homogeneous equations Math

TOC 2. First Order Differential Equations and Applications 2.1 Preliminaries 2.2 Linear

DIFFERENTIAL AROMA VOL DIFFERENTIAL AROMA VOL DIFFERENTIAL AROMA VOLATILES DIFFERENTIAL AROMA

15-16/08/2009 Nicola Galesi 46 History Resolution is a proof system for DNF formulas or a

HOST Hardware Trojans III ECE 525 Seminal Trojan Detection Method D. Agrawal, S. Baktir, D.

CAN BLOCKCHAIN SOLVE THE HOLD-UP PROBLEM? RICHARD HOLDEN AND ANUP MALANI UNSW AND UNIVERSITY OF

DAT USEDAT SRI MDC: Prof. David Quesada with students at SRI Summer School, Miami Dade College

Q2 2009 CONFERENCE CALL C O R P O R A T E P A R T I C I P A N T S Caution Regarding

PERFORMANCE MANAGEMENT SYSTEMS CHAPTER VI PAY FOR PERFORMANCE PERFORMANCE MANAGEMENT SYSTEMS

Telecom Italia Group 9M 2014 Results Marco Patuano Piergiorgio Peluso Safe Harbour This

the compelling alternative South East England Councils 8 September 2017 John Stebbings Chief