Machine learning theory for time series
Exponential inequalities for nonstationary Markov chains

Pierre Alquier, CIMFAV seminar, January 16, 2019



Outline

1. Short introduction to machine learning theory
2. Machine learning and time series
   • Machine learning & stationary time series
   • Nonstationary Markov chains

Generic machine learning problem

Main ingredients:

• Observations: (X1, Y1), (X2, Y2), ..., (Xn, Yn)
  → usually i.i.d. from an unknown distribution P.
• A restricted set of predictors (fθ, θ ∈ Θ)
  → fθ(X) is meant to predict Y.
• A loss function ℓ
  → ℓ(y′ − y) is the loss incurred by predicting y′ while the truth is y.
• The risk R(θ)
  → R(θ) = E_{(X,Y)∼P}[ℓ(fθ(X) − Y)]. Not observable.
• An empirical proxy r(θ) for R(θ)
  → for example r(θ) = (1/n) ∑_{i=1}^n ℓ(fθ(Xi) − Yi).
• The empirical risk minimizer θ̂
  → θ̂ = argmin_{θ∈Θ} r(θ).
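To make these ingredients concrete, here is a minimal Python sketch of empirical risk minimization over a finite set of predictors. The simulated data, the grid of slopes and the absolute loss are illustrative assumptions, not taken from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated observations (X_i, Y_i); the learner does not know P.
n = 500
X = rng.uniform(-1.0, 1.0, size=n)
Y = 2.0 * X + 0.1 * rng.standard_normal(n)

# Restricted set of predictors: f_theta(x) = theta * x, theta on a finite grid.
Theta = np.linspace(-5.0, 5.0, 101)

def r(theta):
    """Empirical risk r(theta) = (1/n) * sum_i loss(f_theta(X_i) - Y_i), absolute loss."""
    return np.mean(np.abs(theta * X - Y))

# Empirical risk minimizer theta_hat = argmin over the finite grid.
risks = np.array([r(theta) for theta in Theta])
theta_hat = Theta[np.argmin(risks)]
print(f"theta_hat = {theta_hat:.2f}, r(theta_hat) = {risks.min():.4f}")
```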

Sub-gamma random variables

Definition. T is said to be sub-gamma iff there exist (v, w) such that, for all k ≥ 2,

  E[|T|^k] ≤ (k!/2) v w^{k−2}.

Examples:
• T ∼ Γ(a, b): holds with (v, w) = (ab², b).
• any Z with P(|Z| ≥ t) ≤ P(|T| ≥ t).

Bernstein's inequality

Theorem. Let T1, ..., Tn be i.i.d. and (v, w)-sub-gamma random variables. Then, for all ζ ∈ (0, 1/w),

  E exp( ζ ∑_{i=1}^n [Ti − E Ti] ) ≤ exp( nvζ² / (2(1 − wζ)) ).

Consequence in ML: put Ti = −ℓ(fθ(Xi) − Yi) and assume Ti is (v, w)-sub-gamma. Note that (1/n) ∑_{i=1}^n (Ti − E Ti) = R(θ) − r(θ). Then, for any s ∈ (0, n/w), by the Chernoff argument,

  P( R(θ) − r(θ) > t ) = P( exp[s(R(θ) − r(θ))] > exp(st) )
                       ≤ E exp[ s(R(θ) − r(θ)) − st ]
                       = E exp[ (s/n) ∑_{i=1}^n (Ti − E Ti) − st ]
                       ≤ exp( vs² / (2(n − ws)) − st ).

Applying the same bound to r(θ) − R(θ) gives

  P( |R(θ) − r(θ)| > t ) ≤ 2 exp( vs² / (2(n − ws)) − st ).
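As a numerical companion, the sketch below evaluates the right-hand side 2 exp( vs²/(2(n − ws)) − st ) and optimizes it over s on a grid; the values of n, v, w and t are arbitrary assumptions, chosen only to show how the bound shrinks with n.

```python
import numpy as np

def bernstein_tail_bound(n, v, w, t, num_s=10_000):
    """Best bound min_s 2*exp(v*s^2/(2*(n - w*s)) - s*t) over s in (0, n/w)."""
    s = np.linspace(1e-6, (n / w) * (1 - 1e-6), num_s)
    log_bound = v * s**2 / (2.0 * (n - w * s)) - s * t
    return 2.0 * np.exp(log_bound.min())

# Illustrative sub-gamma parameters (v, w) and deviation level t (assumptions).
for n in [100, 1_000, 10_000]:
    print(n, bernstein_tail_bound(n=n, v=1.0, w=0.5, t=0.1))
```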

Finite set of predictors and union bound

For a finite Θ, the union bound gives

  P( ∃θ ∈ Θ : |R(θ) − r(θ)| > t ) = P( ⋃_{θ∈Θ} { |R(θ) − r(θ)| ≥ t } )
                                  ≤ ∑_{θ∈Θ} P( |R(θ) − r(θ)| ≥ t )
                                  ≤ 2|Θ| exp( vs² / (2(n − ws)) − st ).

On the complement of this event, for any θ0 ∈ Θ,

  R(θ̂) ≤ r(θ̂) + t ≤ r(θ0) + t ≤ R(θ0) + 2t

(the middle step because θ̂ minimizes r), hence R(θ̂) ≤ min_{θ∈Θ} R(θ) + 2t. Choosing t so that 2|Θ| exp( vs²/(2(n − ws)) − st ) = α yields

  R(θ̂) ≤ min_{θ∈Θ} R(θ) + vs/(n − ws) + (2/s) log(2|Θ|/α),

and for s = [n/(2w)] ∧ √( (n/v) log(2|Θ|/α) ) we obtain:

Theorem. With probability at least 1 − α,

  R(θ̂) ≤ min_{θ∈Θ} R(θ) + 2 √( v log(2|Θ|/α) / n ) ∨ ( 2w log(2|Θ|/α) / n ).

In particular, when α is not ridiculously small (so that the square-root term dominates), with probability at least 1 − α,

  R(θ̂) ≤ min_{θ∈Θ} R(θ) + 2 √( v log(2|Θ|/α) / n ).
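The resulting rate is easy to tabulate. A small sketch, assuming illustrative values of v, |Θ| and α:

```python
import numpy as np

def finite_class_bound(n, v, card_Theta, alpha):
    """Dominant term 2*sqrt(v*log(2|Theta|/alpha)/n) of the excess-risk bound."""
    return 2.0 * np.sqrt(v * np.log(2.0 * card_Theta / alpha) / n)

for n in [100, 1_000, 10_000]:
    print(n, round(finite_class_bound(n, v=1.0, card_Theta=1_000, alpha=0.05), 4))
```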

Infinite parameter set

Θ compact ⇒ there exists a finite ε-net Θ(ε) such that ∀θ ∈ Θ, ∃θ′ ∈ Θ(ε) with δ(θ, θ′) ≤ ε.

Figure – Illustration of a covering by ε-balls (image from Wikipedia).

Example: in the finite-dimensional case there is a Θ(ε) with |Θ(ε)| ≲ 1/ε^d.

Assume θ ↦ ℓ(fθ(X) − Y) is a.s. L-Lipschitz w.r.t. δ(·, ·). Applying the previous theorem to Θ(ε),

  R(θ̂) ≤ min_{θ∈Θ} R(θ) + 2Lε + 2 √( v log(2|Θ(ε)|/α) / n )
        ≤ min_{θ∈Θ} R(θ) + 2Lε + 2 √( vd log(2/(εα)) / n ).

Taking ε = √(d/n), we obtain:

Theorem. When the loss is L-Lipschitz, with probability at least 1 − α,

  R(θ̂) ≤ min_{θ∈Θ} R(θ) + √(d/n) ( 2L + 2 √( v log(2n) ) ).
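For intuition, here is a small sketch that builds the grid ε-net of [0,1]^d behind the |Θ(ε)| ≲ ε^{−d} count and checks the covering property on random points; the cube domain and the sup-norm metric are assumptions made for illustration.

```python
import itertools
import numpy as np

def grid_eps_net(d, eps):
    """Grid with step 2*eps: an eps-net of [0,1]^d for the sup-norm distance."""
    centers = np.arange(eps, 1.0 + 2 * eps, 2 * eps)  # centers of cells of width 2*eps
    return np.array(list(itertools.product(centers, repeat=d)))

d, eps = 2, 0.05
net = grid_eps_net(d, eps)
print(f"|Theta(eps)| = {len(net)} (of order eps^-d = {eps**-d:.0f})")

# Covering check: every random theta has a net point within eps (sup-norm).
rng = np.random.default_rng(1)
thetas = rng.uniform(0.0, 1.0, size=(1000, d))
dists = np.abs(thetas[:, None, :] - net[None, :, :]).max(axis=2).min(axis=1)
assert dists.max() <= eps + 1e-12
print("max distance to the net:", dists.max())
```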

Model selection

Now consider models Θ1, ..., ΘM with estimators θ̂1, ..., θ̂M and thresholds t1, ..., tM such that

  P( ∃m, ∃θ ∈ Θm : |R(θ) − r(θ)| > tm ) ≤ α.

Define m̂ = argmin_m [ r(θ̂m) + tm ]. Similar derivations lead to:

Theorem. With probability at least 1 − α,

  R(θ̂_m̂) ≤ min_{1≤m≤M} [ min_{θ∈Θm} R(θ) + √(dm/n) ( 2L + 2 √( v log(2nM) ) ) ].
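A minimal sketch of the rule m̂ = argmin_m [r(θ̂m) + tm], using nested grids of increasing size as the models Θ1, ..., ΘM and a threshold tm of the same √(log/n) form as above; the data-generating process, the loss and all constants are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 400
X = rng.uniform(-1.0, 1.0, size=n)
Y = np.sin(3.0 * X) + 0.1 * rng.standard_normal(n)

v, alpha = 1.0, 0.05
M = 6
# Models Theta_1, ..., Theta_M: grids of increasing cardinality.
models = [np.linspace(-3.0, 3.0, 2**m + 1) for m in range(1, M + 1)]

def erm(grid):
    """ERM of f_theta(x) = theta * x over a finite grid, absolute loss."""
    risks = np.array([np.mean(np.abs(theta * X - Y)) for theta in grid])
    return grid[np.argmin(risks)], risks.min()

# t_m calibrated so the union over all M models keeps total probability alpha.
results = []
for m, grid in enumerate(models, start=1):
    theta_m, r_m = erm(grid)
    t_m = np.sqrt(v * np.log(2.0 * len(grid) * M / alpha) / n)
    results.append((r_m + t_m, m, theta_m))

_, m_hat, theta_hat = min(results)
print(f"selected model m_hat = {m_hat}, theta_hat = {theta_hat:.3f}")
```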

Going further

Improvements, extensions...
• removing the log(n) factor by a refinement of the ε-net structure.
• faster rates: √(d/n) becomes d/n thanks to a better analysis of v under the Bernstein condition.
• relaxing the sub-gamma assumption.
• a more flexible way to measure the complexity of Θ: PAC-Bayesian bounds.

Machine learning and time series

Extension to time series

Machine learning has been studied for time series with various techniques. Asymptotic study in the mixing case:

  I. Steinwart, D. Hush, C. Scovel. Learning from dependent observations. Journal of Multivariate Analysis, 2009.

In order to extend the previous (non-asymptotic) approach to non-independent observations, exponential inequalities (Hoeffding, Bernstein, etc.) are required. These inequalities require some assumption on the dependence of the series: Markov, mixing, weak dependence, martingale, ...

An example on Markov chains

Let F : X × Y → X and let (Xt)_{t≥1} be the Markov chain

  Xt = F(Xt−1, εt).

Assume that, for some ρ ∈ [0, 1) and C > 0,

  E[ d(F(x, ε1), F(x′, ε1)) ] ≤ ρ d(x, x′),
  d(F(x, y), F(x, y′)) ≤ C δ(y, y′).

Objective

Xt = F(Xt−1, εt). In this case we study one-step-ahead prediction:

  r(θ) = (1/(n−1)) ∑_{i=2}^n ℓ(fθ(Xi−1) − Xi),   R(θ) = E[ℓ(fθ(X1) − X2)].

Define θ̂ = argmin_{θ∈Θ} r(θ).
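A sketch of this one-step-ahead ERM on a simulated chain Xt = F(Xt−1, εt); here F(x, ε) = 0.7·tanh(x) + ε, which satisfies the contraction assumption with ρ = 0.7 for the absolute-value metric (and C = 1), and the linear predictors fθ(x) = θx are an illustrative choice.

```python
import numpy as np

rng = np.random.default_rng(3)

# Simulate X_t = F(X_{t-1}, eps_t) with F(x, e) = 0.7*tanh(x) + e (contraction rho = 0.7).
n = 2000
X = np.zeros(n)
for t in range(1, n):
    X[t] = 0.7 * np.tanh(X[t - 1]) + 0.2 * rng.standard_normal()

# One-step-ahead empirical risk r(theta) = (1/(n-1)) * sum_{i=2}^n |f_theta(X_{i-1}) - X_i|.
Theta = np.linspace(-1.0, 1.0, 201)

def r(theta):
    return np.mean(np.abs(theta * X[:-1] - X[1:]))

theta_hat = Theta[np.argmin([r(theta) for theta in Theta])]
print(f"theta_hat = {theta_hat:.2f}")  # roughly 0.7, since tanh(x) ~ x for small x
```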

Notations in DF15

Define

  G_{X1}(x) = ∫ d(x, x′) P_{X1}(dx′),   G_ε(y) = ∫ C δ(y, y′) P_ε(dy′).

Assumption: for any k ≥ 2,

  E[ G_{X1}(X1)^k ] ≤ (k!/2) V1 M^{k−2}   and   E[ G_ε(ε)^k ] ≤ (k!/2) V2 M^{k−2}.

Define

  V = (V1 + V2) / (1 − ρ)²,   δ = M / (1 − ρ).
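To see what these quantities look like in a concrete case, the sketch below estimates G_{X1}(0) and G_ε(0) by Monte Carlo for a stationary Gaussian AR(1), where d = δ = |·| and C = 1; the chain parameters are assumptions for illustration, and the Bernstein constants (V1, V2, M) would still have to be derived analytically.

```python
import numpy as np

rng = np.random.default_rng(4)
a, sigma = 0.7, 1.0  # AR(1): X_t = a*X_{t-1} + eps_t, eps_t ~ N(0, sigma^2)

# Stationary law of X_1 is N(0, sigma^2/(1 - a^2)); here C = 1 and d = delta = |.|.
N = 100_000
X1 = rng.normal(0.0, sigma / np.sqrt(1.0 - a**2), size=N)
eps = rng.normal(0.0, sigma, size=N)

G_X1_at_0 = np.mean(np.abs(0.0 - X1))    # G_{X1}(0) = E|0 - X'|
G_eps_at_0 = np.mean(np.abs(0.0 - eps))  # G_eps(0) = C * E|0 - eps'|
print(f"G_X1(0) ~ {G_X1_at_0:.3f}, G_eps(0) ~ {G_eps_at_0:.3f}")
```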

Dedecker and Fan's inequality

Theorem (Dedecker & Fan, 2015). Consider a separately Lipschitz function f : X^n → R, i.e.

  |f(x1, ..., xn) − f(x′1, ..., x′n)| ≤ ∑_{t=1}^n d(xt, x′t).

Then, for any s ∈ [0, δ⁻¹),

  E[ e^{±s{f(X1,...,Xn) − E[f(X1,...,Xn)]}} ] ≤ exp( (n−1)s²V / (2(1 − sδ)) ).

Consequences of DF15 for prediction

Take

  f(X1, ..., Xn) = (1/L) ∑_{i=2}^n ℓ(fθ(Xi−1) − Xi).

Then, for any 0 ≤ s < (n−1)/(L(1+ρ)δ),

  P( |R(θ) − r(θ)| > t ) ≤ 2 exp( s²(1+ρ)²L²V / (2(n−1) − 2s(1+ρ)δL) − st ).

Learning theorem for Markov chains

Assume |Θ(ε)| ≤ ε^{−d}.

Theorem. As soon as n ≥ 1 + 4δ²d log(Ln)/V we have, with probability at least 1 − α,

  R(θ̂) ≤ inf_{θ∈Θ} R(θ) + C1 √( d log(Ln) / (n−1) ) + C2 log(4/α)/√(n−1) + C3/n,

where C1 = 4(1+ρ)L√V, C2 = 2(1+ρ)L√V + 2δ and C3 = 3[Gε(0) + G_{X1}(0)]/(1−ρ) + V/(2δ).

Other works on ML & TS

Study of Xt = F(εt; Xt−1, Xt−2, ...) in

  P. Alquier, O. Wintenberger. Model selection for weakly dependent time series forecasting. Bernoulli, 2012.

based on Rio's version of Hoeffding's inequality:

  E. Rio. Inégalités de Hoeffding pour les fonctions lipschitziennes de suites dépendantes. CRAS, 2000.

Rates in √(d/n).

Fast rates

Rates in d/n for quadratic ℓ in

  P. Alquier, X. Li, O. Wintenberger. Prediction of time series by statistical learning: general losses and fast rates. Dependence Modeling, 2013.

based on Samson's version of Bernstein's inequality for ϕ-mixing processes:

  P.-M. Samson. Concentration of measure inequalities for Markov chains and ϕ-mixing processes. The Annals of Probability, 2000.

Online prediction approach

The online prediction approach provides tools to aggregate predictors without stochastic assumptions on the data.

  C. Giraud, F. Roueff, A. Sanchez-Perez. Aggregation of predictors for nonstationary sub-linear processes and online adaptive forecasting of time varying autoregressive processes. The Annals of Statistics, 2015.

takes advantage of this approach to predict time-varying AR processes.


Nonstationary Markov chains

  P. Alquier, P. Doukhan, X. Fan. Exponential inequalities for nonstationary Markov chains. Preprint arXiv:1808.08811, 2018.

Non-stationary Markov chains

We now assume

  Xt = Ft(Xt−1, εt),
  sup_t E[ d(Ft(x, ε1), Ft(x′, ε1)) ] ≤ ρ d(x, x′),
  sup_t d(Ft(x, y), Ft(x, y′)) ≤ C δ(y, y′).

Example 1: time-varying AR(1)

  Xt = at Xt−1 + εt,   sup_t |at| ≤ ρ.

Example 2: T-periodic AR(1)

  Xt = a_{t[T]} Xt−1 + εt,   max_{1≤t≤T} |at| ≤ ρ,

where t[T] denotes t modulo T.

Example: T-periodic AR(1)

4-periodic AR(1), (a1, a2, a3, a4) = (0.8, 0.5, 0.9, −0.7).

Figure – Simulated data.   Figure – Autocorrelations.
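The two figures can be reproduced in a few lines; the coefficients are those of the slide, while the noise law and the sample size are assumptions.

```python
import numpy as np

rng = np.random.default_rng(5)
a = [0.8, 0.5, 0.9, -0.7]  # (a_1, a_2, a_3, a_4) from the slide

# Simulate the 4-periodic AR(1): X_t = a_{t[4]} * X_{t-1} + eps_t.
n = 1000
X = np.zeros(n)
for t in range(1, n):
    X[t] = a[(t - 1) % 4] * X[t - 1] + rng.standard_normal()

# Sample autocorrelations at lags 0..12.
Xc = X - X.mean()
acf = np.array([np.dot(Xc[: n - k], Xc[k:]) / np.dot(Xc, Xc) for k in range(13)])
print(np.round(acf, 2))
```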

Bernstein's inequality

Theorem (ADF18). Assume that, for any k ≥ 2,

  E[ G_{X1}(X1)^k ] ≤ (k!/2) V1 M^{k−2}   and   E[ G_ε(ε)^k ] ≤ (k!/2) V2 M^{k−2}.

Consider a separately Lipschitz function f : X^n → R. For any s ∈ [0, δ⁻¹),

  E[ e^{±s{f(X1,...,Xn) − E[f(X1,...,Xn)]}} ] ≤ exp( (n−1)s²V / (2(1 − sδ)) ).

Problem: estimation of the (best) period

From now on, assume that

  Xt = f*_t(Xt−1) + εt

(not necessarily periodic, but we hope so). Let (fθ, θ ∈ Θ) be a set of predictors X → X and define H(ε) = log |Θ(ε)|. Put, for any T and θ1:T = (θ1, ..., θT) ∈ Θ^T:

  rn(θ1:T) = (1/(n−1)) ∑_{i=2}^n ℓ(fθ_{i[T]}(Xi−1) − Xi),

  R(θ1:T) = E[rn(θ1:T)] ∈ [ (1/T) ∑_{t=1}^T E[ℓ(fθ_{t[T]}(Xt−1) − Xt)] ± C0 T/(n−1) ]

if actually f*_t = f*_{t[T]}, where C0 = L(1+ρ) [ Gε(0)/(1−ρ) + G_{X1}(0) ].

Estimators

Estimation for a given period T:

  θ̂1:T = (θ̂1, ..., θ̂T) = argmin_{θ1:T = (θ1,...,θT)} rn(θ1:T).

Period selection:

  T̂ = argmin_{1≤T≤Tmax} [ rn(θ̂1:T) + (C1/2) √( T H(1/(Ln)) / (n−1) ) ],

where C1 is as in the stationary case: C1 = 4(1+ρ)L√V.
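A sketch of this period-selection rule on simulated data: for each candidate T we fit one AR coefficient per phase by least squares (an illustrative stand-in for ERM over a predictor family), and we penalize by c√(T/(n−1)) with a hand-picked constant c, anticipating the remark after the next theorem that the theoretical constants are too large in practice.

```python
import numpy as np

rng = np.random.default_rng(6)
a_true = [0.8, 0.5, 0.9, -0.7]
n = 2000
X = np.zeros(n)
for t in range(1, n):
    X[t] = a_true[(t - 1) % 4] * X[t - 1] + 0.5 * rng.standard_normal()

def risk_for_period(T):
    """Fit one AR(1) coefficient per phase modulo T by least squares; return mean squared residual."""
    num, den = np.zeros(T), np.zeros(T)
    for i in range(1, n):
        p = (i - 1) % T
        num[p] += X[i - 1] * X[i]
        den[p] += X[i - 1] ** 2
    a_hat = num / np.maximum(den, 1e-12)
    resid = [(X[i] - a_hat[(i - 1) % T] * X[i - 1]) ** 2 for i in range(1, n)]
    return float(np.mean(resid))

c, T_max = 2.0, 12  # penalty constant c is a hand-picked assumption
scores = {T: risk_for_period(T) + c * np.sqrt(T / (n - 1)) for T in range(1, T_max + 1)}
T_hat = min(scores, key=scores.get)
print("selected period T_hat =", T_hat)  # the true period here is 4
```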

Analysis of the estimators

Theorem (ADF18). As soon as n ≥ 1 + 4δ²Tmax H(1/(Ln))/V, with probability at least 1 − α,

  R(θ̂_{1:T̂}) ≤ inf_{1≤T≤Tmax} inf_{θ1:T ∈ Θ^T} [ R(θ1:T) + C1 √( T H(1/(Ln)) / (n−1) ) + C2 log(4Tmax/α)/√(n−1) + C3/n ].

In practice, C1, C2 and C3 are too large and ρ is not known anyway, so we recommend using the slope heuristic here.

Slope heuristic

Figure – Empirical risk as a function of T.
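One way to implement the slope heuristic, sketched under the assumption that the empirical risk decreases roughly linearly in the penalty shape √(T/(n−1)) for large T: estimate that slope on the largest candidates, then select T using twice the estimated slope as penalty. This is a schematic recipe, not the calibrated procedure of the paper; the input risks below are made-up numbers shaped like the figure, and in practice they would be the rn(θ̂1:T) from the previous sketch.

```python
import numpy as np

def slope_heuristic(risks, pens, n_tail=5):
    """Fit risk ~ kappa * pen on the largest models, then penalize with 2*|kappa|."""
    r_tail, p_tail = np.asarray(risks[-n_tail:]), np.asarray(pens[-n_tail:])
    kappa = np.polyfit(p_tail, r_tail, deg=1)[0]  # fitted slope (expected negative)
    scores = np.asarray(risks) + 2.0 * abs(kappa) * np.asarray(pens)
    return int(np.argmin(scores)) + 1  # candidate periods are 1, 2, ...

# Made-up risk curve: monotone decrease until the true period, then flat.
n = 2000
pens = [np.sqrt(T / (n - 1)) for T in range(1, 13)]
risks = [0.40, 0.38, 0.37, 0.26, 0.26, 0.26, 0.26, 0.255, 0.255, 0.25, 0.25, 0.25]
print("T_hat =", slope_heuristic(risks, pens))
```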

Thank you!