SLIDE 1

Experiments in Value Function Approximation with Sparse Support Vector Regression

Tobias Jung and Thomas Uthmann

{tjung,uthmann}@informatik.uni-mainz.de

Fachbereich Mathematik & Informatik, Johannes Gutenberg-Universität Mainz, Germany

SLIDE 2

Why SVR?

• Why it might be a good idea: SVR is a current state-of-the-art regression technique; it promises good generalization and no "forgetting", compared to local instance-based methods (e.g. LWR).

• Why it might be a bad idea: conceptual and implementation issues. SVR is a batch learner, but on-line RL needs to add new samples to the current training sequence and modify (update) existing ones.

SLIDE 3

Contents

What is this talk about?

1. Value Function Approximation: Reinforcement Learning, Temporal-Difference Learning, Function Approximation and TD(0)

2. Support Vector Regression: formulating the QP, sparse approximation, the reduced problem, on-line selection of the subset (based on Engel, Mannor and Meir (2002))

3. Experiments: Gridworld, Mountain Car

4. Summary and Future Ideas

[Diagram: RL produces a list of (states, values), a very big training set; it is reduced to a very small sparsified set, which is used to update (add to) the sparse SVR regressor.]

SLIDE 4

Reinforcement Learning I

A Markov Decision Process consists of:

States S = {s1, . . . , sN}

Actions A = {a1, . . . , aM}

Reward model: R^a(s, s′)

Transition probabilities (Markov): P^a(s, s′)

Agent-environment loop: at each step t = 0, 1, 2, . . . the agent in state s_t takes action a_t, receives reward r_t, and the environment moves to state s_{t+1}.

Catch: usually these are not given. In RL the learner does not know P^a(s, s′), R^a(s, s′).

Objective: choose actions to maximize long-term reward.
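To make the objects above concrete, here is a minimal sketch (toy sizes and random numbers are assumptions, not taken from the talk) of an MDP stored as explicit arrays, together with one simulated agent-environment step.

```python
import numpy as np

N, M = 5, 2                                    # |S| = N states, |A| = M actions
rng = np.random.default_rng(0)
P = rng.random((M, N, N))
P /= P.sum(axis=2, keepdims=True)              # P[a, s, s'] = P^a(s, s'), rows sum to 1
R = rng.standard_normal((M, N, N))             # R[a, s, s'] = R^a(s, s')

s, a = 0, 1                                    # current state s_t and chosen action a_t
s_next = rng.choice(N, p=P[a, s])              # environment draws s_{t+1} ~ P^a(s, .)
r = R[a, s, s_next]                            # reward r_t for the observed transition
```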

SLIDE 5

Reinforcement Learning II

Criterion: infinite-horizon expected total discounted reward. How do we get there?

Policy (deterministic, stationary): π : S → A

Value function (γ = discount rate):

V^π(s) = E_π{ Σ_{k=0}^∞ γ^k r_k | s_0 = s }, ∀s

Bellman says:

V^π(s) = Σ_{s′} P^{π(s)}(s, s′) [ R^{π(s)}(s, s′) + γ V^π(s′) ], ∀s

Goal: the optimal policy π* = argmax_π V^π and the optimal value function V*(s) = max_π V^π(s), ∀s.

Many ways to solve it: methods based on Policy Iteration (e.g. Optimistic PI, Actor-Critic) and methods based on Value Iteration (e.g. Q-learning).
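Since the Bellman equation above is linear in V^π once π is fixed, a small MDP can be evaluated exactly by solving the corresponding linear system. The sketch below is a hedged illustration on an assumed toy MDP, not the authors' code.

```python
import numpy as np

def evaluate_policy(P, R, pi, gamma=0.95):
    """P[a, s, s'] = P^a(s, s'), R[a, s, s'] = R^a(s, s'), pi[s] = action index."""
    N = P.shape[1]
    P_pi = P[pi, np.arange(N), :]                        # P^{pi(s)}(s, .)
    r_pi = (P_pi * R[pi, np.arange(N), :]).sum(axis=1)   # expected one-step reward
    # Bellman: V = r_pi + gamma * P_pi V  <=>  (I - gamma * P_pi) V = r_pi
    return np.linalg.solve(np.eye(N) - gamma * P_pi, r_pi)

rng = np.random.default_rng(0)
P = rng.random((2, 5, 5)); P /= P.sum(axis=2, keepdims=True)
R = rng.standard_normal((2, 5, 5))
V_pi = evaluate_policy(P, R, pi=np.zeros(5, dtype=int))  # value of "always action 0"
```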

SLIDE 6

Reinforcement Learning III

Two algorithms for policy evaluation:

Dynamic Programming style (model-based, use fixed policy π):

V_{t+1}(s) = V_t(s) + [ Σ_{s′} P^{π(s)}(s, s′) ( R^{π(s)}(s, s′) + γ V_t(s′) ) − V_t(s) ]

where the sum is the target.

Temporal-Difference style (model-free, use observed reward r_t and next state s′ obtained by following π):

V_{t+1}(s) = V_t(s) + α [ r_t + γ V_t(s′) − V_t(s) ]

where r_t + γ V_t(s′) is the target (an unbiased estimate).

Memory-based Function Approximation: basically, works by storing (state, value) samples in a list. Update: add a new instance whenever the current state is far from the rest, else adjust the target of a nearby state. Query: build a (local) approximation.
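A minimal TD(0) sketch for the temporal-difference update shown above (tabular values and an assumed toy transition list; the talk itself stores samples and fits a regressor instead of a table):

```python
import numpy as np

def td0(V, transitions, alpha=0.1, gamma=0.95):
    """V: array of value estimates; transitions: iterable of (s, r, s_next) under pi."""
    for s, r, s_next in transitions:
        target = r + gamma * V[s_next]          # unbiased sample of the backup
        V[s] += alpha * (target - V[s])         # move V(s) a step toward the target
    return V

V = np.zeros(5)
V = td0(V, [(0, 1.0, 1), (1, 0.0, 2), (2, -1.0, 0)])
```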

SLIDE 7

Recall SVR ...

Objective: given data {(x_i, y_i)}_{i=1}^ℓ, in ε-SVR we solve (bias absorbed):

min_{α, α* ∈ ℝ^ℓ}  (1/2) (α* − α)ᵀ K (α* − α) + ε (α* + α)ᵀ e − (α* − α)ᵀ y
s.t.  0 ≤ α, α* ≤ Ce

Final regressor:

f(x) = (α* − α)ᵀ k(x)

where k(·, ·) is a symmetric positive definite function (kernel), K ∈ ℝ^{ℓ×ℓ} is the kernel matrix with [K]_{ij} = k(x_i, x_j), k(x) ∈ ℝ^ℓ with k(x) = ( k(x_1, x), . . . , k(x_ℓ, x) )ᵀ, and C ∈ ℝ_{≥0} is the regularization parameter.

Our problem: complexity scales superlinearly with the number of data.
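The talk solves this QP with its own machinery; as a hedged, off-the-shelf stand-in, scikit-learn's SVR fits the same ε-insensitive model on toy data (the data and hyperparameters below are assumptions, not the talk's settings):

```python
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = rng.uniform(-10, 10, size=(200, 1))
y = np.sinc(X[:, 0] / np.pi) + 0.05 * rng.standard_normal(200)   # sin(x)/x plus noise

# RBF kernel k(x, x') = exp(-gamma * ||x - x'||^2); C and epsilon play the roles above
model = SVR(kernel="rbf", gamma=0.5, C=10.0, epsilon=0.01).fit(X, y)
print("support vectors:", model.support_vectors_.shape[0], "of", len(X))
```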

SLIDE 8

Support Vector Regression II

Recall the Representer Theorem: every solution f ∈ H (an RKHS) to

min_{f ∈ H}  (1/ℓ) Σ_i c(x_i, y_i, f(x_i)) + Λ ‖f‖_H

admits a representation

f(x) = Σ_{i=1}^ℓ β_i k(x_i, x)

⇒ the solution lies in a subspace spanned by the k(x_i, ·) (the data!).

Observation: K's eigenvalues decay rapidly; many of them are very small.

⇒ this subspace can be approximated by just picking some of the k(x_i, ·).

Goal: reduce the number of coefficients β_i that we have to determine.
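A quick numerical illustration of the observation about K (assumed toy data and kernel width): for an RBF kernel, the trace of K tends to concentrate in a few leading eigenvalues, so a low-dimensional subspace already captures most of it.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))
d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
K = np.exp(-d2 / (2 * 0.5 ** 2))                        # RBF kernel matrix

eig = np.sort(np.linalg.eigvalsh(K))[::-1]              # eigenvalues, largest first
print("share of trace in top 20 eigenvalues:", eig[:20].sum() / eig.sum())
```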

SLIDE 9

Eliminate linear dependence

Assume we have picked the first m samples (for convenience marked by ˜·). Approximate the remaining ℓ − m ones (in H):

min_{a_i ∈ ℝ^m}  ‖ k(x_i, ·) − Σ_{j=1}^m a_{ij} k(x̃_j, ·) ‖²_H ,   i = m + 1, . . . , ℓ

. . . we obtain the coefficients:

a_i = K̃⁻¹ k̃(x_i)

where K̃ ∈ ℝ^{m×m} is the reduced kernel matrix with [K̃]_{ij} = k(x̃_i, x̃_j), and k̃(x_i) ∈ ℝ^m with k̃(x_i) = ( k(x̃_1, x_i), . . . , k(x̃_m, x_i) )ᵀ.

Define A ∈ ℝ^{ℓ×m} to be the matrix consisting of the rows a_iᵀ. Then

K ≈ A K̃ Aᵀ

Goal: we want to use the much smaller K̃ instead of the big K in our QP . . .
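A numpy sketch of this projection (assumed toy data and kernel width): row i of A holds a_i = K̃⁻¹ k̃(x_i), and the product A K̃ Aᵀ is then checked against the full K.

```python
import numpy as np

def rbf(Xa, Xb, sigma=0.5):
    d2 = ((Xa[:, None, :] - Xb[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * sigma ** 2))

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(100, 2))
Xr = X[:15]                                   # the first m samples x~_j

K     = rbf(X, X)                             # full l x l kernel matrix
K_red = rbf(Xr, Xr)                           # reduced m x m matrix K~
A     = rbf(X, Xr) @ np.linalg.inv(K_red)     # row i is a_i = K~^{-1} k~(x_i)

print("max |K - A K~ A^T|:", np.abs(K - A @ K_red @ A.T).max())
```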

SLIDE 10

Define a reduced problem

Consider: reduced variables α̃ = Aᵀα, α̃* = Aᵀα* (each in ℝ^m) and transformed target values ỹ = A†y.

Solving the QP in the reduced variables α̃, α̃* with the reduced set {(x̃_i, ỹ_i)}_{i=1}^m, we obtain the solution to the reduced problem

f̃(·) = Σ_{i=1}^m (α̃*_i − α̃_i) k(x̃_i, ·) = Σ_{i=1}^ℓ (α*_i − α_i) Σ_{j=1}^m a_{ij} k(x̃_j, ·) ≈ Σ_{i=1}^ℓ (α*_i − α_i) k(x_i, ·) = f(·)

which is approximately the one we would have obtained from the full problem.

Consequence: instead of {(x_i, y_i)}_{i=1}^ℓ, use the reduced data {(x̃_i, ỹ_i)}_{i=1}^m (usually m ≪ ℓ).
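A self-contained, hedged sketch of the reduced fit (toy targets assumed): form ỹ = A†y, determine only m coefficients on the reduced set, and evaluate the resulting f̃ on the full data. A plain kernel ridge solve stands in for the reduced SVR here.

```python
import numpy as np

def rbf(Xa, Xb, sigma=0.5):
    d2 = ((Xa[:, None, :] - Xb[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * sigma ** 2))

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))
y = np.sin(3 * X[:, 0]) * np.cos(2 * X[:, 1])           # toy targets y_i
Xr = X[:20]                                             # reduced inputs x~_j

K_red = rbf(Xr, Xr)                                     # K~
A = rbf(X, Xr) @ np.linalg.inv(K_red)                   # rows a_i = K~^{-1} k~(x_i)
y_red = np.linalg.pinv(A) @ y                           # transformed targets y~ = A^+ y

beta = np.linalg.solve(K_red + 1e-8 * np.eye(len(Xr)), y_red)   # fit on the reduced set
f_tilde = rbf(X, Xr) @ beta                             # f~(x) = sum_j beta_j k(x~_j, x)
print("max |f~ - y| over the full set:", np.abs(f_tilde - y).max())
```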

SLIDE 11

How do we obtain the reduced set?

Goal: build {(x̃_i, ỹ_i)}_{i=1}^m in an on-line fashion (adapted from Engel et al. (2002)).

Parameter: choose TOL (approximation precision).

Bookkeeping: need {(x̃_i, ỹ_i)}_{i=1}^m, K̃⁻¹, (AᵀA)⁻¹, Aᵀy.

Start with an empty basis.

LOOP
  Get current sample (x_t, y_t). Compute distance d_t to the span of the current basis.
  IF d_t < TOL
    then k(x_t, ·) is approximated well enough; the size of the basis is unchanged. Recursively update (AᵀA)⁻¹, Aᵀy.
  ELSE
    Add x_t to the basis. Recursively update K̃⁻¹. Append to (AᵀA)⁻¹, Aᵀy.

How does it scale for large sample sizes? Efficient: memory and computational complexity is O(m²), and m is asymptotically independent of the total number of data.
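A hedged sketch of the selection loop above (in the spirit of Engel et al.'s approximate linear dependence test; the names and the brute-force K̃⁻¹ refresh are illustrative, and the slide's recursive (AᵀA)⁻¹, Aᵀy bookkeeping is omitted):

```python
import numpy as np

def rbf(a, b, sigma=0.5):
    return np.exp(-np.sum((a - b) ** 2) / (2 * sigma ** 2))

def build_dictionary(samples, sigma=0.5, tol=1e-3):
    basis, K_inv = [], None                    # the reduced set {x~_j} and K~^{-1}
    for x in samples:
        if not basis:
            basis.append(x)
            K_inv = np.array([[1.0 / rbf(x, x, sigma)]])
            continue
        k_vec = np.array([rbf(xb, x, sigma) for xb in basis])
        a = K_inv @ k_vec                      # projection coefficients a_t
        d = rbf(x, x, sigma) - k_vec @ a       # squared distance d_t to span of basis
        if d > tol:                            # ELSE branch: grow the basis
            basis.append(x)
            K = np.array([[rbf(bi, bj, sigma) for bj in basis] for bi in basis])
            K_inv = np.linalg.inv(K + 1e-10 * np.eye(len(basis)))  # rank-one update in practice
        # IF branch: basis unchanged, only the remaining bookkeeping would be updated
    return basis

rng = np.random.default_rng(0)
dictionary = build_dictionary(rng.uniform(-10, 10, size=(500, 2)))
print(len(dictionary), "basis points kept out of 500 samples")
```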

SLIDE 12

Toy Example: Sombrero

Approximating: sin x / x on x ∈ [−10, 10]². Training: randomly drawn samples, RBF kernel.
[Figure: sparse approximations of the sombrero function for two kernel widths: Sigma 0.2 with TOL 1e-6, and Sigma 0.04 with TOL 1e-6.]

SLIDE 13

Putting everything together ...

[Diagram: the RL component produces a list of states and values (s_i, Ṽ_t(s_i)) = {(x_i, y_i)}_{i=1}^ℓ (a very big list); sparse greedy approximation (efficient) reduces it to a sparsified training set {(x̃_i, ỹ_i)}_{i=1}^m (a very small list); the SVR is updated (add) on this set, giving the sparse regressor Ṽ_t(·) = Σ_i β_i k(x̃_i, ·).]
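A schematic, hedged sketch of this loop (all names are assumptions and the sparsifier is only a placeholder; it is not the authors' implementation): the RL side produces (state, target) pairs, the training set is reduced, and an SVR fitted on the reduced set becomes the next value estimate Ṽ.

```python
import numpy as np
from sklearn.svm import SVR

def sparsify(states, targets, m=100):
    # placeholder for the on-line selection and the y~ = A^+ y transformation;
    # here we simply keep the first m pairs
    return states[:m], targets[:m]

def value_update(transitions, V_old, gamma=0.99, sigma=0.5):
    """transitions: list of (s, r, s_next) with states as 1-D numpy arrays."""
    states  = np.array([s for s, _, _ in transitions])
    nexts   = np.array([sn for _, _, sn in transitions])
    rewards = np.array([r for _, r, _ in transitions])
    targets = rewards + gamma * V_old(nexts)              # TD-style regression targets
    X_red, y_red = sparsify(states, targets)
    svr = SVR(kernel="rbf", gamma=1.0 / (2 * sigma ** 2), C=100.0, epsilon=0.01)
    svr.fit(X_red, y_red)
    return svr.predict                                    # the new value estimate V~(.)
```

Starting from V_old = lambda X: np.zeros(len(X)) and calling value_update after each batch of experience is one plausible reading of the diagram, not a statement of the talk's exact procedure.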

SLIDE 14

Experiment 1: Gridworld

Goal: test approximation quality in on-line RL (model-based).

SLIDE 15

Experiment 2a: Mountain Car

Goal: test approximation quality in on-line RL (model-based).
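The talk does not restate the task, so for reference here is the standard mountain-car dynamics with the usual constants from Sutton and Barto (a reward of −1 per step is the customary choice; none of this is quoted from the slides):

```python
import numpy as np

def mountain_car_step(pos, vel, action):
    """One step of the standard mountain-car dynamics; action is -1, 0 or +1."""
    vel = np.clip(vel + 0.001 * action - 0.0025 * np.cos(3 * pos), -0.07, 0.07)
    pos = np.clip(pos + vel, -1.2, 0.5)
    if pos <= -1.2:                       # inelastic collision with the left wall
        vel = 0.0
    reward = -1.0                         # -1 per step until the goal is reached
    done = pos >= 0.5                     # goal at the top of the right hill
    return pos, vel, reward, done
```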

SLIDE 16

Experiment 2b: Mountain Car

Goal: compare performance with tile coding (CMAC, 10 × 10 × 10).

[Figure: Time-to-goal, CMAC vs. SVR: smoothed time-to-goal over 1000 trials for CMAC and for SVR with sigma 0.20, 0.10 and 0.05, with the optimal level marked.]

[Figure: Greedy policy, CMAC vs. SVR: smoothed reward of the greedy policy over 1000 trials for the same methods, with the optimal level marked.]

[Figure: Path of the greedy policy in the position-velocity plane for kernel widths 0.2 and 0.05, each compared with the DP solution.]

SLIDE 17

Conclusions

Summary: SVR with on-line RL is made possible by
1. memorizing states / values as in instance-based architectures (on-line),
2. building a sparsified training set (on-line),
3. solving a reduced problem.

Future work and some ideas: other learning mechanisms, e.g. policy iteration (batch updates to the value function); more difficult tasks; minor (and major!) algorithmic improvements. The sparsified training set could also be used in regularization networks or for placement of basis functions in RBF networks. Convergence?
