

SLIDE 1

Efficient Determination of the Hyperparameter via L-curve in Large Scale Least Squares and Total Least Squares Problems

Jörg Lampe

joerg.lampe@tuhh.de

Hamburg University of Technology Institute of Numerical Simulation

Joint work with Heinrich Voß

8th GAMM Workshop on Applied and Numerical Linear Algebra

TUHH

Jörg Lampe RTLS GAMM Workshop ’08 1 / 27

SLIDE 2

Outline

1. Background
   - LS and TLS problems
   - RLS and RTLS problems
2. Solving RLS and RTLS problems
   - Solving the RLS problem via a QEP
   - Solving the RTLS problem via QEPs
3. Determining the hyperparameter
   - L-curve
   - Numerical examples
4. Conclusions


SLIDE 4

Background: LS and TLS problems

Least Squares

Consider the overdetermined linear system Ax ≈ b, A ∈ R^{m×n}, b ∈ R^m, m ≥ n, with A and b contaminated by noise.

Least Squares (LS) approach:

‖Ax − b‖² = min!

or equivalently

‖∆b‖² = min!  subject to  Ax = b + ∆b.

Since A is noisy as well, the Total Least Squares (TLS) approach is suitable:

‖[∆A, ∆b]‖²_F = min!  subject to  (A + ∆A)x = b + ∆b.


SLIDE 6

Background: LS and TLS problems

Total Least Squares

With the singular value decomposition of [A, b],

[A, b] = UΣVᵀ,  Σ = diag{σ_1, . . . , σ_{n+1}},

and σ′_1 ≥ · · · ≥ σ′_n the singular values of A:

Lemma (Golub, van Loan 1980). If σ_{n+1}([A, b]) < σ′_n(A) holds, a unique TLS solution exists.

Closed-form solution of the TLS problem:

x_TLS = (AᵀA − σ²_{n+1} I)⁻¹ Aᵀb,

or equivalently

[x_TLS; −1] = −(1 / v_{n+1,n+1}) v_{n+1},

i.e. the last right singular vector v_{n+1} of [A, b], scaled so that its last component v_{n+1,n+1} becomes −1.
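The two equivalent formulas can be checked numerically. A minimal sketch with a randomly generated test problem (the sizes and noise level are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 50, 8
A = rng.standard_normal((m, n))
x_exact = rng.standard_normal(n)
b = A @ x_exact + 0.01 * rng.standard_normal(m)   # noise in b
A = A + 0.01 * rng.standard_normal((m, n))        # noise in A as well

# SVD of the augmented matrix [A, b]
U, S, Vt = np.linalg.svd(np.hstack([A, b[:, None]]))
sigma_last = S[-1]                                # sigma_{n+1}([A, b])

# uniqueness condition of Golub/van Loan: sigma_{n+1}([A,b]) < sigma'_n(A)
assert sigma_last < np.linalg.svd(A, compute_uv=False)[-1]

# closed form: x_TLS = (A^T A - sigma_{n+1}^2 I)^{-1} A^T b
x_tls = np.linalg.solve(A.T @ A - sigma_last**2 * np.eye(n), A.T @ b)

# equivalent form: scale the last right singular vector so its last entry is -1
v = Vt[-1]
x_tls_svd = -v[:n] / v[n]

assert np.allclose(x_tls, x_tls_svd)
```

Both routes agree to machine precision whenever the lemma's condition holds, which is the generic case for noisy data.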


SLIDE 9

Background: RLS and RTLS problems

Regularized (Total) Least Squares

For ill-conditioned LS and TLS problems regularization is necessary: apply Tikhonov regularization with hyperparameter λ, or add a quadratic constraint with hyperparameter δ.

‖∆b‖² = min!  subject to  Ax = b + ∆b,  ‖Lx‖ ≤ δ    (RLS)
‖[∆A, ∆b]‖²_F = min!  subject to  (A + ∆A)x = b + ∆b,  ‖Lx‖ ≤ δ    (RTLS)

with δ > 0 and L ∈ R^{k×n}, k ≤ n, defining a seminorm on the solution.

Lemma (Beck, Ben-Tal 2006). Let K be an orthonormal basis of ker(L). If σ_min([AK, b]) < σ_min(AK) holds, a solution of the RTLS problem exists. A solution of the RLS problem always exists.

Assume the quadratic constraint is active: δ < ‖Lx_LS‖ and δ < ‖Lx_TLS‖, respectively. Hence ‖Lx‖ ≤ δ is replaced by ‖Lx‖ = δ.
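The solvability condition of the lemma is straightforward to check numerically: build an orthonormal basis K of ker(L) and compare the smallest singular values of [AK, b] and AK. A small sketch; the problem data and the first-difference operator L are illustrative assumptions, and scipy's null_space supplies K:

```python
import numpy as np
from scipy.linalg import null_space

rng = np.random.default_rng(1)
m, n = 40, 10
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)

# L: (n-1) x n discrete first-order derivative operator, ker(L) = span{(1,...,1)}
L = np.diff(np.eye(n), axis=0)      # rows (..., -1, 1, ...)

K = null_space(L)                   # orthonormal basis of ker(L), here n x 1
assert np.allclose(L @ K, 0) and np.allclose(K.T @ K, np.eye(K.shape[1]))

AK = A @ K
s_AK = np.linalg.svd(AK, compute_uv=False)
s_AKb = np.linalg.svd(np.hstack([AK, b[:, None]]), compute_uv=False)

# Beck/Ben-Tal condition: sigma_min([AK, b]) < sigma_min(AK)
assert s_AKb[-1] < s_AK[-1]         # an RTLS solution exists for this data
```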


SLIDE 12

Solving RLS and RTLS problems: Solving the RLS problem via a QEP

Regularized Least Squares

How to solve a quadratically constrained least squares problem? One possibility is via a single QEP (another is LSTRS).

Consider the Lagrangian

L(x, µ) = ‖Ax − b‖² + µ(‖Lx‖² − δ²)

with first-order optimality conditions

2(AᵀA + µLᵀL)x = 2Aᵀb,  ‖Lx‖² = δ².

Lemma (Gander 1981). Choose the largest value of µ to obtain the RLS solution.

Assume L is square and nonsingular. Substitute z := Lx, W := L⁻ᵀAᵀAL⁻¹ and h := L⁻ᵀAᵀb:

Wz + µz = h,  zᵀz = δ².
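For a fixed multiplier µ the substitution can be verified directly: the stationarity equation (AᵀA + µLᵀL)x = Aᵀb maps to Wz + µz = h. A quick sanity check with an arbitrary µ and an assumed nonsingular bidiagonal L:

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 30, 6
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
L = np.eye(n) - np.diag(np.ones(n - 1), 1)   # square, nonsingular bidiagonal
mu = 0.7                                     # arbitrary fixed multiplier

# stationarity equation in x
x = np.linalg.solve(A.T @ A + mu * (L.T @ L), A.T @ b)

# substituted quantities
Linv = np.linalg.inv(L)
W = Linv.T @ (A.T @ A) @ Linv                # W = L^{-T} A^T A L^{-1}
h = Linv.T @ (A.T @ b)                       # h = L^{-T} A^T b
z = L @ x                                    # z = L x

# the substituted equation W z + mu z = h holds for the same x
assert np.allclose(W @ z + mu * z, h)
```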


SLIDE 14

Solving RLS and RTLS problems: Solving the RLS problem via a QEP

Connection to a QEP

Remark: If rank(L) = k < n, bases for the range and kernel are needed.

Denoting u := (W + µI)⁻²h, we get hᵀu = zᵀz = δ², hence h = δ⁻²hhᵀu and

(W + µI)²u − δ⁻²hhᵀu = 0.    (Gander, Golub, von Matt 1989)

Reconstruct x_RLS from the rightmost eigenpair (µ̂, û): scale ũ = δ² û / (hᵀû), then

x_RLS = L⁻¹(W + µ̂I)ũ.

- The same idea can be carried over to RTLS problems.
- In general there exists no closed-form solution for x_RTLS.
- This leads to a convergent sequence of QEPs.
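The whole RLS-via-QEP route can be sketched compactly. In this toy version (random data; L = I for simplicity, so W = AᵀA and h = Aᵀb) the QEP is solved by dense companion linearization, which is fine at this size even though the talk's point is that the structure admits better large-scale methods:

```python
import numpy as np

rng = np.random.default_rng(3)
m, n = 30, 6
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
L = np.eye(n)
x_ls = np.linalg.lstsq(A, b, rcond=None)[0]
delta = 0.5 * np.linalg.norm(L @ x_ls)       # delta < ||L x_LS||: active constraint

W = A.T @ A                                  # = L^{-T} A^T A L^{-1} for L = I
h = A.T @ b                                  # = L^{-T} A^T b

# QEP (W + mu I)^2 u - delta^{-2} h h^T u = 0, i.e.
# mu^2 u + mu (2W) u + (W^2 - delta^{-2} h h^T) u = 0
A0 = W @ W - np.outer(h, h) / delta**2
C = np.block([[np.zeros((n, n)), np.eye(n)],
              [-A0, -2.0 * W]])              # companion linearization
evals, evecs = np.linalg.eig(C)
j = np.argmax(evals.real)                    # rightmost eigenvalue (real in theory)
mu_hat = evals[j].real
u_hat = evecs[:n, j]
u_hat = (u_hat / u_hat[np.argmax(np.abs(u_hat))]).real  # fix the complex phase

u_t = delta**2 * u_hat / (h @ u_hat)         # scaling from the slide
x_rls = np.linalg.solve(L, (W + mu_hat * np.eye(n)) @ u_t)

assert mu_hat > 0                            # active constraint => positive multiplier
assert abs(np.linalg.norm(L @ x_rls) - delta) < 1e-6
assert np.allclose((A.T @ A + mu_hat * L.T @ L) @ x_rls, A.T @ b, atol=1e-6)
```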


SLIDE 17

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Regularized Total Least Squares

Assume x_RTLS exists and the constraint is active; then (RTLS) is equivalent to

f(x) := ‖Ax − b‖² / (1 + ‖x‖²) = min!  subject to  ‖Lx‖² = δ².

Consider the Lagrangian again,

L(x, µ) = ‖Ax − b‖² / (1 + ‖x‖²) + µ(‖Lx‖² − δ²).

The first-order optimality conditions are equivalent to

(AᵀA + λ_I I + λ_L LᵀL)x = Aᵀb,  µ ≥ 0,  ‖Lx‖² = δ²

with

λ_I = −‖Ax − b‖² / (1 + ‖x‖²),  λ_L = µ(1 + ‖x‖²),  µ = (bᵀ(b − Ax) + λ_I) / (δ²(1 + ‖x‖²)).

SLIDE 18

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Fixed-point Iteration for RTLS

Look at the nonlinear first-order equation again:

(AᵀA + λ_I(x) I + λ_L(x) LᵀL)x = Aᵀb.

What about the iteration with the maximal real Lagrange parameter in each step:

(AᵀA − f(x_k)I)x_{k+1} + λ LᵀL x_{k+1} = Aᵀb,  ‖Lx_{k+1}‖² = δ² ?

Theorem (Lampe, Voß 2007). Any limit point of the sequence {x_k} constructed by the fixed-point algorithm above is a global minimizer of

f(x) = ‖Ax − b‖² / (1 + ‖x‖²)  s.t.  ‖Lx‖² = δ²

(under a mild condition on the starting vector x_0).


SLIDE 20

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Connection to QEPs

Assume L is square and nonsingular. Substitute z := Lx_{k+1}, W_k := L⁻ᵀ(AᵀA − f(x_k)I)L⁻¹ and h := L⁻ᵀAᵀb:

W_k z + λz = h,  zᵀz = δ².

Denoting again u := (W_k + λI)⁻²h, we get hᵀu = zᵀz = δ², hence h = δ⁻²hhᵀu and

(W_k + λI)²u − δ⁻²hhᵀu = 0.    (QEPs)

Reconstruct x_{k+1} from the rightmost eigenpair (λ̂, û): with ũ = δ² û / (hᵀû) it holds that

x_{k+1} = L⁻¹(W_k + λ̂I)ũ.

Remark: If the positive semidefinite matrix (W_k + λ̂I) is singular, the rightmost eigenpair might not contain the solution of the first-order conditions. An additional linear system has to be solved to cope with nonunique solutions; this is not the generic case.


SLIDE 22

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Algorithm for solving RTLS

With T_k(λ) = (W_k + λI)² − δ⁻²hhᵀ:

Algorithm 1: RTLSQEP [Sima/van Huffel/Golub 2005]
1: Let x_0 be an initial guess. Compute f(x_0) = ‖Ax_0 − b‖² / (1 + ‖x_0‖²).
2: Set k = 1.
3: while not converged do
4:   Solve T_k(λ)u = 0 for the eigenpair (u_k, λ_k) corresponding to the rightmost λ_k.
5:   Scale ũ = δ² u_k / (hᵀu_k).
6:   Compute x_{k+1} = L⁻¹(W_k + λ_k I)ũ and f(x_{k+1}) = ‖Ax_{k+1} − b‖² / (1 + ‖x_{k+1}‖²).
7:   Set k = k + 1.
8: end while
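Algorithm 1 can be sketched end to end in a few lines. This toy version (random data, L = I, each QEP solved by dense companion linearization rather than an iterative projection method) illustrates the fixed-point structure, not the large-scale machinery:

```python
import numpy as np

def rightmost_qep(W, h, delta):
    """Rightmost eigenpair of (W + lam I)^2 u = delta^{-2} h h^T u via linearization."""
    n = W.shape[0]
    A0 = W @ W - np.outer(h, h) / delta**2
    C = np.block([[np.zeros((n, n)), np.eye(n)], [-A0, -2.0 * W]])
    evals, evecs = np.linalg.eig(C)
    j = np.argmax(evals.real)
    u = evecs[:n, j]
    return evals[j].real, (u / u[np.argmax(np.abs(u))]).real

rng = np.random.default_rng(4)
m, n = 40, 8
A = rng.standard_normal((m, n))
b = A @ rng.standard_normal(n) + 0.05 * rng.standard_normal(m)
L = np.eye(n)
x = np.linalg.lstsq(A, b, rcond=None)[0]      # starting vector x_0 = x_LS
delta = 0.8 * np.linalg.norm(L @ x)           # delta < ||L x_LS||: active constraint

h = A.T @ b                                   # L^{-T} A^T b for L = I
I = np.eye(n)
for k in range(30):
    f = np.linalg.norm(A @ x - b)**2 / (1 + np.linalg.norm(x)**2)
    Wk = A.T @ A - f * I                      # L^{-T}(A^T A - f(x_k) I)L^{-1} for L = I
    lam, u = rightmost_qep(Wk, h, delta)
    u_t = delta**2 * u / (h @ u)              # scaling step of Algorithm 1
    x_new = np.linalg.solve(L, (Wk + lam * I) @ u_t)
    done = np.linalg.norm(x_new - x) <= 1e-12 * np.linalg.norm(x_new)
    x = x_new
    if done:
        break

# every iterate satisfies the quadratic constraint by construction
assert abs(np.linalg.norm(L @ x) - delta) < 1e-6
```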

SLIDE 23

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Solving the QEPs

The RTLSQEP algorithm contains a sequence of quadratic eigenproblems

T_k(λ)u = (W_k + λI)²u − δ⁻²hhᵀu = 0,  for k = 0, 1, . . .

- The sequence of QEPs converges → use previously gained information.
- Linearization to an EVP of double size is not appropriate.
- Second-order Krylov methods use one starting vector for each QEP, i.e. the solution vector of the previous QEP (SOAR, Li/Ye).
- Can we use more information than only one vector? Yes: a method that can make use of all previous information by performing thick starts is the Nonlinear Arnoldi method.


SLIDE 25

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Nonlinear Arnoldi

With T_k(λ) = (W_k + λI)² − δ⁻²hhᵀ:

Algorithm 2: Nonlinear Arnoldi [Voß 2003]
1: Start with an initial basis V, VᵀV = I.
2: Determine a preconditioner M ≈ T(σ)⁻¹, σ close to the wanted eigenvalue.
3: Find the rightmost eigenvalue λ of VᵀT_k(λ)Vy = 0 and the corresponding eigenvector y.
4: Set u = Vy, r = T_k(λ)u.
5: while ‖r‖/‖u‖ > ε do
6:   v = Mr
7:   v = v − VVᵀv
8:   ṽ = v/‖v‖, V = [V, ṽ]
9:   Find the rightmost eigenvalue λ of VᵀT_k(λ)Vy = 0 and the corresponding eigenvector y.
10:  Set u = Vy, r = T_k(λ)u.
11: end while
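A dense toy implementation of Algorithm 2 for a single QEP T(λ) = (W + λI)² − δ⁻²hhᵀ, taking M = I as advocated on the next slide; the projected QEP of dimension dim(V) is solved by companion linearization, and W, h, δ are randomly generated assumptions:

```python
import numpy as np

rng = np.random.default_rng(5)
n = 30
Q = np.linalg.qr(rng.standard_normal((n, n)))[0]
W = Q @ np.diag(rng.uniform(0.1, 5.0, n)) @ Q.T     # symmetric test matrix
h = rng.standard_normal(n)
delta = 0.3

def T_apply(lam, u):                                # residual r = T(lam) u
    return W @ (W @ u) + 2 * lam * (W @ u) + lam**2 * u - (h @ u) / delta**2 * h

def projected_rightmost(V, WV, Vh):
    """Rightmost eigenpair of V^T T(lam) V y = 0 via companion linearization."""
    d = V.shape[1]
    B0 = WV.T @ WV - np.outer(Vh, Vh) / delta**2    # V^T W^2 V - delta^-2 (V^T h)(V^T h)^T
    B1 = 2.0 * (V.T @ WV)                           # 2 V^T W V
    C = np.block([[np.zeros((d, d)), np.eye(d)], [-B0, -B1]])
    evals, evecs = np.linalg.eig(C)
    j = np.argmax(evals.real)
    y = evecs[:d, j]
    return evals[j].real, (y / y[np.argmax(np.abs(y))]).real

V = (h / np.linalg.norm(h))[:, None]                # initial basis
WV = W @ V                                          # stored product, grown columnwise
while True:
    lam, y = projected_rightmost(V, WV, V.T @ h)
    u = V @ y
    r = T_apply(lam, u)
    if np.linalg.norm(r) <= 1e-10 * np.linalg.norm(u) or V.shape[1] == n:
        break
    v = r - V @ (V.T @ r)                           # M = I: expand with the residual
    v = v - V @ (V.T @ v)                           # reorthogonalize once
    v = v / np.linalg.norm(v)
    V = np.hstack([V, v[:, None]])
    WV = np.hstack([WV, (W @ v)[:, None]])

assert np.linalg.norm(T_apply(lam, u)) <= 1e-6 * np.linalg.norm(u)
```

Note how the projected problem only needs V, WV and Vᵀh, exactly as the next slide points out.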

SLIDE 26

Solving RLS and RTLS problems: Solving the RTLS problem via QEPs

Comments on Nonlinear Arnoldi

Main advantage: when solving T_k(λ)u = 0 in step k, start with the complete search space V from the preceding steps.

- No preconditioner is needed, i.e. M = I.
- Projected problems can be updated very cheaply:

VᵀT_k(λ)Vy = ((W_k + λI)V)ᵀ((W_k + λI)V)y − δ⁻²(Vᵀh)(Vᵀh)ᵀy = 0.

Only V, W_k V and Vᵀh are needed. A closer look shows

W_k = L⁻ᵀ(AᵀA − f(x_k)I)L⁻¹ = L⁻ᵀAᵀAL⁻¹ − f(x_k)L⁻ᵀL⁻¹,

so the main part of W_k does not change within the sequence of QEPs. Simply store L⁻ᵀAᵀAL⁻¹V, L⁻ᵀL⁻¹V, Vᵀh and V for all QEPs: just append one column in every inner iteration step; no matrix-matrix multiplication is performed.
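The bookkeeping described above can be mimicked in a few lines: keep the products of the two fixed matrices with V, append one column per new basis vector, and assemble W_k V for any shift f(x_k) with a single axpy. Matrix sizes and the value of f(x_k) are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(6)
m, n = 40, 12
A = rng.standard_normal((m, n))
L = np.eye(n) - np.diag(np.ones(n - 1), 1)   # square, nonsingular bidiagonal
Linv = np.linalg.inv(L)
G1 = Linv.T @ (A.T @ A) @ Linv               # fixed part L^{-T} A^T A L^{-1}
G2 = Linv.T @ Linv                           # fixed part L^{-T} L^{-1}

V = np.linalg.qr(rng.standard_normal((n, 3)))[0]   # current search space
G1V, G2V = G1 @ V, G2 @ V                    # stored products

# a new basis vector arrives: append one column to each stored product
v = rng.standard_normal(n)
v -= V @ (V.T @ v)
v /= np.linalg.norm(v)
V = np.hstack([V, v[:, None]])
G1V = np.hstack([G1V, (G1 @ v)[:, None]])
G2V = np.hstack([G2V, (G2 @ v)[:, None]])

# W_k V for any new shift f(x_k): one axpy, no matrix-matrix product
f_k = 0.37                                   # placeholder value of f(x_k)
WkV = G1V - f_k * G2V
assert np.allclose(WkV, (G1 - f_k * G2) @ V)
```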


SLIDE 28

Determining the hyperparameter: L-curve

L-curve

How to determine the hyperparameter δ in the constraint condition ‖Lx‖ ≤ δ?
Several methods are available: discrepancy principle, cross validation, information criteria, L-curve.

Idea of the L-curve: developed to balance ‖Ax_λ − b‖² and ‖Lx_λ‖ in the Tikhonov problem

‖Ax − b‖² + λ‖Lx‖² = min_x!

It can be extended to f(x_δ) = ‖Ax_δ − b‖² / (1 + ‖x_δ‖²) and δ = ‖Lx_δ‖.

Choose a set of δ_i, i = 1, . . . , and solve one RLS/RTLS problem for each δ_i.

Advantage when using the Nonlinear Arnoldi method:
- RLS: the search space V is reused during the sequence of QEPs.
- RTLS: the search space V is reused during the sequence of sequences of QEPs.
If the search space grows too large, include a restart strategy.
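The Tikhonov version of the L-curve can be sketched quickly. The synthetic ill-posed problem, the λ grid and the curvature-based corner detection below are all illustrative assumptions (L = I, so the SVD filter-factor form of the Tikhonov solution applies):

```python
import numpy as np

rng = np.random.default_rng(7)
m, n = 60, 20
# synthetic ill-posed problem: A with geometrically decaying singular values
Um = np.linalg.qr(rng.standard_normal((m, n)))[0]
Vn = np.linalg.qr(rng.standard_normal((n, n)))[0]
s = np.logspace(0, -6, n)
A = Um @ np.diag(s) @ Vn.T
x_true = Vn @ (1.0 / (1.0 + np.arange(n)))       # smooth-ish exact solution
b = A @ x_true + 1e-4 * rng.standard_normal(m)

lams = np.logspace(-10, 1, 50)
beta = Um.T @ b
rho, eta = [], []
for lam in lams:
    x_lam = Vn @ (s * beta / (s**2 + lam))       # Tikhonov filter factors (L = I)
    rho.append(np.linalg.norm(A @ x_lam - b))    # residual norm
    eta.append(np.linalg.norm(x_lam))            # solution (semi)norm
rho, eta = np.array(rho), np.array(eta)

# corner = point of maximum curvature of the L-curve in log-log scale
lx, ly = np.log(rho), np.log(eta)
dx, dy = np.gradient(lx), np.gradient(ly)
ddx, ddy = np.gradient(dx), np.gradient(dy)
kappa = (dx * ddy - dy * ddx) / (dx**2 + dy**2)**1.5
lam_corner = lams[np.argmax(np.abs(kappa))]
```

For RLS/RTLS one would instead sweep δ_i and plot ‖Lx_δ‖ against f(x_δ), as in the examples that follow; the monotone trade-off between the two quantities is the same.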


SLIDE 32

Determining the hyperparameter: Numerical Examples

Example phillips (2000), RLS

- P. C. Hansen, discretized Fredholm integral equation of the first kind.
- L is the 1D discrete first-order derivative operator.
- Noise level: 20% of the average absolute value of [A, b].
- For the L-curve: δ_i = δ_true · (0.0001, . . . , 100) with δ_true = ‖Lx_true‖.

[Figure: Convergence history of RLS-QEP (exact L) for the different δ_i — QEP residual norm versus inner iterations, dropping to about 10⁻¹⁴ within roughly 70 inner iterations.]

The search space built up during the first QEP contains such good information that the following QEPs are solved in far fewer MatVecs.

SLIDE 33

Determining the hyperparameter: Numerical Examples

Example phillips (2000), RTLS

[Figure: Convergence history of RTLS-QEP (exact L) for the different δ_i — QEP residual norm versus inner iterations, dropping to about 10⁻¹⁶ within roughly 120 inner iterations.]

- A restart is performed if the dimension of the search space exceeds 45.
- Each RTLS problem is solved by very few QEPs.
- The search space built up during the first RTLS problem contains such good information that the following problems are solved within fewer MatVecs.

SLIDE 34

Determining the hyperparameter: Numerical Examples

Example phillips (2000), RTLS

δ_i = δ_true · (0.0001, . . . , 100), δ_true = ‖Lx_true‖, i = 1, . . . , 20

[Figure: L-curve of RTLS-QEP (exact L) — ‖Lx‖ versus f(x) = ‖Ax − b‖² / (1 + ‖x‖²), both axes logarithmic.]

- The L-curve of RTLS looks similar to the L-curve of RLS.
- 120 inner iterations for 20 RTLS problems (→ 500 MatVecs).
- 70 inner iterations for 20 RLS problems (→ 280 MatVecs).
- Computation time: 4.2 s for RTLS and 2.4 s for RLS.

SLIDE 35

Determining the hyperparameter: Numerical Examples

Example deriv2 (2000), RTLS

δ_i = δ_true · (0.0001, . . . , 100), δ_true = ‖Lx_true‖, i = 1, . . . , 20

[Figure: Convergence history of RTLS-QEP (exact L) for the different δ_i — QEP residual norm versus inner iterations, dropping to about 10⁻¹⁴ within roughly 90 inner iterations.]

- 90 inner iterations for 20 RTLS problems (→ 360 MatVecs).
- Computation time: 3 s (resp. 2.3 s for 20 RLS problems).

SLIDE 36

Determining the hyperparameter: Numerical Examples

Example deriv2 (2000), RTLS

δ_i = δ_true · (0.0001, . . . , 100), δ_true = ‖Lx_true‖, i = 1, . . . , 20

- 90 inner iterations for 20 RTLS problems (→ 360 MatVecs).
- Computation time: 3 s (resp. 2.3 s for 20 RLS problems).

[Figure: L-curve of RTLSQEP (exact L) — ‖Lx‖ versus f(x) = ‖Ax − b‖² / (1 + ‖x‖²), both axes logarithmic.]

SLIDE 37

Determining the hyperparameter: Numerical Examples

Example deriv2 (2000), RLS/RTLS

Regularization matrices: the (n−1) × n first-order difference operator

    L = [ −1  1          ]
        [     ⋱   ⋱      ]
        [         −1  1  ]

or its square, nonsingular modification with a small ε in the first row,

    L̃ = [  ε             ]
        [ −1  1          ]
        [     ⋱   ⋱      ]
        [         −1  1  ]

[Figure: Solution curves for RLS and RTLS — exact solution together with the computed solutions (legend: exact, RLS-QEP Lnoisy, RTLS-QEP Lnoisy, RLS-QEP Lexact, RTLS-QEP Lexact).]

- Not much difference between the RLS and the RTLS approach.
- Little difference between the approaches with the different regularization matrices L and L̃.

SLIDE 38

Conclusions

- RLS/RTLS problems can be solved efficiently by one QEP / a sequence of QEPs.
- Determine δ via the L-curve.
- Improvements of the RTLSQEP method:
  - Global convergence to the RTLS solution was proven.
  - The Nonlinear Arnoldi method uses all previous information.
  - Computational complexity O(n²); the number of MatVecs is smaller than n.

Remark: For RTLS problems there is also an approach where linear EVPs are solved instead of QEPs [Renaut/Guo 2005]. When solving a sequence of EVPs via the Nonlinear Arnoldi method for several δ_i (→ L-curve), similar advantages from reusing the search space V are observable. The same can be seen for RLS problems when a single problem is solved by a sequence of EVPs (→ LSTRS), again using the Nonlinear Arnoldi method.
