Least Weighted Absolute Value Estimator with an Application to Investment Data (PowerPoint presentation)


SLIDE 1

Least Weighted Absolute Value Estimator with an Application to Investment Data

Petra Vidnerová, Jan Kalina
The Czech Academy of Sciences, Institute of Computer Science, Prague

Vidnerová & Kalina 1/12

SLIDE 2

Least squares vs. robust regression


SLIDE 3

Linear regression model

Model
\[ Y_i = \beta_0 + \beta_1 X_{i1} + \cdots + \beta_p X_{ip} + e_i, \quad i = 1, \ldots, n, \]
\[ \operatorname{var} e_1 = \cdots = \operatorname{var} e_n = \sigma^2 \ \text{(nuisance parameter)}, \]
\[ X = \begin{pmatrix} X_{11} & X_{12} & \ldots & X_{1p} \\ X_{21} & X_{22} & \ldots & X_{2p} \\ \vdots & \vdots & \ddots & \vdots \\ X_{n1} & X_{n2} & \ldots & X_{np} \end{pmatrix}. \]

Least squares: minimize
\[ \sum_{i=1}^{n} (Y_i - b_0 - b_1 X_{i1} - \cdots - b_p X_{ip})^2 \]
over \( (b_0, \ldots, b_p)^T \in \mathbb{R}^{p+1} \), which yields
\[ b^{LS} = (X^T X)^{-1} X^T Y, \qquad \operatorname{var}\, b^{LS} = \sigma^2 (X^T X)^{-1}. \]
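As a sanity check, the closed-form solution can be reproduced numerically. A minimal NumPy sketch on simulated data (the function name `least_squares` and the simulated inputs are illustrative, not from the slides):

```python
import numpy as np

def least_squares(X, Y):
    """Closed-form LS fit: bLS = (X^T X)^{-1} X^T Y, with an intercept column."""
    X1 = np.column_stack([np.ones(len(Y)), X])   # prepend 1s so b[0] plays the role of b0
    XtX_inv = np.linalg.inv(X1.T @ X1)
    b = XtX_inv @ X1.T @ Y
    return b, XtX_inv                            # var bLS = sigma^2 * XtX_inv

# Simulated data Y = 1 + 2 X + small noise; the fit should recover (1, 2).
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 1))
Y = 1.0 + 2.0 * X[:, 0] + 0.1 * rng.normal(size=50)
b, XtX_inv = least_squares(X, Y)
```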


SLIDE 4

Least weighted squares estimator (LWS)

Linear regression model: \( Y_i = \beta_0 + \beta_1 X_{i1} + \cdots + \beta_p X_{ip} + e_i, \ i = 1, \ldots, n. \)

Residuals for a fixed value of \( b = (b_0, b_1, \ldots, b_p)^T \in \mathbb{R}^{p+1} \):
\[ u_i(b) = Y_i - b_0 - b_1 X_{i1} - \cdots - b_p X_{ip}, \quad i = 1, \ldots, n. \]

We arrange the squared residuals in ascending order:
\[ u^2_{(1)}(b) \leq u^2_{(2)}(b) \leq \cdots \leq u^2_{(n)}(b). \]

Weight function \( \psi : [0, 1] \to [0, 1] \). The least weighted squares (LWS) estimator is
\[ b^{LWS} = (b_0^{LWS}, b_1^{LWS}, \ldots, b_p^{LWS})^T = \arg\min_{b \in \mathbb{R}^{p+1}} \sum_{i=1}^{n} \psi\!\left(\frac{i-1}{n}\right) u^2_{(i)}(b). \]
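Minimizing the LWS criterion is a nonconvex problem, but the objective itself is straightforward to evaluate for any candidate b. A sketch (the names `lws_objective` and `psi_linear` are mine, not the authors'):

```python
import numpy as np

def lws_objective(b, X, Y, psi):
    """Evaluate sum_{i=1}^n psi((i-1)/n) u^2_(i)(b) for a candidate b."""
    X1 = np.column_stack([np.ones(len(Y)), X])   # design matrix with intercept
    u2 = np.sort((Y - X1 @ b) ** 2)              # u^2_(1) <= ... <= u^2_(n)
    n = len(Y)
    return float(psi(np.arange(n) / n) @ u2)     # weights psi((i-1)/n), i = 1..n

psi_linear = lambda t: 1.0 - t                   # a linear weight function as an example
```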

Appealing properties

Víšek J. Á. (2011): Consistency of the least weighted squares under heteroscedasticity. Kybernetika 47 (2), 179–206.
Kalina J., Tichavský J. (2020): On robust estimation of error variance in (highly) robust regression. Measurement Science Review 20 (1), 6–14.


SLIDE 5

Weight functions for the LWS estimator

LWS-A: linear weight function
\[ \psi(t) = 1 - t, \quad t \in [0, 1]. \]

LWS-B: logistic weight function
\[ \psi(t) = \frac{1 + \exp\{-s/2\}}{1 + \exp\{s(t - \tfrac{1}{2})\}}, \quad t \in [0, 1], \ s > 0 \ \text{(fixed)}. \]

LWS-C: trimmed linear weights for a fixed \( \tau \in [1/2, 1) \):
\[ \psi(t) = \left(1 - \frac{t}{\tau}\right) \mathbb{1}[t < \tau], \quad t \in [0, 1], \]
where \( \mathbb{1}[\cdot] \) denotes an indicator function.
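The three weight functions translate directly to code. A sketch with illustrative parameter choices (s = 10 and τ = 3/4 are my picks; the slide leaves s and τ free):

```python
import numpy as np

def psi_A(t):                            # LWS-A: linear weights
    return 1.0 - t

def psi_B(t, s=10.0):                    # LWS-B: logistic weights, s > 0 fixed
    return (1.0 + np.exp(-s / 2.0)) / (1.0 + np.exp(s * (t - 0.5)))

def psi_C(t, tau=0.75):                  # LWS-C: trimmed linear, tau in [1/2, 1)
    return (1.0 - t / tau) * (t < tau)   # (t < tau) acts as the indicator
```

All three are nonincreasing on [0, 1], so observations with larger residuals receive smaller weights.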


SLIDE 6

Least trimmed squares

Least trimmed squares (LTS) estimator:
\[ b^{LTS} = (b_0^{LTS}, b_1^{LTS}, \ldots, b_p^{LTS})^T = \arg\min_{b \in \mathbb{R}^{p+1}} \sum_{i=1}^{h} u^2_{(i)}(b). \]
High robustness, low efficiency.

Least trimmed absolute values (LTA) estimator:
\[ \arg\min_{b \in \mathbb{R}^{p+1}} \sum_{i=1}^{h} |u(b)|_{(i)}, \quad (1) \]
where
\[ |u(b)|_{(1)} \leq |u(b)|_{(2)} \leq \cdots \leq |u(b)|_{(n)}. \quad (2) \]
A trimmed version of the regression median (L1 estimator).

Least weighted absolute value (LWA) estimator:
\[ \arg\min_{b \in \mathbb{R}^{p+1}} \sum_{i=1}^{n} w_i |u(b)|_{(i)}. \quad (3) \]
An implicitly weighted regression median.
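The three objectives can be written compactly. A sketch (helper names are mine; the trimming constant h and the weights w_i are user choices):

```python
import numpy as np

def _residuals(b, X, Y):
    X1 = np.column_stack([np.ones(len(Y)), X])   # intercept column
    return Y - X1 @ b

def lts_objective(b, X, Y, h):
    """Sum of the h smallest squared residuals."""
    return float(np.sort(_residuals(b, X, Y) ** 2)[:h].sum())

def lta_objective(b, X, Y, h):
    """Sum of the h smallest absolute residuals."""
    return float(np.sort(np.abs(_residuals(b, X, Y)))[:h].sum())

def lwa_objective(b, X, Y, w):
    """Weighted sum of the ordered absolute residuals: sum_i w_i |u(b)|_(i)."""
    return float(np.asarray(w) @ np.sort(np.abs(_residuals(b, X, Y))))
```

With h < n, a grossly outlying observation contributes nothing to the LTS and LTA objectives, which is the source of their robustness; the LWA instead downweights it implicitly through w.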


SLIDE 7

Investment dataset

Dataset of U.S. investments with n = 22 yearly values (in 10^9 USD):
X: Gross domestic product
Y: Gross private domestic investments

Results of four robust regression estimators on the investment dataset. Left: results of the LTS fit (triangles) and LWS fit (crosses). Right: results of the LTA fit (triangles) and LWA fit (crosses).


SLIDE 8

Principles of bootstrap


SLIDE 9

Nonparametric bootstrap for \( \operatorname{var}\, b^{LWS} \)

Input: data rows \( (X_{i1}, \ldots, X_{ip}, Y_i) \), \( i = 1, \ldots, n \); number of bootstrap samples \( S > 0 \).

Compute the least weighted squares estimator \( \hat{\beta}^{LWS} \) of \( \beta \) in the model \( Y \sim X \).
FOR s = 1 to S:
  Generate n new bootstrap data rows \( ({}^{(s)}X^*_{j1}, \ldots, {}^{(s)}X^*_{jp}, {}^{(s)}Y^*_j) \), \( j = 1, \ldots, n \), by sampling with replacement from the data rows \( (X_{i1}, \ldots, X_{ip}, Y_i) \), \( i = 1, \ldots, n \).
  Consider a linear regression model of the form
  \[ {}^{(s)}Y^*_j = {}^{(s)}\gamma_0 + {}^{(s)}\gamma_1\, {}^{(s)}X^*_{j1} + \cdots + {}^{(s)}\gamma_p\, {}^{(s)}X^*_{jp} + {}^{(s)}v_j, \quad j = 1, \ldots, n. \quad (4) \]
  Estimate \( {}^{(s)}\gamma = ({}^{(s)}\gamma_0, {}^{(s)}\gamma_1, \ldots, {}^{(s)}\gamma_p)^T \) in (4) by the LWS.
  Store the estimate from the previous step as \( {}^{(s)}\hat{\gamma}^{LWS} \).
Compute the empirical covariance matrix from the values \( {}^{(s)}\hat{\gamma}^{LWS} \), \( s = 1, \ldots, S \).
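The loop can be sketched as follows; for brevity the inner fit is ordinary least squares, standing in for the LWS fit of the procedure above, and all names are mine:

```python
import numpy as np

def ols_fit(X, Y):
    """Stand-in estimator; the procedure above would use the LWS here."""
    X1 = np.column_stack([np.ones(len(Y)), X])
    return np.linalg.lstsq(X1, Y, rcond=None)[0]

def bootstrap_cov(X, Y, fit, S=200, seed=0):
    """Nonparametric bootstrap: refit on resampled rows, return empirical covariance."""
    rng = np.random.default_rng(seed)
    n = len(Y)
    gammas = []
    for s in range(S):
        idx = rng.integers(0, n, size=n)         # sample rows with replacement
        gammas.append(fit(X[idx], Y[idx]))       # the s-th bootstrap estimate
    return np.cov(np.array(gammas).T)            # empirical covariance matrix

rng = np.random.default_rng(1)
X = rng.normal(size=(30, 1))
Y = 1.0 + 2.0 * X[:, 0] + 0.5 * rng.normal(size=30)
C = bootstrap_cov(X, Y, ols_fit, S=100)
```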


SLIDE 10

Analysis of the investment dataset 1

The classical and robust estimates of the intercept and slope are accompanied by nonparametric bootstrap estimates of standard deviations (s0 and s1) and covariances (s01). MSE denotes the mean square error evaluated within a leave-one-out cross validation.

Estimator      Intercept  Slope  s0     s1     s01    MSE
Least squares  −582       0.239  108.9  0.016  −1.67  10 948
LTS            −375       0.207  742.0  0.106  −5.74  16 489
LWS            −601       0.242  207.2  0.031  −2.40  12 033
LTA            −312       0.204  721.6  0.112  −5.58  16 207
LWA            −551       0.232  224.8  0.030  −2.49  12 251
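The leave-one-out MSE in the caption can be computed as below; again OLS stands in for the robust fits, and all names are mine:

```python
import numpy as np

def ols_fit(X, Y):
    X1 = np.column_stack([np.ones(len(Y)), X])
    return np.linalg.lstsq(X1, Y, rcond=None)[0]

def loo_mse(X, Y, fit):
    """Mean squared prediction error under leave-one-out cross validation."""
    n = len(Y)
    sq_errors = []
    for i in range(n):
        mask = np.arange(n) != i                 # drop the i-th observation
        b = fit(X[mask], Y[mask])                # fit on the remaining n - 1 rows
        x1 = np.concatenate([[1.0], np.atleast_1d(X[i])])
        sq_errors.append((Y[i] - x1 @ b) ** 2)   # predict the held-out Y_i
    return float(np.mean(sq_errors))
```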


SLIDE 11

Analysis of the investment dataset 2

Values of five different loss functions computed for five estimators over the investment dataset. This reveals the tightness of the algorithms for computing the individual robust regression estimators.

Estimator  Σ_{i=1}^n u_i²  Σ_{i=1}^h u²_(i)  Σ_{i=1}^n w_i u²_(i)  Σ_{i=1}^h |u|_(i)  Σ_{i=1}^n w_i |u|_(i)
LS         198 796         80 834            4225                  995                51.7
LTS        245 484         61 298            4019                  835                45.4
LWS        223 132         63 661            3914                  844                45.0
LTA        247 037         62 597            4004                  791                46.2
LWA        220 925         64 076            3985                  826                41.3


SLIDE 12

Conclusions

- LTS: popular
- LWS: more promising but little known
- LTA: only a small number of applications so far
- Novel proposal of the LWA:
  - Reliable algorithm
  - More flexible than LTA
  - Performance similar to LWS

Future research
