On the Least Median Square Problem - PowerPoint PPT Presentation


  1. On the Least Median Square Problem. Jeff Erickson, University of Illinois; Sariel Har-Peled, University of Illinois; David Mount, University of Maryland.

  2. Robust Linear Regression. Linear Regression: Given a set of n points P = {p_1, p_2, ..., p_n} in R^d, fit a (d-1)-dimensional hyperplane to these points (the ordinary least squares fit). Robust Regression: Some fraction (up to 50%) of the points may lie arbitrarily far from the hyperplane; ideally, an estimator should not be biased by these outliers, and should still recover the desired fit. Breakdown Point: The fraction of outliers (up to 50%) that can bias a given estimator.

  3. LMS/LQS Regression. Residual: Given a parameter vector θ = (θ_1, ..., θ_d) defining the hyperplane x_d = θ_1 x_1 + ... + θ_{d-1} x_{d-1} + θ_d, the i-th residual is the vertical distance from the hyperplane to p_i:
  r_i = x_{i,d} − (θ_1 x_{i,1} + ... + θ_{d-1} x_{i,d-1} + θ_d).
  LMS Estimator (Least Median of Squares): The hyperplane that minimizes the median squared residual [Rou84]; a 50% breakdown estimator. LQS (Least Quantile of Squares): Given an integer k, the hyperplane that minimizes the k-th smallest squared residual.
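The LMS objective for a candidate parameter vector follows directly from this definition. A minimal sketch (the function name and sample data are illustrative, not from the paper):

```python
import numpy as np

def lms_objective(points, theta):
    """Median squared vertical residual of the hyperplane
    x_d = theta[0]*x_1 + ... + theta[d-2]*x_{d-1} + theta[d-1]
    over an (n, d) array of points."""
    X = points[:, :-1]                    # first d-1 coordinates of each point
    y = points[:, -1]                     # the x_d coordinate
    r = y - (X @ theta[:-1] + theta[-1])  # vertical residuals r_i
    return np.median(r ** 2)
```

For d = 2, theta is (slope, intercept); a single gross outlier leaves the objective at zero when the majority of the points fit the line exactly, which is the 50% breakdown behavior described above.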

  4. LMS/LQS: Geometric Formulation. Slab: The region bounded by two parallel hyperplanes. LMS is equivalent to computing the slab of minimum width that encloses at least 50% of the points. The vertical height t* of the slab is twice the median absolute residual, and the central hyperplane of the slab is the LMS estimator. Vertical height or perpendicular width? Our results apply to both cases; we present only the vertical case.

  5. Prior Results. Exact:
  – Plane: O(n^2) time and O(n) space by plane sweep [Edelsbrunner, Souvaine 90].
  – d-space: O(n^{d+1} log n) by enumeration of elemental sets [Rousseeuw, Leroy 87] [Stromberg 93].
  – Few outliers: O(n(n-k)^{d+1}) by LP with few violations [Matousek 95] [Chan 02] and related methods [Har-Peled, Wang 02].
  Approximation (to the optimum slab height t*):
  – Practical heuristics: random sampling and branch-and-bound search [MNPSW 97].
  – Factor-2 approximation: O(n^{d-1} log n) [Olson 97].

  6. Our Results. What is the computational complexity of LMS and LQS?
  – Exact: prior bounds are n^{d+1} log n [LR87] and, with k outliers, n(n-k)^{d+1} [Mat95]; we achieve n^d log n, against a lower bound of Ω(n^d)*.
  – ε-Approx: we achieve (n^{d-1}/ε) log n for LMS and (n^d/(kε)) log^2 n for LQS with k outliers, against a lower bound of Ω(n^{d-1})*.
  Affine Degeneracy: Given n points, are any d+1 coplanar?
  *Assumptions: affine degeneracy in dimension d requires Ω(n^d) time, d is a constant, and min(k, n-k) is Ω(n).

  7. Overview. Remainder of the presentation:
  • Geometric preliminaries
  • Exact algorithm for LMS
  • ε-Approximation for LMS
  • Hardness of exact LMS
  • Concluding remarks
  See the paper for:
  • Generalizations to LQS
  • Results for perpendicular slab width
  • Hardness results on approximating LMS
  • Hardness results for LQS

  8. Geometric Preliminaries. Duality Transformation: Maps a point p = (a_1, ..., a_d) in R^d to the (d-1)-dimensional hyperplane
  p*: x_d = a_1 x_1 + ... + a_{d-1} x_{d-1} − a_d,
  and vice versa. Slab: The dual of a slab containing k points is a vertical segment stabbing k hyperplanes; the height of the slab equals the length of the segment.
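This transform preserves incidences and is its own inverse, which is what lets the slab problem be restated as a segment-stabbing problem. A small sketch (function names are illustrative):

```python
def point_to_dual(a):
    """Dual of the point p = (a_1, ..., a_d): the hyperplane
    p*: x_d = a_1 x_1 + ... + a_{d-1} x_{d-1} - a_d,
    returned as its coefficient vector (a_1, ..., a_{d-1}, -a_d).
    Applying the same map to a hyperplane's coefficients recovers
    the dual point, so the transform is an involution."""
    return tuple(a[:-1]) + (-a[-1],)

def height(coeffs, x):
    """Value of x_d on the hyperplane x_d = c_1 x_1 + ... + c_{d-1} x_{d-1} + c_d
    at the point x = (x_1, ..., x_{d-1})."""
    return sum(c * xi for c, xi in zip(coeffs[:-1], x)) + coeffs[-1]
```

Incidence preservation: a point p lies on a hyperplane h exactly when the dual point h* lies on the dual hyperplane p*.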

  9. Exact Algorithm for LMS/LQS. Theorem: Given a set H of n hyperplanes in R^d and an integer k, the shortest vertical segment that stabs k hyperplanes can be computed in O(n^d log n) time, with high probability. Approach: Randomized parametric search. Let t* be the (unknown) length of the shortest such segment. Decision problem: Given any length t, determine whether t < t*. Discrete candidate values: For a segment to be minimal, its endpoints must together be incident to at least d+1 hyperplanes; O(n^{d+1}) candidates result from considering all subsets of d+1 hyperplanes.

  10. The Decision Procedure. Decision Procedure: Given any length t, in O(n^d) time we can determine whether t < t*. Proof:
  – Replace each hyperplane h of H with a slab bounded by h and a vertical translation of h by t.
  – Construct the arrangement of these slabs in O(n^d) time.
  – Determine whether any cell of this arrangement has slab depth k or more. This holds iff t ≥ t*.
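The same decision can be made in the plane without building the full arrangement explicitly: the slab depth only changes where slab boundaries cross, so it suffices to test a sliding window of height t at those x-coordinates. A brute-force sketch under that observation (mine, not the paper's O(n^d) construction):

```python
from itertools import combinations

def stabs_k(lines, t, k, eps=1e-9):
    """Decide whether some vertical segment of length t stabs at least k
    of the given 2D lines, each represented as (slope, intercept)."""
    # Candidate x-coordinates: crossings of a line with another line
    # shifted by -t, 0, or +t (slab boundaries); depth changes only there.
    xs = [0.0]  # fallback x for all-parallel inputs
    for (m1, b1), (m2, b2) in combinations(lines, 2):
        for shift in (-t, 0.0, t):
            if abs(m1 - m2) > eps:
                xs.append((b2 + shift - b1) / (m1 - m2))
    for x in xs:
        ys = sorted(m * x + b for m, b in lines)
        j = 0
        for i in range(len(ys)):          # window [ys[i], ys[i] + t]
            while j < len(ys) and ys[j] <= ys[i] + t + eps:
                j += 1
            if j - i >= k:
                return True
    return False
```

This is exactly the predicate "t ≥ t*" the parametric search queries.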

  11. Exact Algorithm: Sample and Sweep. Sample:
  – Take a random sample of O(n^d) subsets of d+1 hyperplanes.
  – Compute the associated t values.
  – Using the decision procedure and binary search, find consecutive sample values such that t* lies in the interval [t_0, t_1].
  – With high probability, the expected number of candidate values in the interval [t_0, t_1] is O((n^{d+1}/n^d) log n) = O(n log n).
  Sweep: Consider the parametric arrangement of slabs of height t, as t varies over [t_0, t_1]. Sweep this arrangement as a function of t.
  Total time: O(n^d log n).

  12. Approximation Algorithm for LMS. Theorem: Given a set of n hyperplanes in R^d and ε > 0, we can compute a vertical segment that stabs n/2 hyperplanes and whose length is at most (1+ε) times optimum, in O((n^{d-1}/ε) log^2 n) time, with high probability. Approach: Reduce to the following conditional problem. Conditional problem: Given a set H of n hyperplanes in R^d and a hyperplane g (not necessarily in H), compute the shortest vertical segment that stabs n/2 hyperplanes and whose midpoint lies on g.

  13. Solving the Conditional Problem. Lemma: The conditional problem can be solved in O(n^{d-1} log n) time. Parametric search: Let t* be the optimum segment length for the conditional problem. For h ∈ H, let τ(h, t) be the set of points of g such that a segment of length t centered there stabs h; this is a slab on g. Decision problem (t ≥ t*): Construct the (d-1)-dimensional arrangement of τ(h, t) for all h ∈ H. If the slab depth of any point exceeds n/2, then t ≥ t*. Sample and sweep: As before.
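In the plane the conditional problem is one-dimensional: a segment centered on g at x needs length 2·|h(x) − g(x)| to reach line h, so the optimum is the minimum over x of the k-th smallest such requirement, attained where two requirements tie or one vanishes. A brute-force sketch of that reduction (mine, not the paper's arrangement-based O(n^{d-1} log n) procedure):

```python
from itertools import combinations

def conditional_lms(lines, g, k, eps=1e-9):
    """Shortest vertical segment centered on the 2D line g = (slope,
    intercept) that stabs >= k of the given lines."""
    mg, bg = g

    def need(line, x):
        # Length required for a segment centered on g at x to reach `line`.
        m, b = line
        return 2.0 * abs((m - mg) * x + (b - bg))

    # Candidate x's: ties h1 - g = +-(h2 - g), and zeros h - g = 0.
    xs = [0.0]
    for (m1, b1), (m2, b2) in combinations(lines, 2):
        d1m, d1b = m1 - mg, b1 - bg
        d2m, d2b = m2 - mg, b2 - bg
        for s in (1.0, -1.0):
            if abs(d1m - s * d2m) > eps:
                xs.append((s * d2b - d1b) / (d1m - s * d2m))
    for m, b in lines:
        if abs(m - mg) > eps:
            xs.append(-(b - bg) / (m - mg))
    return min(sorted(need(h, x) for h in lines)[k - 1] for x in xs)
```

The approximation algorithm on the next slide calls this subroutine once per sampled (and translated) hyperplane g.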

  14. Approximation Algorithm (cont). Let s* be the shortest vertical segment that stabs n/2 hyperplanes of H. Sample a set R of O(log n) hyperplanes of H; s* stabs at least one of these with high probability. Solve the conditional problem for each g ∈ R. The overall minimum length t is at most twice optimal: t ≤ 2t*. Let δ = εt/4. For each g ∈ R, construct O(1/ε) vertical translates of g in increments of δ. With high probability, at least one translate passes within εt*/2 of the midpoint of s*. Solve the conditional problem on each such translate; one of the resulting solutions is the required ε-approximation.

  15. Hardness of Exact LMS. Affine Degeneracy (AD):
  – Given n points, are any d+1 coplanar?
  – Conjectured to require Ω(n^d) time.
  Approach:
  – We show that AD is reducible to LMS in O(n) time, implying that LMS is at least as hard.

  16. Hardness of Exact LMS. Reduction: Given a point set P of size m = n/2 − (d+1) for AD, let Y be the height of the set. Q consists of:
  – One copy of P.
  – One copy of P translated vertically by 2·Y.
  – n − 2m = 2(d+1) additional points placed far above.
  Correctness: d+1 points of P are coplanar iff there is a slab containing m + (d+1) = n/2 points of Q.
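The construction itself is linear-time and purely mechanical. A sketch of building Q from P (the exact offset of the "far above" points is illustrative; the slide only requires them to be well separated):

```python
def build_lms_instance(P, d):
    """Build the LMS point set Q from an affine-degeneracy instance P
    of size m, so that |Q| = n = 2m + 2(d+1)."""
    heights = [p[-1] for p in P]
    Y = max(heights) - min(heights)       # vertical height of P
    Q = [tuple(p) for p in P]             # one copy of P
    Q += [tuple(p[:-1]) + (p[-1] + 2.0 * Y,) for p in P]  # copy shifted by 2Y
    top = max(heights) + 100.0 * (Y + 1.0)  # "way above" both copies
    Q += [(0.0,) * (d - 1) + (top + i,) for i in range(2 * (d + 1))]
    return Q
```

A zero-height slab through d+1 coplanar points of one copy then captures exactly m + (d+1) = n/2 points of Q, as the correctness argument states.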

  17. Concluding Remarks. Presented exact and approximation algorithms for LMS and LQS:
  – LMS/LQS can be solved exactly in O(n^d log n) time, with high probability.
  – An ε-approximation to LMS/LQS can be computed in O((n^d/(kε)) polylog n) time; for fixed ε and k = Ω(n), this is O(n^{d-1} polylog n).
  – These running times are within a polylog factor of optimal, assuming the hardness of affine degeneracy.
  Open problems:
  – Can the space bounds be reduced from O(n^d)?
  – How practical is the approach? Can it be combined with branch-and-bound?
  – Does it apply to related estimators, such as least trimmed squares (LTS)?

  18. Thank you
