Section 6.6 Least Squares Problems Data Modeling: Best fit line - PowerPoint PPT Presentation

Section 6.6 Least Squares Problems

Data Modeling: Best fit line What does it minimize? Best fit line minimizes the sum of the squares of the vertical distances from the data points to the line. (0 , 6) 1 y = − 3 x + 5 − 2 (2 , 0) (1 , 0) 1

Data modeling: best fit parabola What least squares problem Ax = b finds the best parabola through the points ( − 1 , 0 . 5), (1 , − 1), (2 , − 0 . 5), (3 , 2)? The general equation for a parabola is ax 2 + bx + c = y . So we want to solve: a ( − 1) 2 + b ( − 1) + c = 0 . 5 a (1) 2 + b (1) + c = − 1 a (2) 2 + b (2) + c = − 0 . 5 a (3) 2 + b (3) + c = 2 In matrix form:       1 − 1 1 0 . 5 a     1 1 1 − 1  =      b  .    4 2 1 − 0 . 5 c 9 3 1 2 88 so best fit is: 53 x 2 − 379 a = 53 88 , � b = 379 c = 82 Answer: � 5 x − 82 = 88 y 440 , �

Data modeling: best fit parabola Picture 88 y = 53 x 2 − 379 5 x − 82 (3 , 2) ( − 1 , 0 . 5) (2 , − 0 . 5) (1 , − 1)

Data modeling: best fit ellipse Find the best fit ellipse for the points (0 , 2), (2 , 1), (1 , − 1), ( − 1 , − 2), ( − 3 , 1). The general equation for an ellipse is x 2 + ay 2 + bxy + cx + dy + e = 0 So we want to solve: (0) 2 + A (2) 2 + B (0)(2) + C (0) + D (2) + E = 0 (2) 2 + A (1) 2 + B (2)(1) + C (2) + D (1) + E = 0 (1) 2 + A ( − 1) 2 + B (1)( − 1) + C (1) + D ( − 1) + E = 0 ( − 1) 2 + A ( − 2) 2 + B ( − 1)( − 2) + C ( − 1) + D ( − 2) + E = 0 ( − 3) 2 + A (1) 2 + B ( − 3)(1) + C ( − 3) + D (1) + E = 0 In matrix form:       4 0 0 2 1 a 0       1 2 2 1 1 b − 4             1 − 1 1 − 1 1 c = − 1 .             4 2 − 1 − 2 1 d − 1 1 − 3 − 3 1 1 − 9 e

Data modeling: best fit ellipse Complete procedure 4 0 0 2 1 0     1 2 2 1 1 − 4     A = 1 − 1 1 − 1 1 b = − 1  .         4 2 − 1 − 2 1 − 1    1 − 3 − 3 1 1 − 9 35 6 − 4 1 11 − 18     6 18 10 − 4 0 18 A T A =   A T b =   − 4 10 15 0 − 1 19         1 − 4 0 11 1 − 10     11 0 − 1 1 5 − 15 Row reduce: 35 6 − 4 1 11 − 18 1 0 0 0 0 16 / 7     6 18 10 − 4 0 18 0 1 0 0 0 − 8 / 7     − 4 10 15 0 − 1 19 0 0 1 0 0 15 / 7         1 − 4 0 11 1 − 10 0 0 0 1 0 − 6 / 7     11 0 − 1 1 5 − 15 0 0 0 0 1 − 52 / 7 Best fit ellipse: x 2 + 16 7 y 2 − 8 7 xy + 15 7 x − 6 7 y − 52 7 = 0 or 7 x 2 + 16 y 2 − 8 xy + 15 x − 6 y − 52 = 0 .

Data modeling: best fit ellipse Picture (0 , 2) ( − 3 , 1) (2 , 1) (1 , − 1) ( − 1 , − 2) 7 x 2 + 16 y 2 − 8 xy + 15 x − 6 y − 52 = 0 Remark: Gauss invented the method of least squares to do exactly this: he predicted the (elliptical) orbit of the asteroid Ceres as it passed behind the sun in 1801.

Extra: Best fit linear function x y f ( x , y ) What least squares problem Ax = b finds the best 1 0 0 linear function f ( x , y ) fitting the following data ? 0 1 1 The general equation for a linear function in two − 1 0 3 variables is 0 − 1 4 f ( x , y ) = ax + by + c . So we want to solve a (1) + b (0) + c = 0 a (0) + b (1) + c = 1 a ( − 1) + b (0) + c = 3 a (0) + b ( − 1) + c = 4 In matrix form:       1 0 1 0 a     0 1 1 1     =   b  .    − 1 0 1 3 c 0 − 1 1 4 c = 2 so best fit is: f ( x , y ) = − 3 2 x − 3 a = − 3 2 , � b = − 3 Answer: � 2 , � 2 y + 2

Extra: Best fit linear function Picture (0 , − 1 , 4) f ( − 1 , 0) f (0 , − 1) f ( x , y ) Graph of ( − 1 , 0 , 3) f ( x , y ) = − 3 2 x − 3 2 y + 2 x (0 , 1 , 1) f (1 , 0) y f (0 , 1) (1 , 0 , 0)

Multiple Regression Generalizing the best-fit plane before: ◮ A variable y depends on ◮ Independent variables u , v General formula: The best fit plane: A quadratic function (next week’s subject):

Multiple regression Expert’s notation The model to fit: The equation display y = X β + ε : The error We want to minimize the length of ε . In last section we don’t write it as part of the equation.

Section 6.6 Least Squares Problems Data Modeling: Best fit line - PowerPoint PPT Presentation

Section 6.6 Least Squares Problems Data Modeling: Best fit line What does it minimize? Best fit line minimizes the sum of the squares of the vertical distances from the data points to the line. (0 , 6) 1 y = 3 x + 5 2 (2 , 0) (1 ,

Module V: Vector Spaces Module V Math 237 Module V Section V.0 Section V.1 Section V.2

Half Year Results Presentation 2019 6 months ended 30 June 2019 Section 1 Section 2 Section 3

2018 Full year results presentation 12 months ended 31 December 2018 1 Section 1 Section 2

May 2013 Agenda Section 1 Jaypee Group Overview Section 2 Company Overview Section 3 Yamuna

Fermilab NORTH 0 20 20 40 1"=20'-0" 2/8/2019 6:57:50 PM 4850 LEVEL SCALE SC LE

Module A: Algebraic properties of linear maps Module A Math 237 Module A Section A.1 Section

Probability Chapter 4 Section 2: Fundamentals Section 3: Addition Rule Section 4:

Probability Chapter 4 Section 2: Fundamentals Section 3: Addition Rule Section 4:

Investor Update CONTENTS SECTION 01 SECTION 02 Asset Overview management strategy SECTION

Agenda Section 1: Introduction Section 2: Emergency & Welfare Arrangements Section

Company presentation June 2016 Table of contents Section 1 Summary 3 Section 2 Market

1 2 3 4 Section 1 Section 2 Section 3 Section 4 INTRODUCTION FINANCIAL SEGMENTAL GROUP

SR 15 SECTION 088 CSVT SOUTHERN SECTION PUBLIC MEETING NOVEMBER 15, 2017 SR 15 SECTION 088

1 Table of content Introduction Section 1 Executive Summary 3 Corporate Overview 9 Section 2

SECTION 3 AGENDA WHAT IS SECTION 3? EXAMPLES OF SECTION 3 OPPORTUNITIES SECTION 3

Contents Page Section 1 Executive Summary 2 Section 2 Investment Rationale 9 Section 3

Offline Analysis of H4 Beam Line Instrumentation Data Alexander Booth for N. Charitonidis, Y.

STAT 113 Analytic Inference for Regression Colin Reimer Dawson Oberlin College 21-24 April 2017

Develop Your Data Mindset Module 8 - Progress Monitoring Part 2 - Background Knowledge (Graphing

y i y = n Median : the midpoint of a group of data. Uchechukwu Ofoegbu Temple University

Topics in Algorithms and Data Science Singular Value Decomposition (SVD) Omid Etesami The

The conditional CAPM does not explain asset- pricing anomalies Jonathan Lewellen & Stefan

Robust Statistics and Generative Adversarial Networks Yuan YAO HKUST Chao Gao (Chicago) Jiyu

xtdcce2 : Estimating Dynamic Common Correlated Effects in Stata Jan Ditzen Spatial Economics and

Section 6.6 Least Squares Problems Data Modeling: Best fit line - PowerPoint PPT Presentation

Section 6.6 Least Squares Problems Data Modeling: Best fit line What does it minimize? Best fit line minimizes the sum of the squares of the vertical distances from the data points to the line. (0 , 6) 1 y = 3 x + 5 2 (2 , 0) (1 ,

Module V: Vector Spaces Module V Math 237 Module V Section V.0 Section V.1 Section V.2

Half Year Results Presentation 2019 6 months ended 30 June 2019 Section 1 Section 2 Section 3

2018 Full year results presentation 12 months ended 31 December 2018 1 Section 1 Section 2

May 2013 Agenda Section 1 Jaypee Group Overview Section 2 Company Overview Section 3 Yamuna

Fermilab NORTH 0 20 20 40 1&quot;=20'-0&quot; 2/8/2019 6:57:50 PM 4850 LEVEL SCALE SC LE

Module A: Algebraic properties of linear maps Module A Math 237 Module A Section A.1 Section

Probability Chapter 4 Section 2: Fundamentals Section 3: Addition Rule Section 4:

Probability Chapter 4 Section 2: Fundamentals Section 3: Addition Rule Section 4:

Investor Update CONTENTS SECTION 01 SECTION 02 Asset Overview management strategy SECTION

Agenda Section 1: Introduction Section 2: Emergency &amp; Welfare Arrangements Section

Company presentation June 2016 Table of contents Section 1 Summary 3 Section 2 Market

1 2 3 4 Section 1 Section 2 Section 3 Section 4 INTRODUCTION FINANCIAL SEGMENTAL GROUP

SR 15 SECTION 088 CSVT SOUTHERN SECTION PUBLIC MEETING NOVEMBER 15, 2017 SR 15 SECTION 088

1 Table of content Introduction Section 1 Executive Summary 3 Corporate Overview 9 Section 2

SECTION 3 AGENDA WHAT IS SECTION 3? EXAMPLES OF SECTION 3 OPPORTUNITIES SECTION 3

Contents Page Section 1 Executive Summary 2 Section 2 Investment Rationale 9 Section 3

Offline Analysis of H4 Beam Line Instrumentation Data Alexander Booth for N. Charitonidis, Y.

STAT 113 Analytic Inference for Regression Colin Reimer Dawson Oberlin College 21-24 April 2017

Develop Your Data Mindset Module 8 - Progress Monitoring Part 2 - Background Knowledge (Graphing

y i y = n Median : the midpoint of a group of data. Uchechukwu Ofoegbu Temple University

Topics in Algorithms and Data Science Singular Value Decomposition (SVD) Omid Etesami The

The conditional CAPM does not explain asset- pricing anomalies Jonathan Lewellen &amp; Stefan

Robust Statistics and Generative Adversarial Networks Yuan YAO HKUST Chao Gao (Chicago) Jiyu

xtdcce2 : Estimating Dynamic Common Correlated Effects in Stata Jan Ditzen Spatial Economics and

Fermilab NORTH 0 20 20 40 1"=20'-0" 2/8/2019 6:57:50 PM 4850 LEVEL SCALE SC LE

Agenda Section 1: Introduction Section 2: Emergency & Welfare Arrangements Section

The conditional CAPM does not explain asset- pricing anomalies Jonathan Lewellen & Stefan