A fast way to compute Least Squares Teo Zhi Shen Anderson - - PowerPoint PPT Presentation

a fast way to compute least squares
SMART_READER_LITE
LIVE PREVIEW

A fast way to compute Least Squares Teo Zhi Shen Anderson - - PowerPoint PPT Presentation

A fast way to compute Least Squares Teo Zhi Shen Anderson Serangoon Junior College Least squares and Linear regression Used in the study of relationships between variables X and Y to : 1. Determine how strong the relationship is between the 2


slide-1
SLIDE 1

A fast way to compute Least Squares

Teo Zhi Shen Anderson Serangoon Junior College

slide-2
SLIDE 2

Least squares and Linear regression

Used in the study of relationships between variables X and Y to :

  • 1. Determine how strong the relationship is between the 2 variables X and Y
  • 2. Predict effects on Y caused by changes in X
  • 3. Predict a trend between X and Y
slide-3
SLIDE 3

Method of Least Squares

Minimises the squares of the differences between the data points and modelled equation.

Scatter plot

slide-4
SLIDE 4

Rise of Big Data

More information generated Increasingly large data sets Longer computation times + Reduced efficiency

slide-5
SLIDE 5

Applications of Least Square

Used in :

slide-6
SLIDE 6

The Two point method

The new two point method aims to form the line of best fit based on 2 points. 1.Geometric Centre of the scatter plot 2.” Centre of Mass”

  • f the scatter plot
slide-7
SLIDE 7

Improvements

2 Point Method Current Method

  • Simpler

computational steps

  • Contains complex
  • perations like partial

derivatives

  • Lesser operations
  • More operations
slide-8
SLIDE 8

Aims and Objectives

To determine whether the Two Point Method is indeed faster and more efficient than the current method for computing least square.

slide-9
SLIDE 9

Computation Time

Through the use of Python, the computation time for both methods can be compared with one another to confirm that the 2 point method is indeed faster.

slide-10
SLIDE 10

Comparing Computing Times

slide-11
SLIDE 11

Trends

For smaller sets of data, the current method is faster. However for larger sets, the 2 point method is faster.

slide-12
SLIDE 12

Conclusion

The new Two point method is indeed faster in calculating large data sets. However, both methods are still vulnerable to outlier values. Additionally , the new Two Point method can only be used to improve the efficiency of linear regression but not non-linear data sets.