SLIDE 7 7 We can then define the error to be the difference between the coordinates and the prediction line. The coordinate of one point: (xi, yi) Predicted value for given xi : “Best” line minimizes , the sum of the squared errors.
“Best” line: least-squares, or regression line
i i
x b b y
1
ˆ
2
ˆi
i
y y
Error = distance from one point to the line = Coordinate – Prediction
Some of the errors will be positive and some will be negative! The problem is that when we add positive and negative values, they tend to cancel each other out.