1
Advanced Mathematical Methods
Part II – Statistics Generalised Linear Model
Mel Slater
http://www.cs.ucl.ac.uk/staff/m.slater/Teaching/Statistics/
Advanced Mathematical Methods Part II Statistics Generalised - - PowerPoint PPT Presentation
Advanced Mathematical Methods Part II Statistics Generalised Linear Model Mel Slater http://www.cs.ucl.ac.uk/staff/m.slater/Teaching/Statistics/ 1 Outline Introduction The General Linear Model Least Squares Estimation
1
Part II – Statistics Generalised Linear Model
Mel Slater
http://www.cs.ucl.ac.uk/staff/m.slater/Teaching/Statistics/
2
3
4
In practice the model is ‘linear’
We have observations on n individuals,
5
y = Xβ + ε
Note that p=k+1 if a constant term β0 is
Normally there should be a constant term.
6
To estimate the unknown parameters β To make inferences about β In particular we can find confidence
We can test hypotheses, in particular the
– Tests for relationship between y and X.
7
8
The total variation in the
Let y* = X β*
Then the total variation in
2 1
n i i −
= 2 1
n i i −
=
9
10
Now we make the further assumption that
Then under this assumption and the null
F = FSS/RSS ~ F(k,n-k)
11
12
R2 = Fitted SS/ Total SS This is the multiple correlation coefficient It is the proportion of the variation in the
R2 is between 0 and 1 It should be used together with the F-
13
Each β* ~ t-distribution on n-k degrees of
This can be used to construct confidence
An approx rule is
The ‘standard deviation’ for an estimate is
14
15
16
17
GLIM will print out the deviance and
Note if you fit the empty model, it will just
The deviance for this is the Total SS
18
$display e !will print out the estimates of
This can be used to look at each beta
The higher the ratio
the better that parameter and the more
19
You can incrementally fit variables
$fit . $ !refits the current model $display m $!displays the current model The advantage of GLIM compared to
The ‘user interface’ is the disadvantage
20
21
Take the F-Ratio If this is large then reject the null