[PPT] - Methods for Longitudinal Data Categorical Response Gilbert PowerPoint Presentation

SLIDE 1

LIVES Doctoral Program: Categorical longitudinal data

Methods for Longitudinal Data Categorical Response

Gilbert Ritschard

Institute for demographic and life course studies, University Geneva http://mephisto.unige.ch

Doctoral Program, Lausanne, May 20, 2011

19/5/2011gr 1/37

SLIDE 2

LIVES Doctoral Program: Categorical longitudinal data

Typology of methods for life course data

Issues Questions duration/hazard state/event sequencing descriptive

Survival curves:
Sequence

Parametric clustering (Weibull, Gompertz, ...)

Frequencies of given

and non parametric patterns (Kaplan-Meier, Nelson-

Discovering typical

Aalen) estimators. episodes causality

Hazard regression models
Markov models

(Cox, ...)

Mobility trees
Survival trees
Association rules

among episodes

19/5/2011gr 2/37

SLIDE 3

LIVES Doctoral Program: Categorical longitudinal data Survival analysis

Outline

1

Survival analysis

2

State sequence analysis: brief overview

3

Mobility and transition rates

4

Conclusion

19/5/2011gr 3/37

SLIDE 4

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival curves

Section outline

1

Survival analysis Survival curves Survival models and trees

19/5/2011gr 4/37

SLIDE 5

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival curves

Survival Approaches

Event history analysis

Survival or Event history analysis (Mills, 2011)(Blossfeld and Rohwer,

2002)

Focuses on one event. Concerned with duration until event occurs

r with hazard of experiencing event.

Survival curves: Distribution of duration until event occurs S(t) = p(T ≥ t) . Hazard models: Regression like models for S(t, x) or hazard h(t) = p(T = t | T ≥ t) h(t, x) = g

t, β0 + β1x1 + β2x2(t) + · · ·
.

19/5/2011gr 5/37

SLIDE 6

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival curves

Survival Approaches

Event history analysis

Survival or Event history analysis (Mills, 2011)(Blossfeld and Rohwer,

2002)

Focuses on one event. Concerned with duration until event occurs

r with hazard of experiencing event.

Survival curves: Distribution of duration until event occurs S(t) = p(T ≥ t) . Hazard models: Regression like models for S(t, x) or hazard h(t) = p(T = t | T ≥ t) h(t, x) = g

t, β0 + β1x1 + β2x2(t) + · · ·
.

19/5/2011gr 5/37

SLIDE 7

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival curves

Survival curves (Switzerland, SHP 2002 biographical survey)

Women 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 10 20 30 40 50 60 70 80 AGE (years) Survival probability Leaving home Marriage 1st Chilbirth Parents' death Last child left Divorce Widowing

19/5/2011gr 6/37

SLIDE 8

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

Section outline

1

Survival analysis Survival curves Survival models and trees

19/5/2011gr 7/37

SLIDE 9

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

SHP biographical retrospective survey

http://www.swisspanel.ch

SHP retrospective survey: 2001 (860) and 2002 (4700 cases). We consider only data collected in 2002. Data completed with variables from 2002 wave (language). Characteristics of retained data for divorce (individuals who get married at least once) men women Total Total 1414 1656 3070 1st marriage dissolution 231 308 539 16.3% 18.6% 17.6%

19/5/2011gr 8/37

SLIDE 10

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

SHP biographical retrospective survey

http://www.swisspanel.ch

SHP retrospective survey: 2001 (860) and 2002 (4700 cases). We consider only data collected in 2002. Data completed with variables from 2002 wave (language). Characteristics of retained data for divorce (individuals who get married at least once) men women Total Total 1414 1656 3070 1st marriage dissolution 231 308 539 16.3% 18.6% 17.6%

19/5/2011gr 8/37

SLIDE 11

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

Marriage duration until divorce

Survival curves

0 8 0.85 0.9 0.95 1 vie 0.5 0.55 0.6 0.65 0.7 0.75 0.8 10 20 30 40

prob. de surv

Durée du mariage, Femmes 0 8 0.85 0.9 0.95 1 vie 0.5 0.55 0.6 0.65 0.7 0.75 0.8 10 20 30 40

prob. de surv

Durée du mariage, Hommes

0 8 v 8 v 1942 et avant 1943-1952 1953 et après

19/5/2011gr 9/37

SLIDE 12

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

Marriage duration until divorce

Hazard model

Discrete time model (logistic regression on person-year data) exp(B) gives the Odds Ratio, i.e. change in the odd h/(1 − h) when covariate increases by 1 unit. exp(B) Sig. birthyr 1.0088 0.002 university 1.22 0.043 child 0.73 0.000 language unknwn 1.47 0.000 French 1.26 0.007 German 1 ref Italian 0.89 0.537 Constant 0.0000000004 0.000

19/5/2011gr 10/37

SLIDE 13

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

Divorce, Switzerland, Relative risk

19/5/2011gr 11/37

SLIDE 14

LIVES Doctoral Program: Categorical longitudinal data Survival analysis Survival models and trees

Hazard model with interaction

Adding interaction effects detected with the tree approach improves significantly the fit (sig ∆χ2 = 0.004) exp(B) Sig. born after 1940 1.78 0.000 university 1.22 0.049 child 0.94 0.619 language unknwn 1.50 0.000 French 1.12 0.282 German 1 ref Italian 0.92 0.677 b before 40French 1.46 0.028 b after 40child 0.68 0.010 Constant 0.008 0.000

19/5/2011gr 12/37

SLIDE 15

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Outline

1

Survival analysis

2

State sequence analysis: brief overview

3

Mobility and transition rates

4

Conclusion

19/5/2011gr 13/37

SLIDE 16

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Illustrative mvad data set

McVicar and Anyadike-Danes (2002)’s study of transition from school to employment in North Ireland.

Survey of 712 Irish youngsters. Sequences describe their follow-up during the 6 years after the end of compulsory school (16 years old) and are formed by 70 successive monthly observed states between September 1993 and June 1999. Sates are: EM Empoyement FE Further education HE Higher education JL Joblessness SC School TR Training.

19/5/2011gr 14/37

SLIDE 17

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Sate sequences - mvad data set

First sequences (first 20 months)

Sequence 1 EM-EM-EM-EM-TR-TR-EM-EM-EM-EM-EM-EM-EM-EM-EM-EM-EM-EM-EM-EM 2 FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE-FE 3 TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR 4 TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR-TR

compact representation (SPS format)

Sequence [1] (EM,4)-(TR,2)-(EM,64) [2] (FE,36)-(HE,34) [3] (TR,24)-(FE,34)-(EM,10)-(JL,2) [4] (TR,47)-(EM,14)-(JL,9)

4 seq. (n=4) Sep.93 Sep.94 Sep.95 Sep.96 Sep.97 Sep.98 4 3 2 1

19/5/2011gr 15/37

SLIDE 18

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

State sequences: Graphical display

19/5/2011gr 16/37

SLIDE 19

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Pairwise dissimilarities and cluster analysis

Different metrics permit to compute pairwise dissimilarities between sequences

f which optimal matching (Abbott and Forrest, 1986) is perhaps

the most popular in social sciences

Once you have pairwise dissimilarities, you can do

cluster analysis of sequences principal coordinate analysis measure the discrepancy between sequences Find representative sequences, either most central or with highest density neighborhood (Gabadinho et al., 2011b) ANOVA-like analysis and Regression trees (Studer et al., 2011)

19/5/2011gr 17/37

SLIDE 20

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Cluster analysis: Outcome

Rendering the cluster contents: transversal state distributions

Cluster 1

Freq. (weighted n=226.47)

Sep.93 Mar.95 Sep.96 Mar.98 0.0 0.2 0.4 0.6 0.8 1.0

Cluster 2

Freq. (weighted n=189.06)

Sep.93 Mar.95 Sep.96 Mar.98 0.0 0.2 0.4 0.6 0.8 1.0

Cluster 3

Freq. (weighted n=196.82)

Sep.93 Mar.95 Sep.96 Mar.98 0.0 0.2 0.4 0.6 0.8 1.0

Cluster 4

Freq. (weighted n=99.22)

Sep.93 Mar.95 Sep.96 Mar.98 0.0 0.2 0.4 0.6 0.8 1.0 employment further education higher education joblessness school training

19/5/2011gr 18/37

SLIDE 21

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Cluster analysis: Outcome (2)

Mean time per state by cluster

EM FE HE JL SC TR

Cluster 1

Mean time (weighted n=226.47) 14 28 42 56 70 EM FE HE JL SC TR

Cluster 2

Mean time (weighted n=189.06) 14 28 42 56 70 EM FE HE JL SC TR

Cluster 3

Mean time (weighted n=196.82) 14 28 42 56 70 EM FE HE JL SC TR

Cluster 4

Mean time (weighted n=99.22) 14 28 42 56 70 employment further education higher education joblessness school training

19/5/2011gr 19/37

SLIDE 22

LIVES Doctoral Program: Categorical longitudinal data State sequence analysis: brief overview

Regression tree

19/5/2011gr 20/37

SLIDE 23

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates

Outline

1

Survival analysis

2

State sequence analysis: brief overview

3

Mobility and transition rates

4

Conclusion

19/5/2011gr 21/37

SLIDE 24

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Section outline

3

Mobility and transition rates Markov process Mobility tree

19/5/2011gr 22/37

SLIDE 25

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Markov process: Principle

(Br´ emaud, 1999; Berchtold and Raftery, 2002)

Assume we have a sequence of states (not necessarily panel data) How is state in position t related to previous states? What is the probability to switch to state B in t when we are in state A in t − 1?

Probability to fall next year into joblessness when we have a partial time job. Probability to stay unemployed next t when we are currently unemployed. Probability to recover from illness next month.

19/5/2011gr 23/37

SLIDE 26

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Homogenous Markov process: Assumptions

transition probability is the same whatever t (homogeneity) a few lagged states summarize all the sequence before t 1st order: state in t − 1 summarizes all the sequence before t; i.e.; state in t depends only on state in t − 1 2nd order: states in t − 1 and t − 2 summarize all the sequence before t; i.e.; state in t depends only on states in t − 1 and t − 2 ...

19/5/2011gr 24/37

SLIDE 27

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Homogenous Markov process: Assumptions

transition probability is the same whatever t (homogeneity) a few lagged states summarize all the sequence before t 1st order: state in t − 1 summarizes all the sequence before t; i.e.; state in t depends only on state in t − 1 2nd order: states in t − 1 and t − 2 summarize all the sequence before t; i.e.; state in t depends only on states in t − 1 and t − 2 ...

19/5/2011gr 24/37

SLIDE 28

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Markov process: Illustration

Blossfeld and Rohwer (2002) sample of 600 job episodes extracted from the German Life History Study Job episodes partitioned into 3 job length categories

short (1) = ≤ 3 years medium (2) = (3; 10] years long (3) = > 10 years

Data reorganized into 162 sequences of 2 to 9 job episodes (units with single episode not considered) How does present episode length depend upon those of preceding jobs?

19/5/2011gr 25/37

SLIDE 29

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Markov matrices of order 0, 1 and 2

t − 2 t − 1 t t − 2 t − 1 t t − 2 t − 1 t

job length at t half conf. 1 2 3 interval Indep .50 .35 .15 .07 t−1 1 .57 .30 .13 .10 2 .43 .42 .15 .13 3 .20 .53 .27 .29 t−2 t−1 1 1 .55 .30 .15 .11 2 1 .60 .30 .10 .20 3 1 1 .65 1 2 .37 .45 .18 .18 2 2 .50 .41 .09 .20 3 2 .45 .33 .22 .38 1 3 .33 .17 .50 .46 2 3 .87 .13 .40 3 3 1 1

19/5/2011gr 26/37

SLIDE 30

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Main findings

First order:

Probability to start short job (1) after a short one (1) is much higher than starting a medium (2) or long job (3) not the case after a medium or long job

Second order:

No clear evidence about impact of lag 2 job Main difference concerns long job (3) (but not significant) Confirmed by MTD model, which gives weight 0 to second lag

19/5/2011gr 27/37

SLIDE 31

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Two state hidden Markov model

t − 2 t − 1 t

Hidden Process Observed Job Hidden state at t half conf. t−1 1 2 interval 1 .78 .22 .12 2 .53 .47 .19 initial .56 .44 .11 Hidden Job length half conf. state 1 2 3 interval 1 .75 .23 .02 .12 2 .05 .58 .37 .18

19/5/2011gr 28/37

SLIDE 32

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Markov process

Hidden Markov Model (HMM)

Relaxing homogeneity assumption with HMM Fitting a HMM with 2 hidden states

distribution of initial state of hidden variable transition matrix of hidden process distribution of transitions to the job length categories associated to each hidden state

19/5/2011gr 29/37

SLIDE 33

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Mobility tree

Section outline

3

Mobility and transition rates Markov process Mobility tree

19/5/2011gr 30/37

SLIDE 34

LIVES Doctoral Program: Categorical longitudinal data Mobility and transition rates Mobility tree

Mobility tree

Social transition tree with birth place covariate (Ritschard and Oris, 2005)

Low, Clock, High

19/5/2011gr 31/37

SLIDE 35

LIVES Doctoral Program: Categorical longitudinal data Conclusion

Outline

1

Survival analysis

2

State sequence analysis: brief overview

3

Mobility and transition rates

4

Conclusion

19/5/2011gr 32/37

SLIDE 36

LIVES Doctoral Program: Categorical longitudinal data Conclusion

Conclusion

Now, it is your turn! To chose a method, you first have to

Clarify what you are looking for

typical patterns, departures from standards, ... specific transitions or holistic view relationships with context (covariates) ...

Identify the nature of your data

Categorical vs numerical Direct or indirect measures of variable of interest Long or short sequences ...

19/5/2011gr 33/37

SLIDE 37

LIVES Doctoral Program: Categorical longitudinal data Conclusion

Thank You! Thank You!

19/5/2011gr 34/37

SLIDE 38

LIVES Doctoral Program: Categorical longitudinal data Conclusion

References I

Abbott, A. and J. Forrest (1986). Optimal matching methods for historical

sequences. Journal of Interdisciplinary History 16, 471–494.

Berchtold, A. and A. E. Raftery (2002). The mixture transition distribution model for high-order Markov chains and non-gaussian time series. Statistical Science 17(3), 328–356. Blossfeld, H.-P. and G. Rohwer (2002). Techniques of Event History Modeling, New Approaches to Causal Analysis (2nd ed.). Mahwah NJ: Lawrence Erlbaum. Br´ emaud, P. (1999). Markov Chains, Gibbs Fields, Monte Carlo Simulation, and Queues. New york: Springer Verlag. Gabadinho, A., G. Ritschard, N. S. M¨ uller, and M. Studer (2011a). Analyzing and visualizing state sequences in R with TraMineR. Journal of Statistical Software 40(4), 1–37.

19/5/2011gr 35/37

SLIDE 39

LIVES Doctoral Program: Categorical longitudinal data Conclusion

References II

Gabadinho, A., G. Ritschard, M. Studer, and N. S. M¨ uller (2011b). Extracting and rendering representative sequences. In A. Fred, J. L. G. Dietz, K. Liu, and J. Filipe (Eds.), Knowledge Discovery, Knowledge Engineering and Knowledge Management, Volume 128 of Communications in Computer and Information Science (CCIS), pp. 94–106. Springer-Verlag. McVicar, D. and M. Anyadike-Danes (2002). Predicting successful and unsuccessful transitions from school to work using sequence methods. Journal of the Royal Statistical Society A 165(2), 317–334. Mills, M. (2011). Introducing Survival and Event HistoryAnalysis. London:

Sage. (Chap. 11 about Sequential analysis and TraMineR).

Ritschard, G., A. Gabadinho, N. S. M¨ uller, and M. Studer (2008). Mining event histories: A social science perspective. International Journal of Data Mining, Modelling and Management 1(1), 68–90. Ritschard, G. and M. Oris (2005). Life course data in demography and social sciences: Statistical and data mining approaches. In R. Levy, P. Ghisletta, J.-M. Le Goff, D. Spini, and E. Widmer (Eds.), Towards an Interdisciplinary Perspective on the Life Course, Advances in Life Course Research, Vol. 10,

pp. 289–320. Amsterdam: Elsevier.

19/5/2011gr 36/37

SLIDE 40

LIVES Doctoral Program: Categorical longitudinal data Conclusion

References III

Studer, M., G. Ritschard, A. Gabadinho, and N. S. M¨ uller (2011). Discrepancy analysis of state sequences. Sociological Methods and Research. In press.

19/5/2011gr 37/37