Mining Life Event Sequences Gilbert Ritschard and Matthias Studer - PowerPoint PPT Presentation

Mining Life Event Sequences Mining Life Event Sequences Gilbert Ritschard and Matthias Studer NCCR LIVES and Institute for demographic and life course studies University of Geneva http://mephisto.unige.ch Society For Longitudinal and Life Course Studies International Conference Amsterdam, September 23-25, 2013 10/10/2013gr 1/55

Mining Life Event Sequences Introduction Introduction 1 Frequent subsequences in TraMineR 2 Frequent Swiss life course subsequences 3 Discriminant subsequences 4 Maximal subsequences 5 Conclusion 6 10/10/2013gr 2/55

Mining Life Event Sequences Introduction Objectives Introduction 1 Objectives The Biographical Data from the Swiss Household Panel Frequent subsequences versus Frequent itemsets 10/10/2013gr 3/55

Mining Life Event Sequences Introduction Objectives Objectives Data-mining-based methods (pattern mining) Discovering interesting information from sequences of life events, i.e., on how people sequence important life events What is the most typical succession of family or professional life events? Are there standard ways of sequencing those events? What are the most typical events that occur after a given subsequence such as after leaving home and ending education? How is the sequencing of events related to covariates? Which event sequencings do best discriminate groups such as men and women? Mining of frequent (Agrawal and Srikant, 1995; Mannila et al., 1995; Bettini et al., 1996; Mannila et al., 1997; Zaki, 2001) and discriminant event subsequences 10/10/2013gr 4/55

Mining Life Event Sequences Introduction Objectives Objectives (continued) Demonstrate the kind of results that can be obtained by mining event subsequences Search for most frequent subsequences subsequences that best discriminate groups (provided covariate) New concept of frequent maximal subsequence for more interesting results 10/10/2013gr 5/55

Mining Life Event Sequences Introduction Objectives What’s new Previous attempts with event sequences in social sciences (e.g. Billari et al., 2006; Ritschard et al., 2007) mainly consisted in counting predefined subsequences. 30% 25% 20% 15% 10% 5% 0% Switzerland, SHP 2002 biographical survey ( n = 5560) 10/10/2013gr 6/55

Mining Life Event Sequences Introduction Objectives Event sequences versus state sequences State sequence: states last a whole interval period age 20 21 22 23 24 25 26 state 2P 2P A A UC UC UC Event sequence: events occur at a given (time) position Interest in their order, in their sequencing Can be time stamped (TSE) id Timestamp Event 101 22 Leaving Home 101 24 Start living with partner 101 24 Childbirth 10/10/2013gr 7/55

Mining Life Event Sequences Introduction The Biographical Data from the Swiss Household Panel Introduction 1 Objectives The Biographical Data from the Swiss Household Panel Frequent subsequences versus Frequent itemsets 10/10/2013gr 8/55

Mining Life Event Sequences Introduction The Biographical Data from the Swiss Household Panel The Biographical SHP Data Sequences derived from the biographical survey conducted in 2002 by the Swiss Household Panel www.swisspanel.ch Retain the 1503 cases studied in Widmer and Ritschard (2009) with techniques for state sequences Two channels: Cohabitational and occupational Only individuals aged 45 or more at survey time Focus on life trajectory between 20 and 45 years Granularity is yearly level 10/10/2013gr 9/55

Mining Life Event Sequences Introduction The Biographical Data from the Swiss Household Panel The Cohabitational State Sequences 10/10/2013gr 10/55

Mining Life Event Sequences Introduction The Biographical Data from the Swiss Household Panel The Occupational State Sequences 10/10/2013gr 11/55

Mining Life Event Sequences Introduction The Biographical Data from the Swiss Household Panel Short and long state labels Cohabitational Occupational 2P Biological father and mother Mi Missing 1P One biological parent FT Full time PP One biological parent with her/his partner PT Part time A Alone NB Neg. break U With partner PB Pos. break UC Partner and biological child AH At home UN Partner and non biological child RE Retired C Biological child and no partner ED Education F Friends O Other 10/10/2013gr 12/55

Mining Life Event Sequences Introduction The Biographical Data from the Swiss Household Panel Events associated to cohabitational state transitions For cohabitational trajectories, we convert states to events by defining the events associated to the state transitions 2P 1P PP A U UC UN C F O 2P "2P" "1P" "PP" "LH,A" "LH,U" "LH,U,C" "LH,U,C" "LH,C" "LH,A" "LH,O" 1P "2P" "1P" "PP" "LH,A" "LH,U" "LH,U,C" "LH,U,C" "LH,C" "LH,A" "LH,O" PP "2P" "1P" "PP" "LH,A" "LH,U" "LH,U,C" "LH,U,C" "LH,C" "LH,A" "LH,O" A "2P" "1P" "PP" "A" "U" "U,C" "U,C" "C" "" "O" U "2P" "1P" "PP" "UE,A" "U" "C" "C" "C" "UE,A" "UE,O" UC "2P" "1P" "PP" "UE,CL,A" "CL" "U,C" "CL,C" "UE" "UE,CL,A" "UE,CL,O" UN "2P" "1P" "PP" "UE,CL,A" "CL" "C" "U,C" "UE,C" "UE,CL,A" "UE,CL,O" C "2P" "1P" "PP" "CL,A" "CL,U" "U" "CL,C" "C" "CL,A" "CL,O" F "2P" "1P" "PP" "" "U" "U,C" "U,C" "C" "A" "O" O "2P" "1P" "PP" "A" "U" "U,C" "U,C" "C" "A" "O" For occupational trajectories, we assign an event to the start of each spell in a state. 10/10/2013gr 13/55

Mining Life Event Sequences Gilbert Ritschard and Matthias Studer - PowerPoint PPT Presentation

Mining Life Event Sequences Mining Life Event Sequences Gilbert Ritschard and Matthias Studer NCCR LIVES and Institute for demographic and life course studies University of Geneva http://mephisto.unige.ch Society For Longitudinal and Life

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

1 Mining Event Histories Mining Event Histories Sequence Analysis in Social Sciences Sequence

20-03-06 7. Learning Sequences/Behaviors How to use sequences/behaviors? Sequences and more

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Mining Event or State Sequences: A Social Science Perspective Gilbert Ritschard Department of

Sequences Sequences and Difference Equations "Sequences" is a central topic in

Sequences Sequences and Difference Equations "Sequences" is a central topic in

Events Event-driven programming Event loop Event dispatch Event handling Event Driven

Events Event-driven programming Event loop Event dispatch Event handling Event Driven

Methods Updating Variables Console Programs int life = 42; life life = 42 life; 21 life =

Cement, Aggregates, Mining Presentation Cement, Aggregates and Mining Cement, Aggregates and

Frequent Pattern Mining Frequent Sequence Mining Frequent Tree Mining Christian Borgelt

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

More Event Combinators CML provides two more event combinators: guard and withNack : val guard :

W4231: Analysis of Algorithms Shortest Path with Budget Constraint 11/16/99 (revised 11/18/99) We

Efficient List-based Computation of the String Subsequence Kernel Slimane Bellaouar 1 Hadda

Applying Subsequence Matching to Collaborative Filtering Alejandro Bellog n Pablo S

Algorithms for Computing the Longest Parameterized Common Subsequence Costas S. Iliopoulos 1 ,

Design and Analysis of Algorithms

Spectral properties of approximation sequences: Helena Mascarenhas IST, University of Lisbon and

An optimal Arc Consistency algorithm for a chain of Atmost constraints with cardinality Mohamed

Scientific Programming: Algorithms (part B) Programming paradigms Luca Bianco - Academic Year

Sambuz

Useful Links

Newsletter

Mail Us

Mining Life Event Sequences Gilbert Ritschard and Matthias Studer - PowerPoint PPT Presentation

Mining Life Event Sequences Mining Life Event Sequences Gilbert Ritschard and Matthias Studer NCCR LIVES and Institute for demographic and life course studies University of Geneva http://mephisto.unige.ch Society For Longitudinal and Life

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

1 Mining Event Histories Mining Event Histories Sequence Analysis in Social Sciences Sequence

20-03-06 7. Learning Sequences/Behaviors How to use sequences/behaviors? Sequences and more

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Mining Event or State Sequences: A Social Science Perspective Gilbert Ritschard Department of

Sequences Sequences and Difference Equations &quot;Sequences&quot; is a central topic in

Sequences Sequences and Difference Equations &quot;Sequences&quot; is a central topic in

Events Event-driven programming Event loop Event dispatch Event handling Event Driven

Events Event-driven programming Event loop Event dispatch Event handling Event Driven

Methods Updating Variables Console Programs int life = 42; life life = 42 life; 21 life =

Cement, Aggregates, Mining Presentation Cement, Aggregates and Mining Cement, Aggregates and

Frequent Pattern Mining Frequent Sequence Mining Frequent Tree Mining Christian Borgelt

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

More Event Combinators CML provides two more event combinators: guard and withNack : val guard :

W4231: Analysis of Algorithms Shortest Path with Budget Constraint 11/16/99 (revised 11/18/99) We

Efficient List-based Computation of the String Subsequence Kernel Slimane Bellaouar 1 Hadda

Applying Subsequence Matching to Collaborative Filtering Alejandro Bellog n Pablo S

Algorithms for Computing the Longest Parameterized Common Subsequence Costas S. Iliopoulos 1 ,

Design and Analysis of Algorithms

Spectral properties of approximation sequences: Helena Mascarenhas IST, University of Lisbon and

An optimal Arc Consistency algorithm for a chain of Atmost constraints with cardinality Mohamed

Scientific Programming: Algorithms (part B) Programming paradigms Luca Bianco - Academic Year

Sambuz

Useful Links

Newsletter

Mail Us

Sequences Sequences and Difference Equations "Sequences" is a central topic in

Sequences Sequences and Difference Equations "Sequences" is a central topic in