EULERIAN TOUR ALGORITHMS
FOR DATA VISUALIZATION AND THE PAIRVIZ PACKAGE
Catherine Hurley NUI Maynooth R.W. Oldford
- U. Waterloo
July 8 2009 UseR!
Monday 13 July 2009
Graphics: Effect Ordering Packages: seriation, gclus, corrgram - - PowerPoint PPT Presentation
E ULERIAN TOUR ALGORITHMS FOR DATA VISUALIZATION AND THE P AIR V IZ PACKAGE Catherine Hurley R.W. Oldford NUI Maynooth U. Waterloo July 8 2009 UseR! Monday 13 July 2009 Graphics: Effect Ordering Packages: seriation, gclus, corrgram
July 8 2009 UseR!
Monday 13 July 2009
Standard order
Tars1 Tars2 Aede1 Aede2 Head Aede3
Correlation order
Tars2 Aede1 Aede2 Aede3 Tars1 Head
0.2
Monday 13 July 2009
Flea data: correlation order
Monday 13 July 2009
A B C D E F
Monday 13 July 2009
Mice in 5 diet groups, response is lifetime Nodes are treatments, edges are planned comparisons Weights are p-values
0.0083 0.0147 0.3111 N/N85 N/R40 N/R50 NP R/R50 lopro
N/R50 N/N85 NP lopro N/R50 N/R40 R/R50 N/R50 10 20 30 40 50
Planned comparisons of diets
Lifetime
5 10
Differences
Reducing calories and protein increases lifetime
Monday 13 July 2009
A B C D E F G H A B C D E F G H
Open hamiltonian path Closed hamiltonian path Closed eulerian path on K7
A B C D E F G
Monday 13 July 2009
Weight edges by 1-corr, eulerian follows low weight edges
X1 X2 X3 Y1 Y2
Monday 13 July 2009
Each node in G is a predictor subset edge: add/drop predictor
eg Each node in G is a var, each node in L(G) is var pair, edge is 3-d transition
Cube for factorial experiment
000 001 010 011 100 101 110 111
A B C DAB AC AD BC BD CD
Monday 13 July 2009
1 2 3 4 5 6 7 1 2 3 4 5 6 7 1 2 3 4 5 6 7 Monday 13 July 2009
1 2 3 4 5 6 7 Monday 13 July 2009
Chinese postman does this in optimal way
0.0083 0.0147 0.3111 N/N85 N/R40 N/R50 NP R/R50 lopro
Monday 13 July 2009
Complete-no weights
5 10 15 20 25 30 35 2 4 6 8
Etour 9
5 10 15 20 25 30 35 2 4 6 8
Eseq 9
5 10 15 20 25 30 35 2 4 6 8
hpaths 9
prefers low vertices prefers low edges 4 hamiltonians
Monday 13 July 2009
50 100 150 200 1000 2000 3000 4000
Algorithm eseq: Eurodist edge weights
50 100 150 200 1000 2000 3000 4000
Weighted etour on Eurodist
50 100 150 200 1000 2000 3000 4000 Weighted hamiltonians on Eurodist 1 2 3 4 5 6 7 8 9 10ignores weights Starts in Geneva
hamiltonian decomp, with increasing path lengths
Monday 13 July 2009
Mammal sleep data Y= log brain wt. Predictors A= non dreaming sleep, B=dreaming sleep, C=log body wt, D=life span
A B C D AB AC AD BC BD CD ABC ABD ACD BCD ABCD
stepwise regression algorithm
Sleep data: Model residuals.
ABCD BCD CD ACD ABCD ABC BC C AC ABC AB A AD ABD BD D AD ACD AC A D CD C B BD BCD BC B AB ABD ABCDMonday 13 July 2009
Sleep data: 10 vars (nodes) 45 edges Eulerian has length 50
Eulerian on scagnostics: Outlying
GP Bd L Br Bd SW PS TS SE PS TS D L P L PS Br P TS Bd TS PS P D D Br P D 0.0 0.3 0.6
Using outlying index from scagnostics package for eulerian traversal zoom on first half of display
Monday 13 July 2009
Reduce the graph NN graph: eliminate edges with outlier index < .2 Reduces graph from 10 to 5 nodes, and 45 to 5 edges Other nodes have no edges
NN Eulerian on scagnostics: Outlying
GP L Bd SW L Br GP 0.0 0.3 0.6
SW Bd Br L GP Monday 13 July 2009
with Adrian Waddell, UW
Monday 13 July 2009