Automatic Selection of Partitioning Variables for Small Multiple Displays
Anushka Anand, Justin Talbot
Automatic Selection of Partitioning Variables for Small Multiple - - PowerPoint PPT Presentation
Automatic Selection of Partitioning Variables for Small Multiple Displays Anushka Anand, Justin Talbot Presented by Yujie Yang, CPSC 547 Information Visualization Agenda Introduction Goodness-of-Split Criteria Algorithm
Anushka Anand, Justin Talbot
CPSC547 Presentation - Yujie Yang 2 2015/11/26
CPSC547 Presentation - Yujie Yang 3
2015/11/26
CPSC547 Presentation - Yujie Yang 4
Firstly introduced by John and Paul Tukey Wilkinson extended original idea
“Judge the relative interest of different displays” Scagnostics – scatterplot diagnostics
2015/11/26
2015/11/26 CPSC547 Presentation - Yujie Yang 5
CPSC547 Presentation - Yujie Yang 6
2015/11/26
CPSC547 Presentation - Yujie Yang 7 2015/11/26
CPSC547 Presentation - Yujie Yang 8 2015/11/26
CPSC547 Presentation - Yujie Yang 9
Data: X: proportion of old houses built before 1940 for census tracts in Boston Y: median value of owner-occupied houses 2015/11/26
CPSC547 Presentation - Yujie Yang 10 2015/11/26 (b) (c) (a)Input scatterplot (b)Partitioned by distance (c)Partitioned by random permutation (d)Distribution of Skewed value (d) (a)
CPSC547 Presentation - Yujie Yang 11
Where Xi is the true scagnostic value of the i-th partition and μi and σi are the mean and standard deviation of the scagnostic measures over the repeated random permutations of the i-th partition.
2015/11/26
CPSC547 Presentation - Yujie Yang 12 2015/11/26
CPSC547 Presentation - Yujie Yang 13
2015/11/26
Data: X: linolenic measurement in olive oil specimens in Italy Y: linoleic measurement in olive oil specimens in Italy
CPSC547 Presentation - Yujie Yang 14
2015/11/26
CPSC547 Presentation - Yujie Yang 15
2015/11/26
Data: X: death rate of world countries Y: birth rate of world countries
CPSC547 Presentation - Yujie Yang 16 2015/11/26
Scagnostic: monotonic Partitioning
Scagnostic: monotonic Partitioning
CPSC547 Presentation - Yujie Yang 17
2015/11/26
Data: X: admission rate at US universities Y: graduation rate at US universities
CPSC547 Presentation - Yujie Yang 18
Random 10% of full dataset
Scagnostic: monotonic
Partitioning variable: admit ACT scores
Z-score: 3.6
2015/11/26
Full dataset
Scagnostic: monotonic
Partitioning variable: admit ACT scores
Z-score: 16.4
CPSC547 Presentation - Yujie Yang 19
2015/11/26
CPSC547 Presentation - Yujie Yang 20
2015/11/26
CPSC547 Presentation - Yujie Yang 21
2015/11/26
CPSC547 Presentation - Yujie Yang 22 2015/11/26
CPSC547 Presentation - Yujie Yang 23
[1] Anand A, Talbot J. Automatic Selection of Partitioning Variables for Small Multiple Displays[J]. 2016. [2] Friedman J H, Stuetzle W. John W. Tukey's work on interactive graphics[J]. Annals of Statistics, 2002: 1629-1639. [3] Wilkinson L, Anand A, Grossman R L. Graph-Theoretic Scagnostics[C]//INFOVIS. 2005, 5: 21. [4] Wilkinson L, Wills G. Scagnostics distributions[J]. Journal of Computational and Graphical Statistics, 2008, 17(2): 473-491.
2015/11/26