Khairi Reda | redak@iu.edu School of Informa5cs & Compu5ng, IUPUI
Week 13 | Nov 16, 2016 Filtering and Aggregation Models in Visual Analytics
I590 Interactive Visual Analytics Week 13 | Nov 16, 2016 Filtering - - PowerPoint PPT Presentation
I590 Interactive Visual Analytics Week 13 | Nov 16, 2016 Filtering and Aggregation Models in Visual Analytics Khairi Reda | redak@iu.edu School of Informa5cs & Compu5ng, IUPUI
Khairi Reda | redak@iu.edu School of Informa5cs & Compu5ng, IUPUI
Week 13 | Nov 16, 2016 Filtering and Aggregation Models in Visual Analytics
http://www.michelecoscia.com/wp-content/uploads/2012/08/demon2.png
the data into two set based
geographic extents
aGributes, as opposed to the data point themselves
Based on a slide by Alex Lex
Schneiderman
Willett 2007, Via Alex Lex
filtering and explora5on
representa5ons that bind to interface elements
Riche 2010, Via Alex Lex
in each bin
Number of bins can affect the shape of the histogram
Distribution of passengers by Age 10 Bins 20 Bins
Based on a slide by Alex Lex
http://web.stanford.edu/~mwaskom/software/seaborn/tutorial/plotting_distributions.html
25% of the data
highest 25% of the data
http://image.mathcaptain.com/cms/images/106/box-plot.png
Wikipedia
representa5on to the min/max is to scale the whiskers by the Interquar5le Range (Q3-Q1)
http://stat.mq.edu.au/wp-content/uploads/2014/05/Can_the_Box_Plot_be_Improved.pdf
Streit & Gehlenborg, PoV, Nature Methods, 2014 Via Alex Lex
http://web.stanford.edu/~mwaskom/software/seaborn/tutorial/plotting_distributions.html
Based on a slide by Alex Lex
Based on a slide by Alex Lex
weaknesses and strengths
Those eventually will comprise the clusters
centroid, assigning the point to the closest centroid
roughly “circular” clusters of equal size
http://stats.stackexchange.com/questions/133656/how-to- understand-the-drawbacks-of-k-means
http://stats.stackexchange.com/questions/133656/how-to-understand-the-drawbacks-of-k-means
Item ANr 1 ANr 2 ANr 3 ANr 4 ANr 5 ANr 6 ANr 7 ANr 8 ANr 9 ANr 10 ANr 11 …
A B C …
aGributes
dimensions (aGributes) while keeping as much varia5on as possible
Item ANr 1 ANr 2 ANr 3 ANr 4 ANr 5 ANr 6 ANr 7 ANr 8 ANr 9 ANr 10 ANr 11 …
A B C …
dimensions (axes) that explains the majority of the variance in the data
by variance
component accounts for most variance
http://setosa.io/ev/principal-component-analysis/
space onto a much lower space (e.g, 2D)
points (usually have to compute pairwise similarity between every pair of points)
More difficult to interpret than PCA, but can maintain structures beGer in some cases
Adapted from: http://slideplayer.com/slide/4659134/ and from Remo Chang, 2010
informa5on in order to improve one’s understanding
world that are more tractable
[1] H. Simon 1957. “A Behavioral Model of Rational Choice”
Card, Mackinlay, and Scneiderman. Readings in Information Visualization: Using Vision to Think, Morgan Kaufmann, 1999, pp. 17
Van Wijk, J. “The value of visualization”, 2005
D=Data V=Visualiza5on S=Specifica5on I=Image P=Percep5on K=Knowledge E=Explora5on
Keim, D et al. “Visual Analytics: Definition, process, and challenges”, 2008
Pirolli, P and Card, S. “The sense making process and leverage points for analyst technology as identified through cognitive task analysis”, 2005