Model visualisation (with ggplot2) Hadley Wickham Rice University - PowerPoint PPT Presentation

Model visualisation (with ggplot2) Hadley Wickham Rice University Monday, 13 July 2009

1. Introducing plot.lm 2. The current state of play. Why this is suboptimal. 3. A better strategy: separate data from representation. 4. Why a canned set of plots is not good enough. Monday, 13 July 2009

plot.lm(mod, Residuals vs Fitted which = 1) 0.3 624 ● 0.2 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.1 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● Residuals ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● − 0.1 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● − 0.2 ● ● 133 ● − 0.3 574 − 0.2 0.0 0.2 0.4 0.6 Fitted values lm(log10(sales) ~ city * ns(date, 3) + factor(month)) Monday, 13 July 2009

# File src/library/stats/R/plot.lm.R show[which] <- TRUE # Part of the R package, http://www.R-project.org r <- residuals(x) # yh <- predict(x) # != fitted() for glm # This program is free software; you can redistribute it and/or modify w <- weights(x) # it under the terms of the GNU General Public License as published by if(!is.null(w)) { # drop obs with zero wt: PR#6640 # the Free Software Foundation; either version 2 of the License, or wind <- w != 0 # (at your option) any later version. r <- r[wind] # yh <- yh[wind] # This program is distributed in the hope that it will be useful, w <- w[wind] # but WITHOUT ANY WARRANTY; without even the implied warranty of labels.id <- labels.id[wind] # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the } # GNU General Public License for more details. n <- length(r) # if (any(show[2L:6L])) { # A copy of the GNU General Public License is available at s <- if (inherits(x, "rlm")) x$s # http://www.r-project.org/Licenses/ else if(isGlm) sqrt(summary(x)$dispersion) else sqrt(deviance(x)/df.residual(x)) plot.lm <- hii <- lm.influence(x, do.coef = FALSE)$hat function (x, which = c(1L:3,5), ## was which = 1L:4, if (any(show[4L:6L])) { caption = list("Residuals vs Fitted", "Normal Q-Q", cook <- if (isGlm) cooks.distance(x) "Scale-Location", "Cook's distance", else cooks.distance(x, sd = s, res = r) "Residuals vs Leverage", } expression("Cook's dist vs Leverage " * h[ii] / (1 - h[ii]))), } panel = if(add.smooth) panel.smooth else points, if (any(show[2L:3L])) { sub.caption = NULL, main = "", ylab23 <- if(isGlm) "Std. deviance resid." else "Standardized residuals" ask = prod(par("mfcol")) < length(which) && dev.interactive(), ..., r.w <- if (is.null(w)) r else sqrt(w) * r id.n = 3, labels.id = names(residuals(x)), cex.id = 0.75, ## NB: rs is already NaN if r=0, hii=1 qqline = TRUE, cook.levels = c(0.5, 1.0), rs <- dropInf( r.w/(s * sqrt(1 - hii)), hii ) add.smooth = getOption("add.smooth"), } label.pos = c(4,2), cex.caption = 1) { if (any(show[5L:6L])) { # using 'leverages' dropInf <- function(x, h) { r.hat <- range(hii, na.rm = TRUE) # though should never have NA if(any(isInf <- h >= 1.0)) { isConst.hat <- all(r.hat == 0) || warning("Not plotting observations with leverage one:\n ", diff(r.hat) < 1e-10 * mean(hii, na.rm = TRUE) paste(which(isInf), collapse=", "), } call.=FALSE) if (any(show[c(1L, 3L)])) x[isInf] <- NaN l.fit <- if (isGlm) "Predicted values" else "Fitted values" } if (is.null(id.n)) x id.n <- 0 } else { id.n <- as.integer(id.n) if (!inherits(x, "lm")) if(id.n < 0L || id.n > n) stop("use only with \"lm\" objects") stop(gettextf("'id.n' must be in {1,..,%d}", n), domain = NA) if(!is.numeric(which) || any(which < 1) || any(which > 6)) } stop("'which' must be in 1L:6") if(id.n > 0L) { ## label the largest residuals isGlm <- inherits(x, "glm") if(is.null(labels.id)) show <- rep(FALSE, 6) labels.id <- paste(1L:n) Monday, 13 July 2009

Model visualisation (with ggplot2) Hadley Wickham Rice University - PowerPoint PPT Presentation

Model visualisation (with ggplot2) Hadley Wickham Rice University Monday, 13 July 2009 1. Introducing plot.lm 2. The current state of play. Why this is suboptimal. 3. A better strategy: separate data from representation. 4. Why a canned

Science Visualisation Paul Bourke iVEC @ University of Western Australia Contents What is

Science Visualisation Paul Bourke iVEC @ University of Western Australia Contents What is

RESULTS VISUALISATION RESULTS VISUALISATION At the beginning of this course, the large majority

Detection, Analysis and Visualisation of Detection, Analysis and Visualisation of Georeferenced

The Diversity of Visualisation: Selected visualisation projects from 2011 Paul Bourke Contents

Computer Graphics: Visualisation Lecture 3 Taku Komura tkomura@inf.ed.ac.uk Institute for

Machine Learning and Visualisation Ian T. Nabney Aston University, Birmingham, UK March 2015

Data Visualisation with R Data Visualisation with R Workshop Day 1 Workshop Day 1 Scales and

Data Handling: Import, Cleaning and Visualisation Lecture 11: Visualisation and Dynamic Documents

Visualisation and Presentation in Victorian Statistics John Aldrich University of Southampton

An introduction to interactive visualisation Christophe 20 novembre 2013 In summary, the most

Teaching an old dog new tricks How data visualisation & design can be used by everyone SAMRA

Why data visualisation is nice By Dave and Rebecca, UTS Library david.litting@uts.edu.au

Diana and Tseries An Open Source Production and Visualisation Package Lisbeth Bergholt Baltmet

BEYOND VISUALISATION: 3-DIMENSIONAL PLANNING OF FUTURE ENVIRONMENTS 3D GIS SEMINAR Rydges Hotel,

Novel physical representations for the visualisation of science data and mathematics Paul Bourke

Semi milept leptonic ic B decays La Thuile, Mar 10 th -17 th , 2012 Giulia Ricciardi

On the spectrum of light mesons in an SU(2) gauge theory with dynamical fermions Reinhard Alkofer

Baryon in the sextet gauge model Zoltan Fodor, Kieran Holland, Julius Kuti, Santanu Mondal,

Model Checker NuSMV Hao Zheng Department of Computer Science and Engineering University of South

Less dimensions and the origin of DM Martti Raidal National Institute of Chemical Physics and

A magnetic instability of the Sakai-Sugimoto model Nele Callebaut Ghent University and Vrije

DOIs for MILC asqtad ensembles Future Roles of ILDG J. Simone and the MILC Collaboration Heath

Some Progress on the Construction of Technicolor and Extended Technicolor Models Robert Shrock

Model visualisation (with ggplot2) Hadley Wickham Rice University - PowerPoint PPT Presentation

Model visualisation (with ggplot2) Hadley Wickham Rice University Monday, 13 July 2009 1. Introducing plot.lm 2. The current state of play. Why this is suboptimal. 3. A better strategy: separate data from representation. 4. Why a canned

Science Visualisation Paul Bourke iVEC @ University of Western Australia Contents What is

Science Visualisation Paul Bourke iVEC @ University of Western Australia Contents What is

RESULTS VISUALISATION RESULTS VISUALISATION At the beginning of this course, the large majority

Detection, Analysis and Visualisation of Detection, Analysis and Visualisation of Georeferenced

The Diversity of Visualisation: Selected visualisation projects from 2011 Paul Bourke Contents

Computer Graphics: Visualisation Lecture 3 Taku Komura tkomura@inf.ed.ac.uk Institute for

Machine Learning and Visualisation Ian T. Nabney Aston University, Birmingham, UK March 2015

Data Visualisation with R Data Visualisation with R Workshop Day 1 Workshop Day 1 Scales and

Data Handling: Import, Cleaning and Visualisation Lecture 11: Visualisation and Dynamic Documents

Visualisation and Presentation in Victorian Statistics John Aldrich University of Southampton

An introduction to interactive visualisation Christophe 20 novembre 2013 In summary, the most

Teaching an old dog new tricks How data visualisation &amp; design can be used by everyone SAMRA

Why data visualisation is nice By Dave and Rebecca, UTS Library david.litting@uts.edu.au

Diana and Tseries An Open Source Production and Visualisation Package Lisbeth Bergholt Baltmet

BEYOND VISUALISATION: 3-DIMENSIONAL PLANNING OF FUTURE ENVIRONMENTS 3D GIS SEMINAR Rydges Hotel,

Novel physical representations for the visualisation of science data and mathematics Paul Bourke

Semi milept leptonic ic B decays La Thuile, Mar 10 th -17 th , 2012 Giulia Ricciardi

On the spectrum of light mesons in an SU(2) gauge theory with dynamical fermions Reinhard Alkofer

Baryon in the sextet gauge model Zoltan Fodor, Kieran Holland, Julius Kuti, Santanu Mondal,

Model Checker NuSMV Hao Zheng Department of Computer Science and Engineering University of South

Less dimensions and the origin of DM Martti Raidal National Institute of Chemical Physics and

A magnetic instability of the Sakai-Sugimoto model Nele Callebaut Ghent University and Vrije

DOIs for MILC asqtad ensembles Future Roles of ILDG J. Simone and the MILC Collaboration Heath

Some Progress on the Construction of Technicolor and Extended Technicolor Models Robert Shrock

Teaching an old dog new tricks How data visualisation & design can be used by everyone SAMRA