Lecture 9: Residual Analysis Instructor: Prof. Shuai Huang - PowerPoint PPT Presentation

May 05, 2023 •247 likes •352 views

Lecture 9: Residual Analysis Instructor: Prof. Shuai Huang Industrial and Systems Engineering University of Washington Residual Analysis (a.k.a. Model Diagnostics) Residual versus fitted values The residuals, by definition, form the

Lecture 9: Residual Analysis Instructor: Prof. Shuai Huang Industrial and Systems Engineering University of Washington
Residual Analysis (a.k.a. Model Diagnostics)
Residual versus fitted values • The residuals, by definition, form the “unsystematic” part of the data, that suppose to be noise and random (any nonrandom behavior raises a red flag)
Q-Q Plot • Q-Q plot is to validate that the residuals follow a certain distribution (e.g., a normal distribution)
Cook’s distance • The Cook’s distance shows the influential data points that have larger than average influence on the parameter estimation. • The Cook’s distance of a data point is built on the idea of how much change will be induced on the estimated parameters if the data point is deleted.
Leverage 𝜖 ො 𝑧 𝑗 • Mathematically, the leverage of a data point is 𝜖𝑧 𝑗 , reflecting how sensitive the prediction on the data point by the model is decided by the observed outcome value 𝑧 𝑗 . • For data points that are surrounded by many close-by data points, their leverages won’t be large. • Thus, we could infer that the data points that sparsely occupy their neighbor areas will have large leverages. • These data points could either be outliers that severely derivate from the linear trend represented by the majority of the data points, or could be valuable data points that align with the linear trend but lack neighbor data points.
Multicollinearity analysis • Suppose the data is generated by this model: 2 , 𝑧 = 𝛾 0 + 𝛾 1 𝑦 1 + 𝛾 2 𝑦 2 + ⋯ + 𝛾 𝑞 𝑦 𝑞 + 𝜁 , 𝜁~𝑂 0, 𝜏 𝜁 2 𝑦 1 = 2𝑦 2 + 𝜗 , 𝜗~𝑂 0,0.1𝜏 𝜁 • Theoretically, we could value the regression model that is shown in above as the ground truth model equally as we value the following models: 𝑧 = 𝛾 0 + 2𝛾 1 + 𝛾 2 𝑦 2 + 𝛾 3 𝑦 3 … + 𝛾 𝑞 𝑦 𝑞 , 𝑧 = 𝛾 0 + 𝛾 1 + 0.5𝛾 2 𝑦 1 + 𝛾 3 𝑦 3 + ⋯ + 𝛾 𝑞 𝑦 𝑞 , 𝑧 = 𝛾 0 + 1000𝑦 1 + 𝛾 2 + 𝛾 1 − 2000 𝑦 2 + 𝛾 3 𝑦 3 + ⋯ + 𝛾 𝑞 𝑦 𝑞 .
Correplot Package
Remarks • Important to understand that, residual analysis is “opportunistic” checking of the model • Like patient checks in hospital for screening or examination. Negative results don’t mean that the patient is healthy • It is a significant focus on regression models, but less developed in machine learning community
R lab • Download the markdown code from course website • Conduct the experiments • Interpret the results • Repeat the analysis on other datasets

Recommend

Pipeline Strategies and conversations behind securing a Residual Bequest Agenda 1. Why Residual?

The Perpetual Pipeline Strategies and conversations behind securing a Residual Bequest Agenda 1. Why Residual? 2. Structured for Residual 3. Residual Results 4. Residual Strategy 5. Residual Relationships 6. Residual Data 7.

1.17k views • 43 slides

Lecture 3 Residual Analysis + Generalized Linear Models Colin Rundel 1/23/2017 1 Residual

Lecture 3 Residual Analysis + Generalized Linear Models Colin Rundel 1/23/2017 1 Residual Analysis 2 3 Atmospheric CO 2 (ppm) from Mauna Loa 360 co2 350 1988 1992 1996 date Where to start? Well, it looks like stuff is going up on

822 views • 44 slides

Lecture 3 Residual Analysis + Generalized Linear Models Colin Rundel 1/23/2018 1 Residual

Lecture 3 Residual Analysis + Generalized Linear Models Colin Rundel 1/23/2018 1 Residual Analysis 2 3 Atmospheric CO 2 (ppm) from Mauna Loa 360 co2 350 1988 1992 1996 date Where to start? Well, it looks like stuff is going up on

509 views • 47 slides

Clarifying Residual Flow s for Surface Water Takes August 2017 Clarifying Residual Flow s

Clarifying Residual Flow s for Surface Water Takes August 2017 Clarifying Residual Flow s Purpose of consultation is to clarify: The difference between residual and minimum flows Purpose of the plan change Who may be affected

518 views • 16 slides

An Overview of Deep Residual Learning Semih Yagcioglu 01.03.2016 Deep Residual Learning

An Overview of Deep Residual Learning Semih Yagcioglu 01.03.2016 Deep Residual Learning Microsoft Research Asia (MSRA) Kaiming He, Xiangyu Zhang, Shaoqing Ren, & Jian Sun. Deep Residual Learning for Image Recognition. arXiv

1.32k views • 54 slides

SPOT Farm East (Elveden) 2016 Residual Herbicide Demonstration Report Background The urea

SPOT Farm East (Elveden) 2016 Residual Herbicide Demonstration Report Background The urea based selective residual herbicide active linuron has been the major residual herbicide applied to the potato crop of the UK on loamy and sandy loam

741 views • 45 slides

Residual Flows for Invertible Generative Modeling Ricky T. Q. Chen, Jens Behrmann, David

Residual Flows for Invertible Generative Modeling Ricky T. Q. Chen, Jens Behrmann, David Duvenaud, Jrn-Henrik Jacobsen Invertible Residual Networks (i-ResNet) It can be shown that residual blocks can be inverted by fixed-point iteration and

445 views • 16 slides

Residual Networks (ResNet) Residual Networks (ResNet) In [1]: import d2l from mxnet import gluon,

3/8/2019 resnet slides Residual Networks (ResNet) Residual Networks (ResNet) In [1]: import d2l from mxnet import gluon, init, nd from mxnet.gluon import nn http://127.0.0.1:8000/resnet.slides.html?print-pdf/#/ 1/10 3/8/2019 resnet slides

895 views • 10 slides

Residual modular Galois representations and their images Samuele Anni University of Warwick

Residual modular Galois representations and their images Residual modular Galois representations and their images Samuele Anni University of Warwick University of Warwick, Number Theory Seminar 2nd December 2013 Residual modular Galois

971 views • 55 slides

SESSION 8: VALUING RESIDUAL CLAIMS (EQUITY) Valuing Equity Equity represents a residual

SESSION 8: VALUING RESIDUAL CLAIMS (EQUITY) Valuing Equity Equity represents a residual cashflow rather than a promised cashflow. You can value equity in one of two ways: By discounting cashflows to equity at the cost of equity to

333 views • 12 slides

RESIDUAL STRAIN MEASUREMENT IN Presentation by: Jason Cantrell COMPOSITES USING CURE-

RESIDUAL STRAIN MEASUREMENT IN Presentation by: Jason Cantrell COMPOSITES USING CURE- COMPOSITES USING CURE EAS 6939 1/31/11 REFERENCING METHOD by: P.G Ifju, X. Niu, B.C Kilday, S.-C. Liu & S.M. Ettinger Background g Residual

422 views • 12 slides

Residual Unit Commitment Procedure in MRTU Lorenzo Kristov Principal Market Architect Joint

Residual Unit Commitment Procedure in MRTU Lorenzo Kristov Principal Market Architect Joint Market Surveillance Committee and ISO Stakeholder Meeting December 11, 2008 Residual Unit Commitment (RUC) is an integral component of the MRTU

379 views • 5 slides

SOUTH STREET LIME RESIDUAL CLEAN UP July 30, 2018 1 APPROXIMATELY 20,000 CUBIC YARDS (30,000

SOUTH STREET LIME RESIDUAL CLEAN UP July 30, 2018 1 APPROXIMATELY 20,000 CUBIC YARDS (30,000 TONS) OF LIME RESIDUAL DEPOSITED ON PROPERTY IN THE EARLY 60S 2 BACKGROUND January 16, 2018 Council Workshop Conclusions of

268 views • 16 slides

RADIOLOGICAL ASSESSMENT OF AN AREA WITH URANIUM RESIDUAL MATERIAL Danyl Prez-Snchez

RADIOLOGICAL ASSESSMENT OF AN AREA WITH URANIUM RESIDUAL MATERIAL Danyl Prez-Snchez Departamento de Medio Ambiente, CIEMAT, Avenida Complutense 22, 28040 Madrid OBJECTIVES DISPOSAL OF URANIUM RESIDUAL MATERIALS in a specific area was

433 views • 14 slides

Extraction of Humic substances from Extraction of Humic substances from residual mixed Municipal

7th International Conference on Sustainable Solid Waste Management 26-29 June 2019, Heraklion, Crete Island, Greece Extraction of Humic substances from Extraction of Humic substances from residual mixed Municipal Solid Waste residual mixed

189 views • 16 slides

Successful Remediation of Residual DNAPL in Tight Materials S. Markesic, J.Rossabi, J.S. Haselow

Successful Remediation of Residual DNAPL in Tight Materials S. Markesic, J.Rossabi, J.S. Haselow (Redox Tech, LLC) REDOX TECH, LLC Overview Residual DNAPL and difficulties with remediation Summary of Remediation Techniques Available

476 views • 26 slides

VALSE webinar 2015 5 27 Feature Selection in Image and Video Recognition

VALSE webinar 2015 5 27 Feature Selection in Image and Video Recognition JianxinWu National Key Laboratory for Novel Software Technology Nanjing University http://lamda.nju.edu.cn Introduction For image classification, how to

727 views • 39 slides

1 Data science and engineering for local weather forecasts Nikhil R Podduturi Data {Scientist,

1 Data science and engineering for local weather forecasts Nikhil R Podduturi Data {Scientist, Engineer} November, 2016 Agenda About MeteoGroup Introduction to weather data Problem description Data science and weather

761 views • 59 slides

STAT 215 Polynomials, Multicollinearity Colin Reimer Dawson Oberlin College 4 November 2016

Outline Polynomial Regression Interactions Multicollinearity STAT 215 Polynomials, Multicollinearity Colin Reimer Dawson Oberlin College 4 November 2016 Outline Polynomial Regression Interactions Multicollinearity Outline Polynomial

522 views • 28 slides

STAT 213 Multicollinearity and Model Selection Colin Reimer Dawson Oberlin College 7 April 2016

Outline Multicollinearity Model Selection STAT 213 Multicollinearity and Model Selection Colin Reimer Dawson Oberlin College 7 April 2016 Outline Multicollinearity Model Selection Outline Multicollinearity Model Selection Outline

562 views • 29 slides

of Australian hospital data Liam HEINIGER a , Norm GOOD b and Sankalp KHANNA b a University of

You can change this image to be appropriate for your topic by inserting an image in this space or use the alternate title slide with lines. Note: only one image should be used and do not overlap the title text. Enter your Business Unit or

378 views • 12 slides

Local or Global Smoothing? A Bandwidth Selector for Dependent Data Francesco Giordano Maria

Local or Global Smoothing? A Bandwidth Selector for Dependent Data Francesco Giordano Maria Lucia Parrella Department of Economics and Statistics University of Salerno COMPSTAT 2010 F. Giordano M.L. Parrella (UNISA) Local or Global

1.6k views • 127 slides

Linear Models DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis

Linear Models DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/OBDA_fall17/index.html Carlos Fernandez-Granda Linear regression Least-squares estimation Geometric interpretation Probabilistic

1.79k views • 154 slides

On the Communication Complexity of Multilateral Trading Nicolas Maudet Ulle Endriss Universit

On the Communication Complexity of Multilateral Trading AAMAS-2004 On the Communication Complexity of Multilateral Trading Nicolas Maudet Ulle Endriss Universit e Paris Dauphine Imperial College London maudet@lamsade.dauphine.fr

447 views • 10 slides