Download the notebook for this section from the CS109 repo or here: - PowerPoint PPT Presentation

Download the notebook for this section from the CS109 repo or here: http://bit.ly/109_S6 1

Linear Regression Y=α+β1X1+...+βn+Xn+ϵ Four Assumptions of Linear Regression: 2

Linear Regression Y=α+β1X1+...+βn+Xn+ϵ Four Assumptions of Linear Regression: 1. Linearity : Our dependent variable Y is a linear combination of the explanatory variables X (and the error terms) 2. Observations are independent of one another 3. I.I.D error terms that are Normally Distributed ~ N(0,σ^2) 4. Design matrix X is Full Rank . That is: 1. We don't have more predictors than we have observations (aka, our model is not “overdetermined”) 2. We can’t have an exact linear relationship between two of our predictors ( multicollinearity) 3

Linear Regression Linear models presume that the only stochastic part of the model is the normally-distributed noise ϵ around the predicted mean. 4

Linear Regression Suppose we have a binary outcome variable. Can we use Linear Regression? 5

Linear Regression for binary outcomes? If our OLS regression is of the form: Y = β 0 + β 1 X + ϵ ; where Y = (0, 1) Then we will have the following problems: • The error terms are heteroskedastic • ϵ is not normally distributed because Y takes on only two values • The predicted probabilities can be greater than 1 or less than 0 6 More generally, just not a very useful model!

Datasets where linear regression is problematic Linear models presume that the only stochastic part of the model is the normally-distributed noise ϵ around the predicted mean. However, there are many data sets where this is not the case such as: • Binary response data where there are only two outcomes (yes/no, 0/1, etc.) • Categorical or Ordinal Data of any type, where the outcome is one of a number of discrete (possibly ordered) classes • Count data in which the outcome is restricted to non-negative integers • Continuous data in which the noise is not normally distributed Generalized Linear Models (GLMs), of which Logistic regression is a specific type, allow us to model and predict these types of datasets without violating the assumptions of linear regression. Logistic regression is most useful for binary response and categorical data. 7

Odds & Odds Ratios p = - odds Recall the definitions of an odds : 1 p The odds has a range of 0 to ¥ with values greater than 1 associated with an event being more likely to occur than to not occur and values less than 1 associated with an event that is less likely to occur than not occur. The logit is defined as the log of the odds: æ ö p ( ) ( ) ( ) = = - - ln odds ln ln p ln 1 p ç ÷ - 1 p è ø This transformation is useful because it creates a variable with a range from - ¥ to + ¥ . Hence, this transformation solves the problem we encountered in fitting a linear model to probabilities. Because probabilities (the dependent variable) only range from 0 to 1, we can get linear predictions that are outside of this range. If we transform our probabilities to logits, then we do not have this problem because the range of the logit is not restricted. In addition, the interpretation of logits is simple— take the exponential of the logit and you have the odds for the two groups in question. 8

Logistic Regression b + b x e 0 1 ln[p/(1-p)] = b 0 + b 1 X = P ( y x ) b + b + x 1 e 0 1 § ln[p/(1-p)]: log odds ratio, or "logit“ § [range=- ∞ to + ∞ ] § p/(1-p) is the "odds ratio" § [range=0 to ∞ ] § p is the probability that the event Y occurs, p(Y=1) § [range=0 to 1] 9

Download the notebook for this section from the CS109 repo or here: - PowerPoint PPT Presentation

Download the notebook for this section from the CS109 repo or here: http://bit.ly/109_S6 1 Linear Regression Y=+1X1+...+n+Xn+ Four Assumptions of Linear Regression: 2 Linear Regression Y=+1X1+...+n+Xn+ Four Assumptions of

Engineering Notebook What is an Engineering Notebook? An engineering notebook helps a team to

Lecture 4: Cylinders, quadric surfaces and vector functions 1 math2110L4Full.notebook 2

8.6.20 1 English Term 6 Week 2.notebook June 06, 2020 8.6.20 2 English Term 6 Week 2.notebook

Monday - Art 1 Week 3 Afternoons.notebook June 12, 2020 Monday - Art 2 Week 3

English day 3 retell.notebook June 05, 2020 English day 3 retell.notebook June 05, 2020 Listen

MATH 2110Q Practice Exam 3 1 exam3ReviewAnswers.notebook November 03, 2014 2

Monday - Art 1 Week 6 Afternoons.notebook July 03, 2020 Monday - Art 2 Week 6

Download for PC The most accessible web browser available. Download Step 1:

LOGISTIC REGRESSION AND GENERALIZED LINEAR MODELS W. RYAN LEE CS109/AC209/STAT121 ADVANCED

SVM AND STATISTICAL LEARNING THEORY W. RYAN LEE CS109/AC209/STAT121 ADVANCED SECTION

Module V: Vector Spaces Module V Math 237 Module V Section V.0 Section V.1 Section V.2

How you can access our ftp server. Step 1: Download Filezilla Client Download ftp client named

Word histogram Map data type To compare different authors, or to identify a good match in a We

Exceptions Exceptions and Errors When a runtime error occurs, the program terminates with an

Objects and Classes Objects with attributes Objects are the basis of object-oriented programming.

App programming Cross-compiling Programming a smartphone app is a bit different from PC Write

Relevance Vector Machines Jukka Lankinen LUT February 21, 2011 Jukka Lankinen Relevance Vector

Ipopt Tutorial Andreas W achter IBM T.J. Watson Research Center andreasw@watson.ibm.com

Comparison of Local Feature Descriptors Subhransu Maji Department of EECS, University of

Simon Pabst (Double Negative VFX) Talk Overview 1. The need in production (Jeff) 2. The

Markov Models Kunsch, H.R., State Space and Hidden Markov Models . ETH- Zurich, Zurich;

The Hidden Markov The Hidden Markov Model (HMM) Model (HMM) 1 Lecture Outline Lecture Outline

MARKOV MODELING AND TRAFFIC FLOW MODELING FILTERS APPLIED IN EXISTING SIGNALING OF CELLULAR

Data for Official Statistics Marco Puts, Piet Daas, Martijn Tennekes Road sensors Road sensor