Advanced Section #1: Linear Algebra and Hypothesis Testing
Will Claybaugh
CS109A Introduction to Data Science
Pavlos Protopapas and Kevin Rader


SLIDE 1 (title slide)

SLIDE 2

WARNING: This deck uses animations to focus attention and break apart complex concepts. Either watch the section video or read the deck in Slide Show mode.

SLIDE 3

Today's topics:
  • Linear Algebra (Math 21b, 8 weeks)
  • Maximum Likelihood Estimation (Stat 111/211, 4 weeks)
  • Hypothesis Testing (Stat 111/211, 4 weeks)

Our time limit: 90 minutes

  • We'll work together
  • I owe you this knowledge
  • Come debt collect at OHs if I don't do my job today
  • Let's do this : )
  • We will move fast
  • You are only expected to catch the big ideas
  • Much of the deck is intended as notes
  • I will give you the TL;DR of each slide
  • We will recap the big ideas at the end of each section
SLIDE 4

LINEAR ALGEBRA (THE HIGHLIGHTS)

SLIDE 5

Interpreting the dot product

What does a dot product mean?

    (1, 5, 2) · (3, −2, 4) = 1·3 + 5·(−2) + 2·4

  • Weighted sum: We weight the entries of one vector by the entries of the other
  • Either vector can be seen as weights
  • Pick whichever is more convenient in your context
  • Measure of length: A vector dotted with itself gives the squared distance from (0,0,0) to the given point
  • (1,5,2) · (1,5,2) = 1·1 + 5·5 + 2·2 = (1−0)² + (5−0)² + (2−0)² = 30
  • (1,5,2) thus has length √30
  • Measure of orthogonality: For vectors of fixed length, a · b is biggest when a and b point in the same direction, and zero when they are at a 90° angle
  • Making a vector longer (multiplying all entries by c) scales the dot product by the same amount

Question: how could we get a true measure of orthogonality (one that ignores length)?
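A quick NumPy check of all three interpretations (a sketch; the cosine at the end is one answer to the question above, since dividing out both lengths leaves only direction):

    import numpy as np

    a = np.array([1, 5, 2])
    b = np.array([3, -2, 4])

    print(a @ b)                  # weighted sum: 1*3 + 5*(-2) + 2*4 = 1
    print(np.sqrt(a @ a))         # length of a: sqrt(30)
    # Cosine similarity: a length-free measure of orthogonality
    print((a @ b) / (np.linalg.norm(a) * np.linalg.norm(b)))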

SLIDE 6

Dot Product for Matrices

Matrix multiplication is a bunch of dot products

  • In fact, it is every possible dot product, nicely organized
  • Matrices being multiplied must have the shapes (n, m) · (m, p), and the result is of size (n, p)
  • (the middle dimensions have to match, and then drop out)

    [ 2 −1  3 ]                  [ 20 −11 ]
    [ 1  5  2 ]     [ 3  1 ]     [  1  32 ]
    [−1  1  3 ]  ·  [−2  7 ]  =  [  7   0 ]
    [ 6  4  9 ]     [ 4 −2 ]     [ 46  16 ]
    [ 2  2  1 ]                  [  6  14 ]
      5 by 3         3 by 2        5 by 2

For example, entry (2,1) of the output is (1,5,2) · (3,−2,4) = 1, and entry (5,2) is (2,2,1) · (1,7,−2) = 14.
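A minimal sketch of "every possible dot product, nicely organized," using the matrices from the worked example:

    import numpy as np

    A = np.array([[2, -1, 3], [1, 5, 2], [-1, 1, 3], [6, 4, 9], [2, 2, 1]])  # 5 by 3
    B = np.array([[3, 1], [-2, 7], [4, -2]])                                 # 3 by 2
    C = A @ B                                                                # 5 by 2

    # Every entry is one dot product: entry (i, j) = (row i of A) · (column j of B)
    print(np.isclose(C[1, 0], A[1, :] @ B[:, 0]))   # (1,5,2) · (3,−2,4) = 1
    print(C)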

SLIDE 7

Column by Column

  • Since matrix multiplication is a dot product, we can think of it as a weighted sum
  • We weight each column as specified, and sum them together
  • This produces the first column of the output
  • The second column of the output combines the same columns under different weights
  • Rows? (next slide; a code sketch follows below)

In the running example, the first column of the output is a weighted sum of the columns of the 5-by-3 matrix, with weights from the first column of the 3-by-2 matrix:

    3 · (2, 1, −1, 6, 2) + (−2) · (−1, 5, 1, 4, 2) + 4 · (3, 2, 3, 9, 1) = (20, 1, 7, 46, 6)
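The column-by-column view in code, reusing the running example:

    import numpy as np

    A = np.array([[2, -1, 3], [1, 5, 2], [-1, 1, 3], [6, 4, 9], [2, 2, 1]])
    B = np.array([[3, 1], [-2, 7], [4, -2]])

    col1 = 3 * A[:, 0] + (-2) * A[:, 1] + 4 * A[:, 2]   # weights from B's first column
    print(col1)                               # [20  1  7 46  6]
    print(np.allclose((A @ B)[:, 0], col1))   # True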
SLIDE 8

Row by Row

  • Apply a row of A as weights on the rows of B to get a row of output

In the running example, the row (1, 5, 2) of A weights the rows of B:

    1 · (3, 1) + 5 · (−2, 7) + 2 · (4, −2) = (1, 32)

which is the corresponding row of the output.
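And the row-by-row view, again with the running example:

    import numpy as np

    A = np.array([[2, -1, 3], [1, 5, 2], [-1, 1, 3], [6, 4, 9], [2, 2, 1]])
    B = np.array([[3, 1], [-2, 7], [4, -2]])

    row2 = 1 * B[0, :] + 5 * B[1, :] + 2 * B[2, :]   # weights from A's second row (1, 5, 2)
    print(row2)                               # [ 1 32]
    print(np.allclose((A @ B)[1, :], row2))   # True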
SLIDE 9

LINEAR ALGEBRA (THE HIGHLIGHTS): Span

SLIDE 10

Span and Column Space

  • Span: every possible linear combination of some vectors
  • If the vectors are the columns of a matrix, call it the column space of that matrix
  • If the vectors are the rows of a matrix, it is the row space of that matrix
  • Q: What is the span of {(−2,3), (5,1)}? What is the span of {(1,2,3), (−2,−4,−6), (1,1,1)}? (a numeric check follows below)

[Figure: the weighted-sum picture again: weights β₁, β₂, β₃ applied to the columns of a matrix produce a point in the column space.]
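One way to check the span questions numerically: the rank of a matrix is the dimension of its column space (a sketch; the comments note what the ranks imply):

    import numpy as np

    # Columns are the vectors in question
    S1 = np.array([[-2, 5],
                   [ 3, 1]])
    S2 = np.array([[1, -2, 1],
                   [2, -4, 1],
                   [3, -6, 1]])

    print(np.linalg.matrix_rank(S1))  # 2: the span is all of R^2
    print(np.linalg.matrix_rank(S2))  # 2: (-2,-4,-6) = -2*(1,2,3), so the span is only a plane in R^3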

SLIDE 11

LINEAR ALGEBRA (THE HIGHLIGHTS): Bases

SLIDE 12

Basis Basics

  • Given a space, we'll often want to come up with a set of vectors that span it
  • If we give a minimal set of vectors, we've found a basis for that space
  • A basis is a coordinate system for a space
  • Any element in the space is a weighted sum of the basis elements
  • Each element has exactly one representation in the basis
  • The same space can be viewed in any number of bases - pick a good one

SLIDE 13

Function Bases

  • Bases can be quite abstract:
  • Taylor polynomials express any analytic function in the infinite basis 1, x, x², x³, …
  • The Fourier transform expresses many functions in a basis built on sines and cosines
  • Radial Basis Functions express functions in yet another basis
  • In all cases, we get an 'address' for a particular function
  • In the Taylor basis, sin(x) = (0, 1, 0, −1/6, 0, 1/120, …)
  • Bases become super important in feature engineering
  • Y may depend on some transformation of x, but we only have x itself
  • We can include features 1, x, x², x³, … to approximate it (a sketch follows below)

[Figure: Taylor approximations to y = sin(x)]
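A small sketch of the 'address' idea: truncating sin's Taylor coordinates gives the polynomial approximations in the figure (evaluated here at x = 1):

    import numpy as np

    x = 1.0
    coords = [0, 1, 0, -1/6, 0, 1/120]   # sin's 'address' in the basis 1, x, x², x³, x⁴, x⁵
    approx = sum(c * x**k for k, c in enumerate(coords))
    print(approx, np.sin(x))             # 0.84166..., 0.84147...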

SLIDE 14

LINEAR ALGEBRA (THE HIGHLIGHTS): Interpreting Transpose and Inverse

SLIDE 15

Transpose

  • Transposes switch columns and rows. Written Aᵀ
  • Better dot product notation: a · b is often expressed as aᵀb
  • Interpreting: the matrix multiplication AB is rows of A dotted with columns of B
  • AᵀB is columns of A dotted with columns of B
  • ABᵀ is rows of A dotted with rows of B
  • Transposes (sort of) distribute over multiplication and addition:

    (AB)ᵀ = BᵀAᵀ     (A + B)ᵀ = Aᵀ + Bᵀ     (Aᵀ)ᵀ = A

Examples (a code check follows below):

    x = (3, 2, 3, 9)    xᵀ = the same entries written as a column

    A = [ 3  2  3  9 ]    Aᵀ = [ 3  1 ]
        [ 1 −1  2  7 ]         [ 2 −1 ]
                               [ 3  2 ]
                               [ 9  7 ]
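A quick NumPy check of the transpose identities and the "which dot products" interpretation, using the example matrix A above and a made-up B of matching shape:

    import numpy as np

    A = np.array([[3, 2, 3, 9],
                  [1, -1, 2, 7]])
    B = np.random.randn(2, 4)

    print(np.allclose((A @ B.T).T, B @ A.T))               # (AB^T)^T = B A^T
    print(np.allclose((A + B).T, A.T + B.T))               # transpose distributes over +
    print(np.isclose((A @ B.T)[0, 1], A[0, :] @ B[1, :]))  # A B^T dots rows of A with rows of B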

SLIDE 16

Inverses

  • Algebraically, AA⁻¹ = A⁻¹A = I
  • Geometrically, A⁻¹ writes an arbitrary point b in the coordinate system provided by the columns of A
  • Proof (read this later):
  • Consider Ax = b. We're trying to find weights x that combine A's columns to make b
  • The solution x = A⁻¹b means that when A⁻¹ multiplies a vector, we get that vector's coordinates in A's basis
  • Matrix inverses exist iff the columns of the matrix form a basis
  • A million other equivalents to invertibility: the Invertible Matrix Theorem

[Figure: How do we write (−2,1) in this basis? Just multiply A⁻¹ by (−2,1); a code sketch follows below.]
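A sketch of "A⁻¹ gives coordinates". The basis matrix here is made up for illustration; the mechanics are general:

    import numpy as np

    A = np.array([[1.0, 1.0],          # basis vectors as columns (hypothetical basis)
                  [0.0, 2.0]])
    b = np.array([-2.0, 1.0])

    coords = np.linalg.inv(A) @ b      # coordinates of b in A's basis
    print(coords)                      # [-2.5  0.5]
    print(np.allclose(A @ coords, b))  # the weighted sum of A's columns rebuilds b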

SLIDE 17

LINEAR ALGEBRA (THE HIGHLIGHTS): Eigenvalues and Eigenvectors

SLIDE 18

Eigenvalues

  • Sometimes, multiplying a vector by a matrix just scales the vector
  • The red vector's length triples
  • The orange vector's length halves
  • All other vectors point in new directions
  • The vectors that simply stretch are called eigenvectors. The amount they stretch is their eigenvalue
  • Anything along the given axis is an eigenvector; here, (−2,5) is an eigenvector, so (−4,10) is too
  • We often pick the version with length 1
  • When they exist, eigenvectors/eigenvalues can be used to understand what a matrix does

[Figure: the original vectors, and the same vectors after multiplying by a 2x2 matrix A]

SLIDE 19

Interpreting Eigenthings

Warnings and Examples:

  • Eigenvalues/eigenvectors only apply to square matrices
  • Eigenvalues may be 0 (indicating some axis is removed entirely)
  • Eigenvalues may be complex numbers (indicating the matrix applies a rotation)
  • Eigenvalues may repeat, with one eigenvector per repetition (the matrix scales some n-dimensional subspace)
  • Eigenvalues may repeat, with some eigenvectors missing (shears)
  • If we have a full set of eigenvectors, we know everything about the given matrix S, and S = QDQ⁻¹
  • Q's columns are eigenvectors, D is a diagonal matrix of eigenvalues
SLIDE 20

Calculating Eigenvalues

  • Eigenvalues can be found by:
  • A computer program
  • But what if we need to do it on a blackboard?
  • The definition Ax = λx
  • This says that for special vectors x, multiplying by the matrix A is the same as just scaling by λ (x is then an eigenvector matching eigenvalue λ)
  • The equation det(A − λIₙ) = 0
  • Iₙ is the n-by-n identity matrix. In effect, we subtract λ from the diagonal of A
  • Determinants are tedious to write out, but this produces a polynomial in λ which can be solved to find the eigenvalues
  • Eigenvectors matching known eigenvalues can be found by solving (A − λIₙ)x = 0 for x (a code sketch follows below)
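Both routes on a small example: for a 2-by-2 matrix the blackboard polynomial is det(A − λI) = λ² − trace(A)·λ + det(A), and np.linalg.eig does the same job numerically:

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [1.0, 2.0]])

    # Blackboard route: roots of lambda^2 - trace*lambda + det
    lams = np.roots([1, -np.trace(A), np.linalg.det(A)])
    print(sorted(lams))                  # [1.0, 3.0]

    # Computer route: eigenvalues and unit-length eigenvectors
    vals, vecs = np.linalg.eig(A)
    print(vals)
    print(np.allclose(A @ vecs[:, 0], vals[0] * vecs[:, 0]))  # Ax = λx holds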

SLIDE 21

LINEAR ALGEBRA (THE HIGHLIGHTS): Matrix Decomposition

SLIDE 22

Matrix Decompositions

  • Eigenvalue Decomposition: Some square matrices can be decomposed into scalings along particular axes
  • Symbolically: S = QDQ⁻¹; D a diagonal matrix of eigenvalues; Q made up of eigenvectors, but possibly wild (unless S was symmetric; then Q is orthonormal)
  • Polar Decomposition: Every matrix M can be expressed as a rotation (which may introduce or remove dimensions) and a stretch
  • Symbolically: M = UP or M = PU; P positive semi-definite, U's columns orthonormal
  • Singular Value Decomposition: Every matrix M can be decomposed into a rotation in the original space, a scaling, and a rotation in the final space
  • Symbolically: M = UΣVᵀ; U and V orthonormal, Σ diagonal (though not square)
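All three factorizations checked on small examples (a sketch assuming SciPy is available for the polar decomposition):

    import numpy as np
    from scipy.linalg import polar

    S = np.array([[2.0, 1.0],
                  [1.0, 2.0]])           # symmetric, so Q comes out orthonormal
    vals, Q = np.linalg.eig(S)
    print(np.allclose(S, Q @ np.diag(vals) @ np.linalg.inv(Q)))   # S = Q D Q⁻¹

    M = np.random.randn(3, 2)
    U, P = polar(M)                       # M = U P; U orthonormal columns, P PSD
    print(np.allclose(M, U @ P))

    Usvd, s, Vt = np.linalg.svd(M, full_matrices=False)
    print(np.allclose(M, Usvd @ np.diag(s) @ Vt))                 # M = U Σ Vᵀ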

SLIDE 23

Where we've been

  • Vector and matrix dot product (the worked 5-by-3 times 3-by-2 example)
  • Invertibility: Ax = b; x = A⁻¹b
  • Basis as a coordinate system for a space
  • Span
  • Eigenvalues: Ax = λx; S = QDQ⁻¹
  • Other decompositions: M = UP or M = PU; M = UΣVᵀ

SLIDE 24

Practice

  • Simplify (AᵀB)ᵀ. What is in position 1,4? What does it mean if that value is large?
  • What are the eigenvectors of A²? What are the eigenvalues?
  • What does it mean when an entry of AᵀA is 0?
  • What about all the facts about inverses and dot products I've forgotten since undergrad? [Matrix Cookbook] [Linear Algebra Formulas]

SLIDE 25

LINEAR ALGEBRA (SUMMARY)

SLIDE 26

Notes

  • Matrix multiplication: every dot product between rows of A and columns of B
  • Important special case: a matrix times a vector is a weighted sum of the matrix columns
  • Dot products measure similarity between two vectors: 0 is extremely un-alike, bigger is pointing in the same direction and/or longer
  • Alternatively, a dot product is a weighted sum
  • Bases: a coordinate system for some space. Everything in the space has a unique address
  • Matrix factorization: all matrices are rotations and stretches. We can decompose 'rotation and stretch' in different ways
  • Sometimes, re-writing a matrix into factors helps us with algebra
  • Matrix inverses don't always exist. The 'stretch' part may collapse a dimension. M⁻¹ can be thought of as the matrix that expresses a given point in terms of the columns of M
  • Span and row/column space: every weighted sum of the given vectors
  • Linear (in)dependence is just "can some vector in the collection be represented as a weighted sum of the others?" If not, the vectors are linearly independent

SLIDE 27

AFTER A BREAK: LINEAR REGRESSION

SLIDE 28

Review and Practice: Linear Regression

  • In linear regression, we're trying to write our response data y as a linear function of our [augmented] features X:

    response = β₁·feature₁ + β₂·feature₂ + β₃·feature₃ + …        y = Xβ

  • Our response isn't actually a linear function of our features, so we instead find betas that produce a column ŷ that is as close as possible to y (in Euclidean distance):

    min over β of (y − ŷ)ᵀ(y − ŷ) = min over β of (y − Xβ)ᵀ(y − Xβ)

  • Goal: show that the optimal β̂ = (XᵀX)⁻¹Xᵀy
  • Steps (a code check follows below):
    1. Drop the sqrt [why is that legal?]
    2. Distribute the transpose
    3. Distribute/FOIL all terms
    4. Take the derivative with respect to β (Matrix Cookbook (69) and (81): the derivative of βᵀa is aᵀ, …)
    5. Simplify and solve for β
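The closed form next to NumPy's least-squares solver on made-up data (a sketch; in practice np.linalg.solve or lstsq is preferred to forming the inverse explicitly):

    import numpy as np

    rng = np.random.default_rng(0)
    X = np.column_stack([np.ones(50), rng.normal(size=(50, 2))])  # intercept + 2 features
    y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(scale=0.3, size=50)

    beta_hat = np.linalg.inv(X.T @ X) @ X.T @ y        # (XᵀX)⁻¹ Xᵀ y
    beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)
    print(np.allclose(beta_hat, beta_lstsq))           # same answer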

SLIDE 29

Interpreting LR: Algebra

  • The best possible betas, β̂ = (XᵀX)⁻¹Xᵀy, can be viewed in two parts:
  • Numerator (Xᵀy): columns of X dotted with (the) column of y; how related are the feature vectors and y?
  • Denominator (XᵀX): columns of X dotted with columns of X; how related are the different features?
  • If the variables have mean zero, "how related" is literally "correlation"
  • Roughly, our solution assigns big values to features that predict y, but punishes features that are similar to (combinations of) other features
  • Bad things happen if XᵀX is uninvertible (or nearly so)

SLIDE 30

Interpreting LR: Geometry

  • The only points that CAN be expressed as Xβ are those in the span/column space of X
  • By minimizing distance, we're finding the point in the column space that is closest to the actual y vector
  • The point Xβ̂ is the projection of the observed y values onto the things linear regression can express
  • Warnings:
  • Adding more columns (features) can only make the span bigger and the fit better
  • If some features are very similar, results will be unstable

    ŷ = Xβ̂ = X(XᵀX)⁻¹Xᵀy

[Figure: the observed response values, projected onto the best we can do with a linear combination of features; a code check follows below.]
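A check that the fit really is a projection (a sketch with made-up data): the 'hat matrix' H = X(XᵀX)⁻¹Xᵀ is idempotent, and the residual is orthogonal to every column of X:

    import numpy as np

    rng = np.random.default_rng(0)
    X = np.column_stack([np.ones(50), rng.normal(size=(50, 2))])
    y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(scale=0.3, size=50)

    H = X @ np.linalg.inv(X.T @ X) @ X.T
    y_hat = H @ y
    print(np.allclose(H @ H, H))              # projecting twice = projecting once
    print(np.allclose(X.T @ (y - y_hat), 0))  # residual is ⟂ to the column space of X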
SLIDE 31

STATISTICS: Linear Regression

SLIDE 32

ML to Statistics

  • What we've done so far is the Machine Learning style of modeling:
  • Specify a loss function [squared error] and a model format [y = Xβ]
  • Find the settings that minimize the loss function
  • Statistics adds more assumptions and gets back richer results
  • It adds assumptions about where the data came from
  • We can ask "What about other beta values? On a different day, might we get that result instead?"
  • Statistics can answer yes/no via our assumptions about where the data come from

[Figure: a loss surface over β₁ and β₂, with the optimal betas marked at the minimum; labeled 'Machine Learning' at the minimum and 'Statistics' over the wider surface.]

SLIDE 33

Statistical Assumptions

What are Statistics' assumptions about the linear regression data?

  • The observed X values simply are.
  • The observed y come from a Normal(mu(x), sigma) distribution, mu(x) is linear, and each y is drawn independently from the others
  • For each observation i: yᵢ ~ N(xᵢβ, σ²)
  • Equivalently, for the whole column y: y ~ N(Xβ, σ²Iₙ)

Why these assumptions?

  • Any story about how the X data came to be is problem-dependent
  • Makes the problem solvable using 1800s-era tools

Question: How could we alter these assumptions? (a simulation sketch follows below)

Image from: http://bolt.mph.ufl.edu/6050-6052/unit-4b/module-15/

[Figure annotations: μᵢ = xᵢβ; σ² an unknown constant; β a vector of unknown constants; yᵢ ~ N(μᵢ, σ²)]
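A minimal simulation of the assumed data-generating story (all numbers made up):

    import numpy as np

    rng = np.random.default_rng(1)
    n = 100
    X = np.column_stack([np.ones(n), rng.uniform(0, 10, size=n)])
    beta = np.array([2.0, 0.7])          # unknown constants, per the assumptions
    sigma = 1.5                          # unknown constant

    mu = X @ beta                        # μᵢ = xᵢβ (linear in the features)
    y = rng.normal(mu, sigma)            # yᵢ ~ N(μᵢ, σ²), drawn independently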

SLIDE 34

Maximum Likelihood: the other ML

  • We need to guess at the unknown values (β and σ²)

Maximum Likelihood

  • Rule: Guess whatever values of the unknowns make the observed data as probable as possible
  • As a loss function, we feel pain when the data surprise the model
  • Only works if we have a likelihood function
  • Likelihood maps (dataset) -> (probability of seeing that dataset); it uses parameter values (e.g. β and σ²) in the calculation
  • Actually maximizing can be hard
  • But Maximum Likelihood can be shown to be a very good guessing strategy, especially with lots of observations (see Stat 111 or 211)

SLIDE 35

Maximum Likelihood: the other ML

  • Likelihood (probability of seeing data y, given parameters X, β, and σ²):

    P(Y = y | X, β, σ²) = N(Xβ, σ²Iₙ) = |2π·σ²Iₙ|^(−1/2) · exp( −½ (y − Xβ)ᵀ(σ²Iₙ)⁻¹(y − Xβ) )

  • Since X is constant, we're maximizing by choosing the vector β and the scalar σ²
  • Finding the optimal β quickly reduces to the least squares problem we just saw: min over β of (y − Xβ)ᵀ(y − Xβ)
  • Optimal σ̂² = (sum of squared residuals under the optimal β̂) / (number of observations − number of features)
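The log-likelihood in code, maximized numerically to confirm it lands on the least-squares β̂ (a sketch using scipy.optimize; the n − p denominator quoted on the slide is used for σ̂²):

    import numpy as np
    from scipy.optimize import minimize
    from scipy.stats import norm

    rng = np.random.default_rng(1)
    n = 100
    X = np.column_stack([np.ones(n), rng.uniform(0, 10, size=n)])
    y = X @ np.array([2.0, 0.7]) + rng.normal(0, 1.5, size=n)

    def neg_log_lik(params):
        beta, log_sigma = params[:-1], params[-1]
        return -norm.logpdf(y, loc=X @ beta, scale=np.exp(log_sigma)).sum()

    res = minimize(neg_log_lik, x0=np.zeros(3))
    beta_ls = np.linalg.lstsq(X, y, rcond=None)[0]
    print(res.x[:2], beta_ls)                  # the MLE betas match least squares

    resid = y - X @ beta_ls
    print(resid @ resid / (n - X.shape[1]))    # the σ² estimate from the slide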

SLIDE 36

Benefits of assumptions

  • We actually get the joint distribution of the betas:

    β̂_MLE ~ N(β_true, σ²(XᵀX)⁻¹)

  • HW investigates the variance term: how well we can learn each beta, and whether one is linked to another
  • It depends on X!
  • It doesn't depend on y! (If our assumptions are correct)
  • Lets us attach error bars to our estimates, e.g. β₁ = 3 ± .2 (a code check follows below)
  • Main question: What can we do to our X matrix to reduce these variances?
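Error bars from the formula, checked against brute-force re-simulation (a sketch with made-up X, β, and σ):

    import numpy as np

    rng = np.random.default_rng(1)
    n = 100
    X = np.column_stack([np.ones(n), rng.uniform(0, 10, size=n)])
    beta_true, sigma = np.array([2.0, 0.7]), 1.5

    cov = sigma**2 * np.linalg.inv(X.T @ X)      # depends only on X and sigma, not y
    print(np.sqrt(np.diag(cov)))                 # theoretical SEs of beta-hat

    # Brute force: refit on many fresh y vectors drawn from the assumed model
    fits = [np.linalg.lstsq(X, X @ beta_true + rng.normal(0, sigma, n), rcond=None)[0]
            for _ in range(5000)]
    print(np.std(fits, axis=0))                  # matches the theory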

SLIDE 37

Review

  • We can add assumptions about where the data came from and get richer statements from our model
  • A likelihood is a function that tells us how likely any given dataset is. Plug in data, get a probability
  • The MLE finds the parameter settings that make our data as likely as possible
  • Finding the MLE parameter values can be hard; sometimes it's possible via calculus, often it requires computer code

SLIDE 38

STATISTICS: HYPOTHESIS TESTING
OR: WHAT PARAMETERS EXPLAIN THE DATA?

SLIDE 39

A Popper's Grave

  • It's impossible to prove a model is correct
  • In fact, there are many correct models
  • Can you prove increasing a parameter by .0000001% is incorrect?
  • We can only rule models out
  • The great tragedy is that you have been taught to rule out just ONE model, and then quit

SLIDE 40

Model Rejection

  • Important: a 'model' is a (probabilistic) story about how the data came to be, complete with specified values of every parameter
  • The model produces many possible datasets
  • We only have one observed dataset
  • How can we tell if a model is wrong?
  • If the model is unlikely to reproduce the aspects of the data that we care about, it has to go
  • Therefore, we have some real-number summary of the dataset (a 'statistic') by which we'll compare model-generated datasets and our observed dataset
  • If the statistics produced by the model are clearly different than the one from the real data, we reject the model

[Figure: the model generates Datasets A-D, each summarized to Stats A-D; a histogram of those statistics (frequency vs. value of statistic) is compared against the statistic of the observed dataset.]

SLIDE 41

Recap: How to understand any test

  • Any model test specifies:
    1. A (probabilistic) data generating process (jargon: the null hypothesis)
    2. A summary we'll use to compress a dataset (jargon: a statistic)
    3. A rule for comparing the observed and the simulated summaries
  • Example: t-test
    1. The y data are generated via the estimated line/plane, plus Normal(0, sigma) noise, EXCEPT a particular coefficient is actually zero!
    2. The coefficient we'd calculate for that dataset (minus 0), over the SE of the coefficient:

       t statistic = (β̂_observed − 0) / SE(β̂_observed)

    3. Declare the model bad if the observed result is in the top/bottom α% of simulated results (commonly top/bottom 5%)

SLIDE 42

The t-test

Walkthrough:

  • We set a particular beta we care about to zero (call these betas β_null)
  • We simulate 10,000 new datasets using β_null as truth
  • In each of the 10,000 datasets, fit a regression against X and plot the values of the β we care about (the one we set to zero)
  • Plotting the t statistic in each simulation is a little prettier
  • The t statistic calculated from the observed data was 17.8. Do we think the proposed model generated our data?
  • One more thing: amazingly, 'Student' knew what results we'd get from the simulation

[Figure: T-test for β₃ = 0. From the observed data, β_MLE = [2.2, 5, 3, 1.6]; the null sets the third coefficient to zero, β_null = [2.2, 5, 0, 1.6]. Using X_obs, β_null, and σ_MLE, draw y_sim1, y_sim2, …, y_sim10,000 and refit each to get β_sim1, β_sim2, …, β_sim10,000. A code version follows below.]
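The walkthrough as a simulation, with made-up data standing in for the slide's example:

    import numpy as np

    rng = np.random.default_rng(2)
    n = 60
    X = np.column_stack([np.ones(n), rng.normal(size=n)])
    y_obs = X @ np.array([1.0, 0.8]) + rng.normal(0, 1.0, size=n)

    beta_hat = np.linalg.lstsq(X, y_obs, rcond=None)[0]
    beta_null = beta_hat.copy()
    beta_null[1] = 0.0                    # the null: this coefficient is actually zero
    resid = y_obs - X @ beta_hat
    sigma_hat = np.sqrt(resid @ resid / (n - 2))

    # Simulate 10,000 datasets from the null model and refit each one
    sim_betas = np.array([
        np.linalg.lstsq(X, X @ beta_null + rng.normal(0, sigma_hat, n), rcond=None)[0][1]
        for _ in range(10_000)])

    # Share of simulations at least as extreme as the observed coefficient
    print(np.mean(np.abs(sim_betas) >= abs(beta_hat[1])))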

SLIDE 43

The Value of Assumptions

  • Student's clever set-up lets us skip the simulation
  • In fact, all classical tests are built around working out what distribution the results will follow, without simulating
  • Student's work lets us take infinite samples at almost no cost
  • These shortcuts were vital before computers, and are still important today
  • Even so, via simulation we're freer to test and reject more diverse models and use wilder summaries
  • However, the summaries and rules we choose still require thought: some are much better than others

[Flow: Define Model → Get Simulated Datasets/Statistics → Compare to Observed Data → Decision]

SLIDE 44

p-values

  • Hypothesis (model) testing leads to comparing a distribution against a point
  • A natural way to summarize: report what percentage of results are more extreme than the observed data
  • Basically, could the model frequently produce data that looks like ours?
  • This is the p-value: p = .031 means that your observed data is in the top 3.1% of weird results under this model+statistic
  • There is some ambiguity about what 'weird' should mean

Jargon: p-values are "the probability, assuming the null model is exactly true, of seeing a value of [your statistic] as extreme or more extreme than what was seen in the observed data"

[Figure: the distribution of simulation results (frequency vs. value), with the result from the observed dataset marked and the simulations weirder than the observed data shaded. A code sketch follows below.]
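Computing that percentage from simulated statistics (a sketch; both the simulated t statistics and the observed value here are stand-ins, with 'weird' taken to mean extreme in either direction):

    import numpy as np

    rng = np.random.default_rng(2)
    sim_stats = rng.standard_t(df=58, size=10_000)   # stand-in for simulated t statistics
    observed = 2.3                                   # stand-in for the observed t statistic
    p_value = np.mean(np.abs(sim_stats) >= abs(observed))
    print(p_value)   # fraction of simulations weirder than the observed data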

SLIDE 45

p-Value Warnings

  • p-values are only one possible measure of the evidence against a model
  • Rejecting a model when p < threshold is only one possible decision rule
  • Get a book on Decision Theory for more
  • Even if the null model is exactly true, 5% of the time we'll get a dataset with p < .05
  • p < .05 doesn't prove the null model is wrong
  • It does mean that anyone who wants to believe in the null must explain why something unlikely happened

SLIDE 46

Recap

  • We can't rule models in; we can only rule them out
  • We rule models out when the data they produce are different from the observed data
  • We pick a particular candidate (null) model
  • A statistic summarizes the simulated and observed datasets
  • We compare the statistic on the observed data to the simulated or theoretical distribution of statistics the null produces
  • We rule out the null if the observed data doesn't seem to come from the model
  • A p-value summarizes the level of evidence against a particular null
  • "The observed data are in the top 1% of results produced by this model… do you really think we hit those odds?"

SLIDE 47

STATISTICS: HYPOTHESIS TESTING
CONFIDENCE INTERVALS AND COMPOSITE HYPOTHESES

SLIDE 48

Recap

  • Let's talk about what we just did
  • That t-test was ONLY testing the model where the coefficient in question is set to zero
  • Ruling out this model makes it more likely that other models are true, but doesn't tell us which ones
  • If the null is β = 0, getting p < .05 only rules out THAT ONE model
  • When would it make sense to stop after ruling out β = 0, without testing β = .1?

[Figure: a timeline of candidate models stretching from the dawn of time: β = −.4, −.3, −.2, −.1, 0, .1, .2, .3, with our data sitting at one point.]

SLIDE 49

Composite Hypotheses: Multiple Models

  • Often, we're interested in trying out more than one candidate model
  • E.g., can we disprove all models with a negative value of beta?
  • This amounts to simulating data from each of those models (but there are infinitely many…)
  • Sometimes, ruling out the nearest model is enough; we know that the other models have to be worse
  • If a method claims it can test θ < 0, this is how

[Figure: candidate values of β below 0, with β = MLE marked. Can we rule these out? β = 0 will be closer to matching the data (in terms of the t statistic) than any other model in the set*; we only need to test β = 0.]

* Non-trivial; true for Student's t but not for other measures

SLIDE 50

THE Null vs A Null

  • What if we tested LOTS of possible values of beta? (a code sketch follows below)
  • Special conditions must hold to avoid multiple-testing issues; again, the t-test model+statistic passes them
  • We end up with a set/interval of surviving values, e.g. [.1, .3]
  • Sometimes, we can directly calculate what the endpoints would be
  • Since each beta was tested under the rule "reject this beta if the observed results are in the top 5% of weird datasets under this model", we have [.1, .3] as a 95% confidence interval

[Figure: the same timeline of candidate models; the betas near the data survive testing, the rest are ruled out.]
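A brute-force version of "test lots of betas, keep the survivors" (a sketch on made-up data; in practice Student's math or a stats library replaces the simulation loop):

    import numpy as np

    rng = np.random.default_rng(3)
    n = 60
    X = np.column_stack([np.ones(n), rng.normal(size=n)])
    y = X @ np.array([1.0, 0.2]) + rng.normal(0, 1.0, size=n)
    beta_hat = np.linalg.lstsq(X, y, rcond=None)[0]
    sigma_hat = np.sqrt(np.sum((y - X @ beta_hat) ** 2) / (n - 2))

    survivors = []
    for b in np.arange(-0.5, 0.9, 0.02):          # candidate nulls for the slope
        null_beta = np.array([beta_hat[0], b])
        sims = np.array([np.linalg.lstsq(X, X @ null_beta + rng.normal(0, sigma_hat, n),
                                         rcond=None)[0][1] for _ in range(500)])
        # Keep b if the observed slope is NOT in the most extreme 5% under this null
        lo, hi = np.quantile(sims, [0.025, 0.975])
        if lo <= beta_hat[1] <= hi:
            survivors.append(b)

    print(min(survivors), max(survivors))         # roughly a 95% confidence interval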

SLIDE 51

Confidence Interval Warnings

  • WARNING: This kind of accept/reject confidence interval is rare
  • Most confidence intervals do not map to accept/reject regions of a (useful) hypothesis test
  • A confidence interval that excludes zero does not usually mean a result is statistically significant
  • Statistically significant: the data resulting from an experiment/data collection have p < .05 (or some other threshold) against a no-effect model, meaning we reject the no-effect model
  • It depends on how that confidence interval was built
  • A confidence interval's only promise: if you were to repeatedly re-collect the data and build 95% CIs, (assuming our story about data generation is correct) 95% of the intervals would contain the true value

SLIDE 52

Confidence Interval Warnings

  • WARNING: A 95% confidence interval DOES NOT have a 95% chance of holding the true value
  • There may be no such thing as "the true value", b/c the model is wrong
  • Even if the model is true, a "95% chance" statement requires prior assumptions about how nature sets the true value
  • Stick around after section for a heartbreaking demo of why a group of confidence intervals can hit 95% overall while any particular CI can be 0%, 100%, or anything in between

SLIDE 53

HW Preview

  • The 209 homework touches on another kind of confidence interval
  • Class: "How well have I estimated beta?"
  • HW: "How well can I estimate the mean response at each X?"
  • Bonus: "How well can I estimate the possible responses at each X?"

SLIDE 54

Remember those assumptions?

  • We rejected the null model(s) as tested, not the idea that β = 0 - assumptions matter

[Figure: three timelines of candidate models (β = −.2, −.1, 0, .1, .2) around our data. Top: all other betas have their MLE values. Middle: other betas have different values. Bottom: the world is linear with MLE Gaussian noise, vs. the world is not linear, vs. the world has non-Gaussian noise.]

SLIDE 55

Review

  • Ruling out a single model isn't much
  • Sometimes, ruling out a single model is enough to rule out a whole class of models
  • The assumptions our model makes are weak points that should be justified and checked for accuracy
  • Confidence intervals give a reasonable idea of what some unknown value might be
  • Any single confidence interval cannot give a probability
  • Statistical significance is 99% unrelated to confidence intervals

SLIDE 56

STATISTICS: REVIEW

You made it!

SLIDE 57

Review

  • To test a particular model (a particular set of parameters) we must:
    1. Specify a data generating process
    2. Pick a way to measure whether our data plausibly comes from the process
    3. Pick a rule for when a model cannot be trusted (when is the range of simulated results too different from the observed data?)
  • What features make for a good test?
  • We want to make as few assumptions as possible, and choose a measure that is sensitive to deviations from the model
  • If we're clever, we might get math that lets us skip simulating from the model
  • Tension: more assumptions make the math easier, fewer assumptions make the results broader
  • There is no such thing as THE null hypothesis. It's only A null hypothesis.
  • A p-value only tests one null hypothesis, and is rarely enough

SLIDE 58

Going forward

As the course moves on, we'll see:

  • Flexible assumptions about the data generating process:
  • Generalized Linear Models
  • Ways of making fewer assumptions about the data generating process:
  • Bootstrapping
  • Permutation tests
  • Easier questions: instead of 'find a model that explains the world', 'pick the model that predicts best':
  • Validation sets and cross validation

SLIDE 59

Thank you

Office hours are: Monday 6-7:30 (Camilo), Tuesday 6:30-8 (Will)

SLIDE 60

Bonus: Heartbreaking Demo

  • Need a volunteer
  • I'll explain the rules and you'll write down some letter between A and J
  • Everyone else: go to Random.org and get a random number between 1 and 10
  • If your number was __ your winning letters are:

    1: G,H,I,J,A,B,C,D,E    6: F,G,H,I,J,A,B,C,D
    2: E,F,G,H,I,J,A,B,C    7: I,J,A,B,C,D,E,F,G
    3: D,E,F,G,H,I,J,A,B    8: C,D,E,F,G,H,I,J,A
    4: J,A,B,C,D,E,F,G,H    9: H,I,J,A,B,C,D,E,F
    5: B,C,D,E,F,G,H,I,J    10: A,B,C,D,E,F,G,H,I
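A simulation of the demo's arithmetic (the secret letter below is hypothetical): each number's list of nine winning letters contains any fixed letter 90% of the time overall, yet each particular list either contains it or doesn't:

    import random

    # The slide's table: number -> nine winning letters
    win = {1: "GHIJABCDE", 2: "EFGHIJABC", 3: "DEFGHIJAB", 4: "JABCDEFGH",
           5: "BCDEFGHIJ", 6: "FGHIJABCD", 7: "IJABCDEFG", 8: "CDEFGHIJA",
           9: "HIJABCDEF", 10: "ABCDEFGHI"}

    secret = "C"   # hypothetical: whatever letter the volunteer wrote down
    draws = [random.randint(1, 10) for _ in range(100_000)]
    print(sum(secret in win[d] for d in draws) / len(draws))   # ~0.90 overall
    # ...yet for any one draw d, 'secret in win[d]' is simply True or False:
    # the group of intervals hits 90%, a particular interval is right 0% or 100% of the time.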