Linear Algebra Review (with a Small Dose of Optimization)
SLIDE 1

Linear Algebra Review (with a Small Dose of Optimization)

Hristo Paskov CS246

SLIDE 2

Outline

  • Basic definitions
  • Subspaces and Dimensionality
  • Matrix functions: inverses and eigenvalue decompositions

  • Convex optimization
SLIDE 3

Vectors and Matrices

  • Vector x ∈ ℝ^n, by convention a column

x = [x_1 ⋮ x_n]

  • May also write x = (x_1, …, x_n)^T

SLIDE 4

Vectors and Matrices

  • Matrix A ∈ ℝ^{m×n}

A = [a_11 ⋯ a_1n; ⋮ ⋱ ⋮; a_m1 ⋯ a_mn]

  • Written in terms of rows or columns

  • A = [a_1^T ⋮ a_m^T], with rows a_i^T, a_i ∈ ℝ^n

  • A = [a^(1) … a^(n)], with columns a^(j) ∈ ℝ^m

SLIDE 5

Multiplication

  • Vector-vector: x, y ∈ ℝ^n → x^T y ∈ ℝ

x^T y = Σ_i x_i y_i

  • Matrix-vector: x ∈ ℝ^n, A ∈ ℝ^{m×n} → Ax ∈ ℝ^m

Ax = [a_1^T x ⋮ a_m^T x], one inner product per row

  • Equivalently Ax = Σ_j x_j a^(j), a linear combination of the columns of A
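To make the two products concrete, here is a minimal NumPy sketch (my addition, not from the slides; the array values are arbitrary):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([4.0, 5.0, 6.0])
A = np.array([[1.0, 0.0, 2.0],
              [0.0, 3.0, 1.0]])   # A in R^{2x3}

# Vector-vector: x^T y is a scalar
print(x @ y)          # 1*4 + 2*5 + 3*6 = 32.0

# Matrix-vector: Ax lives in R^2; entry i is a_i^T x
print(A @ x)          # [7. 9.]

# Equivalently, Ax is a linear combination of the columns of A
print(sum(x[j] * A[:, j] for j in range(3)))  # same as A @ x
```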
SLIDE 6

Multiplication

  • Matrix-matrix: A ∈ ℝ^{m×n}, B ∈ ℝ^{n×p} → AB ∈ ℝ^{m×p}

(diagram: the inner dimensions must agree, e.g. a 5×3 matrix times a 3×4 matrix gives a 5×4 result)

SLIDE 7

Multiplication

  • Matrix-matrix: A ∈ ℝ^{m×n}, B ∈ ℝ^{n×p} → AB ∈ ℝ^{m×p}

– a_i^T are the rows of A, b^(j) the columns of B

AB = A [b^(1) … b^(p)] = [Ab^(1) … Ab^(p)]

  • Entrywise, (AB)_{ij} = a_i^T b^(j)
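A small NumPy check (added for illustration; the matrices are random) that entry (i, j) of AB is the inner product of row i of A with column j of B:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 3))   # m x n
B = rng.standard_normal((3, 4))   # n x p

C = A @ B                          # m x p
# (AB)_{ij} = a_i^T b^(j): row i of A dotted with column j of B
i, j = 2, 1
print(np.isclose(C[i, j], A[i, :] @ B[:, j]))  # True
```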
SLIDE 8

Multiplication Properties

  • Associative

(AB)C = A(BC)

  • Distributive

A(B + C) = AB + AC

  • NOT commutative: in general AB ≠ BA

– Dimensions may not even be conformable
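These properties are easy to sanity-check numerically; a throwaway NumPy sketch (my addition) with random square matrices:

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3))
C = rng.standard_normal((3, 3))

print(np.allclose((A @ B) @ C, A @ (B @ C)))    # associative: True
print(np.allclose(A @ (B + C), A @ B + A @ C))  # distributive: True
print(np.allclose(A @ B, B @ A))                # commutative? False in general
```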

SLIDE 9

Useful Matrices

  • Identity matrix I ∈ ℝ^{n×n}

– AI = A and IA = A

I_{ij} = 1 if i = j, 0 if i ≠ j

  • Diagonal matrix D ∈ ℝ^{n×n}

D = diag(d_1, …, d_n), zero everywhere off the diagonal
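A short NumPy illustration (added, with arbitrary values) of the identity and diagonal matrices:

```python
import numpy as np

I = np.eye(3)                     # identity
D = np.diag([1.0, 2.0, 3.0])      # diagonal with entries d_1..d_3

A = np.arange(9.0).reshape(3, 3)
print(np.allclose(A @ I, A) and np.allclose(I @ A, A))  # AI = IA = A: True
print(D @ np.ones(3))             # scales coordinate i by d_i: [1. 2. 3.]
```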

SLIDE 10

Useful Matrices

  • Symmetric A ∈ ℝ^{n×n}: A = A^T
  • Orthogonal Q ∈ ℝ^{n×n}:

Q^T Q = Q Q^T = I

– Columns/rows are orthonormal

  • Positive semidefinite A ∈ ℝ^{n×n}:

x^T A x ≥ 0 for all x ∈ ℝ^n

– Equivalently, there exists B ∈ ℝ^{n×n} with A = B^T B
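A minimal NumPy sketch (my addition): B^T B is always symmetric positive semidefinite, and a QR factorization is one convenient way to produce an orthogonal Q:

```python
import numpy as np

rng = np.random.default_rng(2)
B = rng.standard_normal((4, 4))

A = B.T @ B                        # B^T B is symmetric positive semidefinite
print(np.allclose(A, A.T))         # symmetric: True
print(np.all(np.linalg.eigvalsh(A) >= -1e-10))  # eigenvalues >= 0: True

# An orthogonal Q (here from a QR factorization) satisfies Q^T Q = I
Q, _ = np.linalg.qr(B)
print(np.allclose(Q.T @ Q, np.eye(4)))          # True
```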

SLIDE 11

Outline

  • Basic definitions
  • Subspaces and Dimensionality
  • Matrix functions: inverses and eigenvalue decompositions

  • Convex optimization
SLIDE 12

Norms

  • Quantify the “size” of a vector
  • Given x ∈ ℝ^n, a norm ‖·‖ satisfies

1. ‖αx‖ = |α| ‖x‖
2. ‖x‖ = 0 ⇔ x = 0
3. ‖x + y‖ ≤ ‖x‖ + ‖y‖

  • Common norms:

1. Euclidean (ℓ2) norm: ‖x‖_2 = √(x_1² + ⋯ + x_n²)
2. ℓ1 norm: ‖x‖_1 = |x_1| + ⋯ + |x_n|
3. ℓ∞ norm: ‖x‖_∞ = max_i |x_i|
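The three norms map directly onto np.linalg.norm; a quick sketch (added for reference):

```python
import numpy as np

x = np.array([3.0, -4.0])
print(np.linalg.norm(x, 2))       # l2 norm: sqrt(9 + 16) = 5.0
print(np.linalg.norm(x, 1))       # l1 norm: |3| + |-4| = 7.0
print(np.linalg.norm(x, np.inf))  # l-infinity norm: max |x_i| = 4.0
```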

SLIDE 13

Linear Subspaces

SLIDE 14

Linear Subspaces

  • A subspace S ⊂ ℝ^n satisfies

1. 0 ∈ S
2. If x, y ∈ S and α ∈ ℝ, then αx + y ∈ S

  • Vectors v_1, …, v_k span S if

S = { Σ_i α_i v_i : α_i ∈ ℝ }
SLIDE 15

Linear Independence and Dimension

  • Vectors v_1, …, v_k are linearly independent if

Σ_i α_i v_i = 0 ⟺ all α_i = 0

– Every linear combination of the v_i is unique

  • dim(S) = k if v_1, …, v_k span S and are linearly independent

– If u_1, …, u_m also span S, then

  • m ≥ k
  • If m > k, the u_i are NOT linearly independent
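One practical test of linear independence is to stack the vectors as columns and compare the rank to the vector count; a NumPy sketch (my addition, with a deliberately dependent v3):

```python
import numpy as np

v1 = np.array([1.0, 0.0, 1.0])
v2 = np.array([0.0, 1.0, 1.0])
v3 = v1 + 2 * v2                  # deliberately a combination of v1, v2

V = np.column_stack([v1, v2, v3])
# Independent iff the rank equals the number of vectors
print(np.linalg.matrix_rank(V))   # 2, so v1, v2, v3 are NOT independent
```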
SLIDE 16

Linear Independence and Dimension

SLIDE 17

Matrix Subspaces

  • A matrix A ∈ ℝ^{m×n} defines two subspaces

– Column space: col(A) = { Aα : α ∈ ℝ^n } ⊂ ℝ^m
– Row space: row(A) = { A^T β : β ∈ ℝ^m } ⊂ ℝ^n

  • Nullspace of A: null(A) = { x ∈ ℝ^n : Ax = 0 }

– null(A) ⊥ row(A)
– dim(null(A)) + dim(row(A)) = n
– Analog for the column space
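A small NumPy illustration (added; the matrix is a hand-picked rank-1 example) of rank–nullity and of reading a nullspace basis off the SVD:

```python
import numpy as np

A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0]])   # rank 1: the second row is twice the first

r = np.linalg.matrix_rank(A)      # dim row(A) = dim col(A)
n = A.shape[1]
print(r, n - r)                   # rank 1, nullspace dimension 2 (rank-nullity)

# A basis for null(A): right singular vectors with zero singular value
_, s, Vt = np.linalg.svd(A)
null_basis = Vt[r:]               # rows spanning null(A)
print(np.allclose(A @ null_basis.T, 0))  # True
```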

SLIDE 18

Matrix Rank

  • rank(A) gives the dimensionality of the row and column spaces
  • If A ∈ ℝ^{m×n} has rank k, it can be decomposed into a product of an m×k and a k×n matrix

(diagram: an m×n matrix of rank k factors as an m×k matrix times a k×n matrix)
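A sketch (my addition) that builds a rank-2 matrix as such a product and recovers an equivalent factorization from the truncated SVD:

```python
import numpy as np

rng = np.random.default_rng(3)
U = rng.standard_normal((6, 2))   # m x k
W = rng.standard_normal((2, 5))   # k x n
A = U @ W                         # rank-2 matrix in R^{6x5}

print(np.linalg.matrix_rank(A))   # 2
# Recover such a factorization from the truncated SVD
u, s, vt = np.linalg.svd(A)
A2 = (u[:, :2] * s[:2]) @ vt[:2]  # product of 6x2 and 2x5 factors
print(np.allclose(A, A2))         # True
```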

SLIDE 19

Properties of Rank

  • For A, B ∈ ℝ^{m×n}

1. rank(A) ≤ min(m, n)
2. rank(A) = rank(A^T)
3. rank(AB) ≤ min(rank(A), rank(B))
4. rank(A + B) ≤ rank(A) + rank(B)

  • A has full rank if rank(A) = min(m, n)
  • If m > rank(A), the rows are not linearly independent

– Same for the columns if n > rank(A)

SLIDE 20

Outline

  • Basic definitions
  • Subspaces and Dimensionality
  • Matrix functions: inverses and eigenvalue decompositions

  • Convex optimization
SLIDE 21

Matrix Inverse

  • A ∈ ℝ^{n×n} is invertible iff rank(A) = n
  • The inverse is unique and satisfies

1. A^{-1} A = A A^{-1} = I
2. (A^{-1})^{-1} = A
3. (A^T)^{-1} = (A^{-1})^T
4. If A and B are invertible, then AB is invertible and (AB)^{-1} = B^{-1} A^{-1}

SLIDE 22

Systems of Equations

  • Given A ∈ ℝ^{m×n} and b ∈ ℝ^m, wish to solve

Ax = b

– A solution exists only if b ∈ col(A)

  • Possibly an infinite number of solutions
  • If A is invertible, then x = A^{-1} b

– Notational device; do not actually invert matrices
– Computationally, use solving routines like Gaussian elimination
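In code this is exactly the np.linalg.solve vs. np.linalg.inv distinction; a minimal sketch (added, with a hand-picked 2×2 system):

```python
import numpy as np

A = np.array([[3.0, 1.0],
              [1.0, 2.0]])
b = np.array([9.0, 8.0])

x = np.linalg.solve(A, b)         # preferred: a solver, not an explicit inverse
print(x)                          # [2. 3.]
print(np.allclose(A @ x, b))      # True
print(np.allclose(x, np.linalg.inv(A) @ b))  # same answer, but slower and less stable
```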

SLIDE 23

Systems of Equations

  • What if b ∉ col(A)?
  • Find the x whose image

ŷ = Ax is closest to b

– ŷ is the projection of b onto col(A)
– Also known as regression

  • Assume rank(A) = n < m

x = (A^T A)^{-1} A^T b,  ŷ = A (A^T A)^{-1} A^T b

– A^T A is invertible; A (A^T A)^{-1} A^T is the projection matrix
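A NumPy sketch (my addition, random data) showing that np.linalg.lstsq agrees with the normal-equations formula, and that the residual is orthogonal to col(A):

```python
import numpy as np

rng = np.random.default_rng(4)
A = rng.standard_normal((20, 3))              # m > n, full column rank
b = rng.standard_normal(20)

x, *_ = np.linalg.lstsq(A, b, rcond=None)     # minimizes ||Ax - b||_2
x_normal = np.linalg.solve(A.T @ A, A.T @ b)  # x = (A^T A)^{-1} A^T b
print(np.allclose(x, x_normal))               # True

y_hat = A @ x                                 # projection of b onto col(A)
print(np.allclose(A.T @ (b - y_hat), 0))      # residual orthogonal to col(A)
```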

SLIDE 24

Systems of Equations

A^T (b − Ax) = 0   ⟹   A^T A x = A^T b   ⟹   x = (A^T A)^{-1} A^T b

(the residual b − Ax is orthogonal to col(A); solving the normal equations gives the least-squares solution from the previous slide)

SLIDE 25

Eigenvalue Decomposition

  • The eigenvalue decomposition of a symmetric A ∈ ℝ^{n×n} is

A = U Σ U^T = Σ_i λ_i u_i u_i^T

– Σ = diag(λ_1, …, λ_n) contains the eigenvalues of A
– U is orthogonal and its columns are the eigenvectors u_i of A

  • If A is not symmetric but diagonalizable,

A = U Σ U^{-1}

– Σ is diagonal but possibly complex
– U is not necessarily orthogonal
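For symmetric matrices, np.linalg.eigh returns exactly this decomposition; a verification sketch (added, random symmetric A):

```python
import numpy as np

rng = np.random.default_rng(5)
B = rng.standard_normal((4, 4))
A = (B + B.T) / 2                     # symmetrize

lam, U = np.linalg.eigh(A)            # eigenvalues (ascending), orthonormal eigenvectors
print(np.allclose(U @ np.diag(lam) @ U.T, A))   # A = U Sigma U^T: True
print(np.allclose(U.T @ U, np.eye(4)))          # U orthogonal: True

# Rank-one expansion A = sum_i lambda_i u_i u_i^T
A_sum = sum(lam[i] * np.outer(U[:, i], U[:, i]) for i in range(4))
print(np.allclose(A_sum, A))                    # True
```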

SLIDE 26

Characterizations of Eigenvalues

  • Traditional formulation

Ax = λx

– Leads to the characteristic polynomial

det(A − λI) = 0

  • Rayleigh quotient (symmetric A)

λ_max = max_{x ≠ 0} (x^T A x) / (x^T x)

SLIDE 27

Eigenvalue Properties

  • For A ∈ ℝ^{n×n} with eigenvalues λ_i

1. tr(A) = Σ_i λ_i
2. det(A) = λ_1 λ_2 ⋯ λ_n
3. rank(A) = #{ i : λ_i ≠ 0 }

  • When A is symmetric

– The eigenvalue decomposition is the singular value decomposition
– The eigenvectors for nonzero eigenvalues give an orthogonal basis for row(A) = col(A)
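Properties 1–3 are quick to verify numerically; a sketch (my addition, random symmetric A):

```python
import numpy as np

rng = np.random.default_rng(6)
B = rng.standard_normal((4, 4))
A = (B + B.T) / 2

lam = np.linalg.eigvalsh(A)
print(np.isclose(np.trace(A), lam.sum()))        # tr(A) = sum of eigenvalues
print(np.isclose(np.linalg.det(A), lam.prod()))  # det(A) = product of eigenvalues
print(np.linalg.matrix_rank(A) == np.sum(~np.isclose(lam, 0)))  # rank = # nonzero
```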
SLIDE 28

Simple Eigenvalue Proof

  • Why det − [& = 0?
  • Assume is symmetric and full rank
  • 1. = YΣY
  • 2. − [& = YΣY − [& = Y Σ − [& Y
  • 3. If [ = [, *ab eigenvalue of − [& is 0
  • 4. Since det − [& is product of eigenvalues,
  • ne of the terms is 0, so product is 0

YY = &

SLIDE 29

Outline

  • Basic definitions
  • Subspaces and Dimensionality
  • Matrix functions: inverses and eigenvalue decompositions

  • Convex optimization
SLIDE 30

Convex Optimization

  • Find the minimum of a function subject to constraints on the solution
  • Business / economics / game theory

– Resource allocation
– Optimal planning and strategies

  • Statistics and Machine Learning

– All forms of regression and classification
– Unsupervised learning

  • Control theory

– Keeping planes in the air!

SLIDE 31

Convex Sets

  • A set C is convex if ∀ x, y ∈ C and ∀ α ∈ [0, 1],

αx + (1 − α)y ∈ C

– The line segment between any two points of C also lies in C

  • Examples

– Intersections of halfspaces
– ℓp balls
– Intersections of convex sets

SLIDE 32

Convex Functions

  • A real-valued function f is convex if dom f is convex and, ∀ x, y ∈ dom f and ∀ α ∈ [0, 1],

f(αx + (1 − α)y) ≤ α f(x) + (1 − α) f(y)

– The graph of f is upper bounded by the line segment between the points (x, f(x)) and (y, f(y)) on the graph

SLIDE 33

Gradients

  • Differentiable convex f with dom f = ℝ^n
  • The gradient ∇f(x) gives a linear approximation of f at x

∇f(x) = ( ∂f/∂x_1, …, ∂f/∂x_n )^T

– Near x, f(x + h) ≈ f(x) + h^T ∇f(x)
SLIDE 34

Gradients

SLIDE 35

Gradient Descent

  • To minimize f, move down the gradient

– But not too far!
– Optimum when ∇f(x) = 0

  • Given f, learning rate α, starting point x_0

x = x_0
Do until ∇f(x) = 0:
    x = x − α ∇f(x)
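A minimal implementation sketch (my addition; the quadratic test function and step size are arbitrary choices):

```python
import numpy as np

def gradient_descent(grad, x0, alpha=0.1, tol=1e-8, max_iter=10_000):
    """Minimize a differentiable convex f given its gradient."""
    x = x0
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:   # optimum when the gradient vanishes
            break
        x = x - alpha * g             # step down the gradient
    return x

# Example: f(x) = ||x - c||^2 / 2 has gradient x - c and minimizer c
c = np.array([1.0, -2.0])
print(gradient_descent(lambda x: x - c, np.zeros(2)))  # ~[ 1. -2.]
```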

SLIDE 36

Stochastic Gradient Descent

  • Many learning problems have extra structure

f(w) = Σ_i ℓ(w; x_i)

  • Computing the gradient requires iterating over all points, which can be too costly
  • Instead, compute the gradient at a single training example

SLIDE 37

Stochastic Gradient Descent

  • Given f(w) = Σ_i ℓ(w; x_i), learning rate α, and starting point w_0

w = w_0
Do until f(w) is nearly optimal:
    For i = 1 to n in random order:
        w = w − α ∇ℓ(w; x_i)

  • Finds a nearly optimal w
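A sketch of this loop for the least-squares objective on the next slide (my addition; the data, learning rate, and epoch count are arbitrary):

```python
import numpy as np

def sgd_least_squares(X, y, alpha=0.01, epochs=50, rng=None):
    """SGD for f(w) = sum_i (y_i - w^T x_i)^2, one example at a time."""
    if rng is None:
        rng = np.random.default_rng(0)
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for i in rng.permutation(len(y)):        # random order each pass
            g = -2.0 * (y[i] - X[i] @ w) * X[i]  # gradient at one example
            w = w - alpha * g
    return w

rng = np.random.default_rng(7)
X = rng.standard_normal((200, 3))
w_true = np.array([1.0, -1.0, 2.0])
y = X @ w_true
print(sgd_least_squares(X, y))   # close to w_true
```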
SLIDE 38

Minimize Σ_i (y_i − w^T x_i)²  (least-squares objective, the running SGD example)

SLIDE 39

Learning Parameter