SLIDE 1
Linear algebra and differential equations (Math 54): Lecture 18
Vivek Shende
April 3, 2019
SLIDE 8
Hello and welcome to class!
Last time
We discussed the “least squares” problem, and then considered notions of distance and orthogonality in a general setting, as captured by inner products.
This time
We meet the spectral theorem, and the singular value decomposition. This is the last class on linear algebra. Next week, we begin differential equations!
SLIDE 9
Review: Inner products
Definition
An inner product on a vector space V is a map ⟨· , ·⟩ : V × V → R which is distributive, commutative, and positive.
SLIDE 10
Review: Inner product properties
Distributivity (aka “bilinearity”): ⟨av + bv′, cw + dw′⟩ = ac⟨v, w⟩ + bc⟨v′, w⟩ + ad⟨v, w′⟩ + bd⟨v′, w′⟩
Commutativity: ⟨v, w⟩ = ⟨w, v⟩
Positivity: ⟨v, v⟩ ≥ 0, with equality only when v = 0
SLIDE 12
Review: Inner products on Rn
Last time, we saw that distributivity implies that any inner product on Rn is given by a matrix:
⟨v, w⟩ = vTAw,    Ai,j = ⟨ei, ej⟩
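As a concrete illustration (not part of the lecture), here is this matrix description of an inner product in NumPy, using a hypothetical symmetric positive matrix A on R2:

```python
import numpy as np

# A hypothetical symmetric positive-definite matrix defining an inner product on R^2.
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])

def inner(v, w):
    """Inner product <v, w> = v^T A w."""
    return v @ A @ w

e1 = np.array([1.0, 0.0])
e2 = np.array([0.0, 1.0])

# The matrix entries recover the inner products of standard basis vectors:
assert inner(e1, e2) == A[0, 1]
# Commutativity holds because A is symmetric:
assert inner(e1, e2) == inner(e2, e1)
```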
SLIDE 18
When does a matrix define an inner product?
For any matrix A, consider the formula ⟨v, w⟩ = vTAw. This always satisfies the distributivity axiom. It satisfies commutativity iff A is symmetric, i.e. AT = A. When does it satisfy positivity? (Last time: when A is 2×2, it is positive iff the eigenvalues are positive.)
SLIDE 28
Eigenvectors of symmetric matrices
Eigenvectors with different eigenvalues are linearly independent. For a symmetric matrix, eigenvectors with different eigenvalues are in fact orthogonal. Indeed, since MT = M, for any vector v, we have vTM = (Mv)T. If vi, vj are eigenvectors with eigenvalues λi, λj:
viTMvj = λjviTvj
viTMvj = (Mvi)Tvj = λiviTvj
So if λi ≠ λj, then viTvj = 0.
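We can check this orthogonality numerically. A sketch in NumPy (illustrative, not part of the lecture), using a random symmetric matrix:

```python
import numpy as np

rng = np.random.default_rng(0)
# Build a random symmetric matrix M = C + C^T.
C = rng.standard_normal((4, 4))
M = C + C.T

# eigh is NumPy's routine for symmetric matrices: it returns real
# eigenvalues and orthonormal eigenvectors (the columns of V).
eigvals, V = np.linalg.eigh(M)

# Eigenvectors with different eigenvalues are orthogonal; here the
# eigenvalues are distinct, so V^T V = I.
assert np.allclose(V.T @ V, np.eye(4))
assert np.allclose(M @ V, V @ np.diag(eigvals))
```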
SLIDE 34
The spectral theorem
Theorem
A symmetric matrix M has all real eigenvalues and an orthonormal basis of eigenvectors.
Proof. Given any real basis of eigenvectors, those of different eigenvalues are already orthogonal; and if some eigenspace is of dimension > 1, we may use Gram-Schmidt to replace our basis with an orthonormal one. So it is enough to find a basis of real eigenvectors. Our first task is to produce a single such vector.
SLIDE 41
The spectral theorem
Certainly M has a complex eigenvector v with complex eigenvalue λ. Let † denote the operation of taking transpose and complex conjugate. Then
λ v†v = v†Mv = (Mv)†v = λ̄ v†v
Since v†v is the sum of the squares of the lengths of the entries of v, it is positive, hence nonzero, hence λ = λ̄ is a real number.
SLIDE 48
The spectral theorem
Now we know there is a real unit eigenvector v with the real eigenvalue λ. Consider the orthogonal complement v⊥. This is preserved by M: if wTv = 0, then
(Mw)Tv = wTMv = λwTv = 0
Pick an orthonormal basis w2, . . . , wn of v⊥. In the basis v, w2, w3, . . . , wn, the matrix M takes the shape

M = | vTMv   0        · · ·   0       |
    | 0      w2TMw2   · · ·   w2TMwn  |
    | .      .        . . .   .       |
    | 0      wnTMw2   · · ·   wnTMwn  |

The lower-right block is a smaller symmetric matrix; we are done by induction.
SLIDE 55
The spectral theorem: reformulation
Recall that a square matrix O is said to be orthogonal if OT = O−1; equivalently, if the columns are orthonormal; equivalently, if the rows are orthonormal.
Theorem
If M is symmetric, there is an orthogonal matrix O and a real diagonal matrix D with
M = ODO−1 = ODOT
Proof: the columns of O are orthonormal eigenvectors of M.
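This factorization is exactly what NumPy's symmetric eigensolver produces. A quick sanity check (illustrative, not part of the lecture):

```python
import numpy as np

rng = np.random.default_rng(1)
C = rng.standard_normal((5, 5))
M = C + C.T  # a random symmetric matrix

# np.linalg.eigh returns the diagonal of D and the orthogonal matrix O.
d, O = np.linalg.eigh(M)
D = np.diag(d)

# O is orthogonal: O^T = O^{-1}.
assert np.allclose(O.T @ O, np.eye(5))
# The factorization M = O D O^T holds.
assert np.allclose(M, O @ D @ O.T)
```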
SLIDE 58
The spectral theorem: reformulation
Theorem
If M is symmetric, there is an orthonormal basis vi and real numbers λi such that
M = λ1v1v1T + · · · + λnvnvnT
Proof: the vi are the eigenvectors of M and the λi are their eigenvalues. Note by orthonormality, (viviT)2 = viviT, and (viviT)(vjvjT) = 0 when i ≠ j.
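The rank-one decomposition and the projection identities can both be verified numerically. A sketch in NumPy (illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
C = rng.standard_normal((4, 4))
M = C + C.T  # symmetric

lam, V = np.linalg.eigh(M)

# Rebuild M as a sum of rank-1 terms lambda_i * v_i v_i^T.
M_rebuilt = sum(lam[i] * np.outer(V[:, i], V[:, i]) for i in range(4))
assert np.allclose(M, M_rebuilt)

# Each v_i v_i^T is a projection, and distinct ones annihilate each other.
P0 = np.outer(V[:, 0], V[:, 0])
P1 = np.outer(V[:, 1], V[:, 1])
assert np.allclose(P0 @ P0, P0)
assert np.allclose(P0 @ P1, np.zeros((4, 4)))
```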
SLIDE 66
What does positive mean?
Suppose M is a symmetric matrix. When is it true that
wTMw ≥ 0, with equality only when w = 0?
Claim: if and only if M has only positive eigenvalues. Take an orthonormal basis vi of eigenvectors for M, with eigenvalues λi. Note viTMvi = λiviTvi = λi. So if any of the λi is ≤ 0, then certainly M is not positive. On the other hand, expand any w = Σ wivi. Then
wTMw = (Σ wivi)TM(Σ wivi) = Σ wi2λi
This is certainly positive if the λi are positive and w ≠ 0.
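The claim gives a practical test for positivity: compute the eigenvalues. A minimal sketch in NumPy (the function name `is_positive_definite` and the tolerance are my own choices, not from the lecture):

```python
import numpy as np

def is_positive_definite(M, tol=1e-12):
    """Check positivity of a symmetric matrix via its eigenvalues."""
    return bool(np.all(np.linalg.eigvalsh(M) > tol))

# Positive: the eigenvalues of [[2,1],[1,2]] are 1 and 3.
assert is_positive_definite(np.array([[2.0, 1.0], [1.0, 2.0]]))
# Not positive: the eigenvalues of [[1,2],[2,1]] are 3 and -1.
assert not is_positive_definite(np.array([[1.0, 2.0], [2.0, 1.0]]))
```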
SLIDE 74
Square-roots of positive matrices
Positive numbers have square-roots. So do positive symmetric matrices: if A = ODO−1 with D having all non-negative entries, then we can write √D for the diagonal matrix whose entries are the square-roots of D’s entries, and:
√A := O√DO−1
Note that √A is again symmetric and positive.
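This recipe translates directly into code. A sketch in NumPy (illustrative; the helper `sym_sqrt` is my own name):

```python
import numpy as np

def sym_sqrt(A):
    """Square-root of a positive semidefinite symmetric matrix: O sqrt(D) O^T."""
    d, O = np.linalg.eigh(A)
    # clip guards against tiny negative eigenvalues from round-off
    return O @ np.diag(np.sqrt(np.clip(d, 0.0, None))) @ O.T

rng = np.random.default_rng(3)
C = rng.standard_normal((4, 4))
A = C.T @ C  # C^T C is symmetric and positive semidefinite

B = sym_sqrt(A)
# B is symmetric and squares back to A.
assert np.allclose(B, B.T)
assert np.allclose(B @ B, A)
```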
SLIDE 80
Stretching and shrinking
How big or small can |Aw| / |w| be?
If A is symmetric, then we take a basis of orthonormal eigenvectors vi with eigenvalues λi and write:
w = Σ wivi
Aw = Σ wiλivi
From this it is not hard to see
min(|λi|) ≤ |Aw| / |w| = √( Σ wi2λi2 / Σ wi2 ) ≤ max(|λi|)
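These bounds are easy to probe numerically. A sketch in NumPy (illustrative), testing the stretch factor on random vectors:

```python
import numpy as np

rng = np.random.default_rng(4)
C = rng.standard_normal((3, 3))
A = C + C.T  # symmetric

lam = np.linalg.eigvalsh(A)
lo, hi = np.min(np.abs(lam)), np.max(np.abs(lam))

# The stretch factor |Aw|/|w| stays between min|lambda_i| and max|lambda_i|.
for _ in range(100):
    w = rng.standard_normal(3)
    ratio = np.linalg.norm(A @ w) / np.linalg.norm(w)
    assert lo - 1e-9 <= ratio <= hi + 1e-9
```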
SLIDE 89
Stretching and shrinking
What about for matrices A in general? Maybe not even square?
|Av|2 = (Av)T(Av) = vTATAv
Note the matrix ATA is (square and) symmetric. It’s also non-negative: vTATAv = |Av|2 ≥ 0. So it has a symmetric non-negative square-root B = √(ATA). So we reduced the problem to the symmetric case, since
|Av|2 = vTATAv = vTB2v = vTBTBv = |Bv|2
SLIDE 96
Singular values
Let vi be an orthonormal basis of eigenvectors for ATA, with eigenvalues λi. We order them so that λ1 ≥ λ2 ≥ · · · . As we have seen, all λi ≥ 0. We write σi = √λi. The σi are called the singular values of A. Note:
(Avi) · (Avj) = vjTATAvi = 0 when i ≠ j
(Avi) · (Avi) = viTATAvi = σi2viTvi = σi2
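We can confirm that these σi agree with what a library SVD computes. A sketch in NumPy (illustrative), for a non-square matrix:

```python
import numpy as np

rng = np.random.default_rng(5)
A = rng.standard_normal((5, 3))  # not square

# Singular values as square-roots of the eigenvalues of A^T A,
# ordered so that sigma_1 >= sigma_2 >= ...
lam = np.linalg.eigvalsh(A.T @ A)[::-1]  # eigvalsh returns ascending order
sigma = np.sqrt(np.clip(lam, 0.0, None))

# They agree with NumPy's built-in SVD routine.
assert np.allclose(sigma, np.linalg.svd(A, compute_uv=False))
```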
SLIDE 102
Singular value decomposition
So the vi are an orthonormal basis whose images are also orthogonal. We rescale the images to the orthonormal
ui := (1/σi) Avi
We extend the ui to an orthonormal basis. The matrices U, V whose columns are the basis vectors ui, vi are orthogonal: UT = U−1 and VT = V−1. The matrix Σ := UTAV is diagonal, with entries σi.
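The whole construction above is packaged in `np.linalg.svd`. A closing sanity check (illustrative, not part of the lecture):

```python
import numpy as np

rng = np.random.default_rng(6)
A = rng.standard_normal((4, 3))

# NumPy's SVD returns U and V^T orthogonal with A = U Sigma V^T.
U, s, Vt = np.linalg.svd(A, full_matrices=True)

# U^T = U^{-1} and V^T = V^{-1}.
assert np.allclose(U.T @ U, np.eye(4))
assert np.allclose(Vt @ Vt.T, np.eye(3))

# Sigma := U^T A V is diagonal, with the singular values on the diagonal.
Sigma = U.T @ A @ Vt.T
assert np.allclose(Sigma[:3, :3], np.diag(s))
assert np.allclose(Sigma[3, :], 0.0)
```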