[PPT] - Math 221: LINEAR ALGEBRA 5-3. Vector Space R n - Orthogonality Le PowerPoint Presentation

SLIDE 1

Math 221: LINEAR ALGEBRA

§5-3. Vector Space Rn - Orthogonality

Le Chen1

Emory University, 2020 Fall

(last updated on 08/18/2020) Creative Commons License (CC BY-NC-SA) 1Slides are adapted from those by Karen Seyffarth from University of Calgary.

SLIDE 2

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

SLIDE 3

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

1. The dot product of

x and y is

x ·

y = x1y1 + x2y2 + · · · xnyn = xT y. Note: x · y is a scalar, but we also treat it as a 1 × 1 matrix.

SLIDE 4

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

1. The dot product of

x and y is

x ·

y = x1y1 + x2y2 + · · · xnyn = xT y. Note: x · y is a scalar, but we also treat it as a 1 × 1 matrix.

2. The length of

x, denoted || x|| is || x|| =

x2

1 + x2 2 · · · + x2 n =

√

x ·

x.

SLIDE 5

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

1. The dot product of

x and y is

x ·

y = x1y1 + x2y2 + · · · xnyn = xT y. Note: x · y is a scalar, but we also treat it as a 1 × 1 matrix.

2. The length of

x, denoted || x|| is || x|| =

x2

1 + x2 2 · · · + x2 n =

√

x ·

x. 3. x is called a unit vector if || x|| = 1.

SLIDE 6

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then

SLIDE 7

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative)

SLIDE 8

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

SLIDE 9

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

3. (a

x) · y = a( x · y) = x · (a y)

SLIDE 10

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

3. (a

x) · y = a( x · y) = x · (a y)

4. ||

x||2 = x · x.

SLIDE 11

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

3. (a

x) · y = a( x · y) = x · (a y)

4. ||

x||2 = x · x.

5. ||

x|| ≥ 0 with equality if and only if x = 0n.

SLIDE 12

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

3. (a

x) · y = a( x · y) = x · (a y)

4. ||

x||2 = x · x.

5. ||

x|| ≥ 0 with equality if and only if x = 0n.

6. ||a

x|| = |a| || x||.

SLIDE 13

Example

Let x, y ∈ Rn. Then || x + y||2 = ( x + y) · ( x + y) =

x ·

x + x · y + y · x + y · y =

x ·

x + 2( x · y) + y · y = || x||2 + 2( x · y) + || y||2.

SLIDE 14

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k.

SLIDE 15

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn).

SLIDE 16

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

x ·

x

SLIDE 17

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

x ·

x =

x · (t1

f1 + t2 f2 + · · · + tk fk)

SLIDE 18

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

x ·

x =

x · (t1

f1 + t2 f2 + · · · + tk fk) =

x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk)

SLIDE 19

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

x ·

x =

x · (t1

f1 + t2 f2 + · · · + tk fk) =

x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk) = t1( x · f1) + t2( x · f2) + · · · + tk( x · fk)

SLIDE 20

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

x ·

x =

x · (t1

f1 + t2 f2 + · · · + tk fk) =

x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk) = t1( x · f1) + t2( x · f2) + · · · + tk( x · fk) = t1(0) + t2(0) + · · · + tk(0) = 0. Since || x||2 = 0, || x|| = 0. By the previous theorem, || x|| = 0 if and only if

x =
0n. Therefore,

x = 0n.

SLIDE 21

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

x ·

x =

x · (t1

f1 + t2 f2 + · · · + tk fk) =

x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk) = t1( x · f1) + t2( x · f2) + · · · + tk( x · fk) = t1(0) + t2(0) + · · · + tk(0) = 0. Since || x||2 = 0, || x|| = 0. By the previous theorem, || x|| = 0 if and only if

x =
0n. Therefore,

x = 0n.

SLIDE 22

Theorem (Cauchy Inequality)

If x, y ∈ Rn, then | x · y| ≤ || x|| || y|| with equality if and only if { x, y} is linearly dependent.

SLIDE 23

Theorem (Cauchy Inequality)

If x, y ∈ Rn, then | x · y| ≤ || x|| || y|| with equality if and only if { x, y} is linearly dependent.

Proof.

Let x, y ∈ Rn and t ∈ R. Then

0 ≤ ||t x + y||2 = (t x + y) · (t x + y) = t2 x · x + 2t x · y + y · y = t2|| x||2 + 2t( x · y) + || y||2. The quadratic t2|| x||2 + 2t( x · y) + || y||2 in t is always nonnegative, so it does not have distinct real roots. Thus, if we use the quadratic formula to solve for t, the discriminant must be nonpositive, i.e., (2 x · y)2 − 4|| x||2|| y||2 ≤ 0. Therefore, (2 x · y)2 ≤ 4|| x||2|| y||2 Since both sides of the inequality are nonnegative, we can take (positive) square roots of both sides: |2 x · y| ≤ 2|| x|| || y|| Therefore, | x · y| ≤ || x|| || y||. What remains is to show that | x · y| = || x|| || y|| if and

nly if {

x, y} is linearly dependent.

SLIDE 24

Proof (continued).

First suppose that { x, y} is dependent. Then by symmetry (of x and y),

x = k

y for some k ∈ R. Hence

| x · y| = |(k y) · y| = |k| | y · y| = |k| || y||2, and || x|| || y|| = ||k y|| || y|| = |k| || y||2,

so | x · y| = || x|| || y||. Conversely, suppose { x, y} is independent; then t x + y = 0n for all t ∈ R, so ||t x + y||2 > 0 for all t ∈ R. Thus the quadratic t2|| x||2 + 2t( x · y) + || y||2 > 0 so has no real roots. It follows that the the discriminant is negative, i.e., (2 x · y)2 − 4|| x||2|| y||2 < 0. Therefore, (2 x · y)2 < 4|| x||2|| y||2; taking square roots of both sides (they are both nonnegative) and dividing by two gives us | x · y| < || x|| || y||, showing that equality is impossible.

SLIDE 25

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||. by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

SLIDE 26

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

SLIDE 27

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

x ·

x + 2 x · y + y · y by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

SLIDE 28

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

SLIDE 29

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 ≤ || x||2 + 2|| x|| || y|| + || y||2 by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

SLIDE 30

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 ≤ || x||2 + 2|| x|| || y|| + || y||2 by the Cauchy Inequality = (|| x|| + || y||)2. Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

SLIDE 31

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 ≤ || x||2 + 2|| x|| || y|| + || y||2 by the Cauchy Inequality = (|| x|| + || y||)2. Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides: || x + y|| ≤ || x|| + || y||.

SLIDE 32

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||. by Triangle Inequality I

SLIDE 33

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then by Triangle Inequality I

SLIDE 34

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0. by Triangle Inequality I

SLIDE 35

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0.

2. d(

x, y) = 0 if and only if x = y. by Triangle Inequality I

SLIDE 36

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0.

2. d(

x, y) = 0 if and only if x = y.

3. d(

x, y) = d( y, x). by Triangle Inequality I

SLIDE 37

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0.

2. d(

x, y) = 0 if and only if x = y.

3. d(

x, y) = d( y, x).

4. d(

x, z) ≤ d(

x,

y) + d( y, z) (Triangle Inequality II). by Triangle Inequality I

SLIDE 38

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0.

2. d(

x, y) = 0 if and only if x = y.

3. d(

x, y) = d( y, x).

4. d(

x, z) ≤ d(

x,

y) + d( y, z) (Triangle Inequality II).

Proof of the Triangle Inequality II.

d( x, z) = || x − z|| = ||( x − y) + ( y − z)|| by Triangle Inequality I

SLIDE 39

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0.

2. d(

x, y) = 0 if and only if x = y.

3. d(

x, y) = d( y, x).

4. d(

x, z) ≤ d(

x,

y) + d( y, z) (Triangle Inequality II).

Proof of the Triangle Inequality II.

d( x, z) = || x − z|| = ||( x − y) + ( y − z)|| ≤ || x − y|| + || y − z|| by Triangle Inequality I

SLIDE 40

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

1. d(

x, y) ≥ 0.

2. d(

x, y) = 0 if and only if x = y.

3. d(

x, y) = d( y, x).

4. d(

x, z) ≤ d(

x,

y) + d( y, z) (Triangle Inequality II).

Proof of the Triangle Inequality II.

d( x, z) = || x − z|| = ||( x − y) + ( y − z)|| ≤ || x − y|| + || y − z|| by Triangle Inequality I = d( x, y) + d( y, z).

SLIDE 41

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. Let and suppose that and are orthogonal. Then it is not necessarily the case that is an orthogonal set.

SLIDE 42

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. Let and suppose that and are orthogonal. Then it is not necessarily the case that is an orthogonal set.

SLIDE 43

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. ◮ A set X = { x1, x2, . . . , xk} ⊆ Rn is an orthonormal set if X is an

rthogonal set of unit vectors, i.e., ||

xi|| = 1 for all i, 1 ≤ i ≤ k. Let and suppose that and are orthogonal. Then it is not necessarily the case that is an orthogonal set.

SLIDE 44

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. ◮ A set X = { x1, x2, . . . , xk} ⊆ Rn is an orthonormal set if X is an

rthogonal set of unit vectors, i.e., ||

xi|| = 1 for all i, 1 ≤ i ≤ k.

Subtlety in the Definitions!

Let x, y ∈ Rn and suppose that x and y are orthogonal. Then it is not necessarily the case that { x, y} is an orthogonal set.

SLIDE 45

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. ◮ A set X = { x1, x2, . . . , xk} ⊆ Rn is an orthonormal set if X is an

rthogonal set of unit vectors, i.e., ||

xi|| = 1 for all i, 1 ≤ i ≤ k.

Subtlety in the Definitions!

Let x, y ∈ Rn and suppose that x and y are orthogonal. Then it is not necessarily the case that { x, y} is an orthogonal set. Why?

SLIDE 46

Examples

SLIDE 47

Examples

1. The standard basis of Rn is an orthonormal set (and hence an
rthogonal set).

SLIDE 48

Examples

1. The standard basis of Rn is an orthonormal set (and hence an
rthogonal set).

2.            1 1 1 1     ,     1 1 −1 −1     ,     1 −1 1 −1            is an orthogonal (but not orthonormal) subset of R4.

SLIDE 49

Examples

1. The standard basis of Rn is an orthonormal set (and hence an
rthogonal set).

2.            1 1 1 1     ,     1 1 −1 −1     ,     1 −1 1 −1            is an orthogonal (but not orthonormal) subset of R4.

3. If {

x1, x2, . . . , xk} is an orthogonal subset of Rn and p = 0, then {p x1, p x2, . . . , p xk} is an orthogonal subset of Rn.

SLIDE 50

Examples

1. The standard basis of Rn is an orthonormal set (and hence an
rthogonal set).

2.            1 1 1 1     ,     1 1 −1 −1     ,     1 −1 1 −1            is an orthogonal (but not orthonormal) subset of R4.

3. If {

x1, x2, . . . , xk} is an orthogonal subset of Rn and p = 0, then {p x1, p x2, . . . , p xk} is an orthogonal subset of Rn. 4.        1 2     1 1 1 1     , 1 2     1 1 −1 −1     , 1 2     1 −1 1 −1            is an orthonormal subset of R4.

SLIDE 51

Definition

Normalizing an orthogonal set is the process of turning an orthogonal (but not orthonormal) set into an orthonormal set. If { x1, x2, . . . , xk} is an

rthogonal subset of Rn, then
1

|| x1|| x1, 1 || x2|| x2, . . . , 1 || xk|| xk

is an orthonormal set.

SLIDE 52

Definition

Normalizing an orthogonal set is the process of turning an orthogonal (but not orthonormal) set into an orthonormal set. If { x1, x2, . . . , xk} is an

rthogonal subset of Rn, then
1

|| x1|| x1, 1 || x2|| x2, . . . , 1 || xk|| xk

is an orthonormal set.

Problem

Verify that      1 −1 2   ,   2 1   ,   5 1 −2      is an orthogonal set, and normalize this set.

SLIDE 53

Solution

  1 −1 2   ·   2 1   = 0 − 2 + 2 = 0,   2 1   ·   5 1 −2   = 0 + 2 − 2 = 0,   1 −1 2   ·   5 1 −2   = 5 − 1 − 4 = 0, proving that the set is orthogonal. Normalizing gives us the orthonormal set    1 √ 6   1 −1 2   , 1 √ 5   2 1   , 1 √ 30   5 1 −2      .

SLIDE 54

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2. Start with

. . . . . . . . . The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

SLIDE 55

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) . . . . . . . . . The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

SLIDE 56

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) = ( x1 · x1 + x1 · x2 + · · · + x1 · xk) +( x2 · x1 + x2 · x2 + · · · + x2 · xk) . . . . . . . . . +( xk · x1 + xk · x2 + · · · + xk · xk) =

x1 ·

x1 + x2 · x2 + · · · + xk · xk The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

SLIDE 57

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) = ( x1 · x1 + x1 · x2 + · · · + x1 · xk) +( x2 · x1 + x2 · x2 + · · · + x2 · xk) . . . . . . . . . +( xk · x1 + xk · x2 + · · · + xk · xk) =

x1 ·

x1 + x2 · x2 + · · · + xk · xk = || x1||2 + || x2||2 + · · · + || xk||2. The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

SLIDE 58

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) = ( x1 · x1 + x1 · x2 + · · · + x1 · xk) +( x2 · x1 + x2 · x2 + · · · + x2 · xk) . . . . . . . . . +( xk · x1 + xk · x2 + · · · + xk · xk) =

x1 ·

x1 + x2 · x2 + · · · + xk · xk = || x1||2 + || x2||2 + · · · + || xk||2. The second last equality follows from the fact that the set is orthogonal, so for all i and j, i = j and 1 ≤ i, j ≤ k, xi · xj = 0. Thus, the only nonzero terms are the ones of the form xi · xi, 1 ≤ i ≤ k.

SLIDE 59

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent. Let for some , and suppose . Then for all , , since for all , where . Since and , it follows that for all , . Therefore, is linearly independent.

SLIDE 60

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent.

Proof.

Let x = t1 x1 + t2 x2 + · · · + tk xk for some t1, t2, . . . , tk ∈ R, and suppose

x =

0n. Then for all , , since for all , where . Since and , it follows that for all , . Therefore, is linearly independent.

SLIDE 61

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent.

Proof.

Let x = t1 x1 + t2 x2 + · · · + tk xk for some t1, t2, . . . , tk ∈ R, and suppose

x =
0n. Then for all i, 1 ≤ i ≤ k,

0 = x · xi = (t1 x1 + t2 x2 + · · · + tk xk) · xi = ti xi · xi = ti|| xi||2, since tj xj · xi = 0 for all j, 1 ≤ j ≤ k where j = i. Since and , it follows that for all , . Therefore, is linearly independent.

SLIDE 62

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent.

Proof.

Let x = t1 x1 + t2 x2 + · · · + tk xk for some t1, t2, . . . , tk ∈ R, and suppose

x =
0n. Then for all i, 1 ≤ i ≤ k,

0 = x · xi = (t1 x1 + t2 x2 + · · · + tk xk) · xi = ti xi · xi = ti|| xi||2, since tj xj · xi = 0 for all j, 1 ≤ j ≤ k where j = i. Since xi = 0n and ti|| xi||2 = 0, it follows that ti = 0 for all i, 1 ≤ i ≤ k. Therefore, S is linearly independent.

SLIDE 63

Example

Given an arbitrary vector

x =

     a1 a2 . . . an      ∈ Rn, it is trivial to express x as a linear combination of the standard basis vectors of Rn, { e1, e2, . . . , en}:

x = a1

e1 + a2 e2 + · · · + an en.

SLIDE 64

Example

Given an arbitrary vector

x =

     a1 a2 . . . an      ∈ Rn, it is trivial to express x as a linear combination of the standard basis vectors of Rn, { e1, e2, . . . , en}:

x = a1

e1 + a2 e2 + · · · + an en.

Problem

Given any orthogonal basis B of Rn (so not necessarily the standard basis), and an arbitrary vector x ∈ Rn, how do we express x as a linear combination of the vectors in B?

SLIDE 65

Theorem (Fourier Expansion)

Let { f1, f2, . . . , fm} be an orthogonal basis of a subspace U of Rn. Then for any x ∈ U,

x =
x ·

f1 || f1||2

f1 +
x ·

f2 || f2||2

f2 + · · · +
x ·

fm || fm||2

fm.

This expression is called the Fourier expansion of x, and

x ·

fj || fj||2 , j = 1, 2, . . . , m are the Fourier coefficients.

SLIDE 66

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3.

SLIDE 67

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?)

SLIDE 68

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?) To express x as a linear combination of the vectors of B, apply the Fourier Expansion Theorem. Assume x = t1 f1 + t2 f2 + t3 f3.

SLIDE 69

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?) To express x as a linear combination of the vectors of B, apply the Fourier Expansion Theorem. Assume x = t1 f1 + t2 f2 + t3

f3. Then

t1 = x · f1 || f1||2 = 2 6, t2 = x · f2 || f2||2 = 3 5, and t3 = x · f3 || f3||2 = 4 30.

SLIDE 70

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?) To express x as a linear combination of the vectors of B, apply the Fourier Expansion Theorem. Assume x = t1 f1 + t2 f2 + t3

f3. Then

t1 = x · f1 || f1||2 = 2 6, t2 = x · f2 || f2||2 = 3 5, and t3 = x · f3 || f3||2 = 4 30. Therefore,   1 1 1   = 1 3   1 −1 2   + 3 5   2 1   + 2 15   5 1 −2   .

SLIDE 71

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any , , since is orthogonal Since is nonzero, we obtain The result now follows. If is an orthonormal basis, then the Fourier coeffjcients are simply , .

SLIDE 72

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi since is orthogonal Since is nonzero, we obtain The result now follows. If is an orthonormal basis, then the Fourier coeffjcients are simply , .

SLIDE 73

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi = ti fi · fi since { f1, f2, . . . , fm} is orthogonal = ti|| fi||2. Since is nonzero, we obtain The result now follows. If is an orthonormal basis, then the Fourier coeffjcients are simply , .

SLIDE 74

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi = ti fi · fi since { f1, f2, . . . , fm} is orthogonal = ti|| fi||2. Since fi is nonzero, we obtain ti = x · fi || fi||2 . The result now follows.

If

is an orthonormal basis, then the Fourier coeffjcients are simply , .

SLIDE 75

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi = ti fi · fi since { f1, f2, . . . , fm} is orthogonal = ti|| fi||2. Since fi is nonzero, we obtain ti = x · fi || fi||2 . The result now follows.

If {

f1, f2, . . . , fm} is an orthonormal basis, then the Fourier coeffjcients are simply tj = x · fj, j = 1, 2, . . . , m.

SLIDE 76

Problem

Let

f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Computing for gives us so is an orthogonal set. It follows that is independent, and since dim , also spans . Therefore, is an orthogonal basis of . By the Fourier Expansion Theorem,

SLIDE 77

Problem

Let

f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Solution

Computing fi · fj for 1 ≤ i < j ≤ 4 gives us

f1 ·

f2 = 0,

f1 ·

f3 = 0,

f1 ·

f4 = 0,

f2 ·

f3 = 0,

f2 ·

f4 = 0,

f3 ·

f4 = 0, so B is an orthogonal set. It follows that is independent, and since dim , also spans . Therefore, is an orthogonal basis of . By the Fourier Expansion Theorem,

SLIDE 78

Problem

Let

f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Solution

Computing fi · fj for 1 ≤ i < j ≤ 4 gives us

f1 ·

f2 = 0,

f1 ·

f3 = 0,

f1 ·

f4 = 0,

f2 ·

f3 = 0,

f2 ·

f4 = 0,

f3 ·

f4 = 0, so B is an orthogonal set. It follows that B is independent, and since |B| = 4 = dim(R4), B also spans R4. Therefore, B is an orthogonal basis of R4. By the Fourier Expansion Theorem,

SLIDE 79

Problem

Let

f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Solution

Computing fi · fj for 1 ≤ i < j ≤ 4 gives us

f1 ·

f2 = 0,

f1 ·

f3 = 0,

f1 ·

f4 = 0,

f2 ·

f3 = 0,

f2 ·

f4 = 0,

f3 ·

f4 = 0, so B is an orthogonal set. It follows that B is independent, and since |B| = 4 = dim(R4), B also spans R4. Therefore, B is an orthogonal basis of

R4. By the Fourier Expansion Theorem,
x =

a + b 2

f1 +

a − b 2

f2 +

c + d 2

f3 +

c − d 2

f4.