Math 221: LINEAR ALGEBRA 5-3. Vector Space R n - Orthogonality Le - - PowerPoint PPT Presentation

math 221 linear algebra
SMART_READER_LITE
LIVE PREVIEW

Math 221: LINEAR ALGEBRA 5-3. Vector Space R n - Orthogonality Le - - PowerPoint PPT Presentation

Math 221: LINEAR ALGEBRA 5-3. Vector Space R n - Orthogonality Le Chen 1 Emory University, 2020 Fall (last updated on 08/18/2020) Creative Commons License (CC BY-NC-SA) 1 Slides are adapted from those by Karen Seyffarth from University of


slide-1
SLIDE 1

Math 221: LINEAR ALGEBRA

§5-3. Vector Space Rn - Orthogonality

Le Chen1

Emory University, 2020 Fall

(last updated on 08/18/2020) Creative Commons License (CC BY-NC-SA) 1Slides are adapted from those by Karen Seyffarth from University of Calgary.

slide-2
SLIDE 2

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

slide-3
SLIDE 3

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

  • 1. The dot product of

x and y is

  • x ·

y = x1y1 + x2y2 + · · · xnyn = xT y. Note: x · y is a scalar, but we also treat it as a 1 × 1 matrix.

slide-4
SLIDE 4

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

  • 1. The dot product of

x and y is

  • x ·

y = x1y1 + x2y2 + · · · xnyn = xT y. Note: x · y is a scalar, but we also treat it as a 1 × 1 matrix.

  • 2. The length of

x, denoted || x|| is || x|| =

  • x2

1 + x2 2 · · · + x2 n =

  • x ·

x.

slide-5
SLIDE 5

Definitions

Let x =      x1 x2 . . . xn      and y =      y1 y2 . . . yn      be vectors in Rn.

  • 1. The dot product of

x and y is

  • x ·

y = x1y1 + x2y2 + · · · xnyn = xT y. Note: x · y is a scalar, but we also treat it as a 1 × 1 matrix.

  • 2. The length of

x, denoted || x|| is || x|| =

  • x2

1 + x2 2 · · · + x2 n =

  • x ·

x. 3. x is called a unit vector if || x|| = 1.

slide-6
SLIDE 6

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then

slide-7
SLIDE 7

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative)

slide-8
SLIDE 8

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

slide-9
SLIDE 9

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

  • 3. (a

x) · y = a( x · y) = x · (a y)

slide-10
SLIDE 10

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

  • 3. (a

x) · y = a( x · y) = x · (a y)

  • 4. ||

x||2 = x · x.

slide-11
SLIDE 11

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

  • 3. (a

x) · y = a( x · y) = x · (a y)

  • 4. ||

x||2 = x · x.

  • 5. ||

x|| ≥ 0 with equality if and only if x = 0n.

slide-12
SLIDE 12

Theorem (Properties of length and the dot product)

Let x, y, z ∈ Rn, and let a ∈ R. Then 1. x · y = y · x (the dot product is commutative) 2. x · ( y + z) = x · y + x · z (the dot product distributes over addition)

  • 3. (a

x) · y = a( x · y) = x · (a y)

  • 4. ||

x||2 = x · x.

  • 5. ||

x|| ≥ 0 with equality if and only if x = 0n.

  • 6. ||a

x|| = |a| || x||.

slide-13
SLIDE 13

Example

Let x, y ∈ Rn. Then || x + y||2 = ( x + y) · ( x + y) =

  • x ·

x + x · y + y · x + y · y =

  • x ·

x + 2( x · y) + y · y = || x||2 + 2( x · y) + || y||2.

slide-14
SLIDE 14

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k.

slide-15
SLIDE 15

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn).

slide-16
SLIDE 16

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

  • x ·

x

slide-17
SLIDE 17

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

  • x ·

x =

  • x · (t1

f1 + t2 f2 + · · · + tk fk)

slide-18
SLIDE 18

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

  • x ·

x =

  • x · (t1

f1 + t2 f2 + · · · + tk fk) =

  • x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk)

slide-19
SLIDE 19

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

  • x ·

x =

  • x · (t1

f1 + t2 f2 + · · · + tk fk) =

  • x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk) = t1( x · f1) + t2( x · f2) + · · · + tk( x · fk)

slide-20
SLIDE 20

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

  • x ·

x =

  • x · (t1

f1 + t2 f2 + · · · + tk fk) =

  • x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk) = t1( x · f1) + t2( x · f2) + · · · + tk( x · fk) = t1(0) + t2(0) + · · · + tk(0) = 0. Since || x||2 = 0, || x|| = 0. By the previous theorem, || x|| = 0 if and only if

  • x =
  • 0n. Therefore,

x = 0n.

slide-21
SLIDE 21

Example

Let { f1, f2, . . . , fk} ∈ Rn and suppose Rn = span{ f1, f2, . . . , fk}. Furthermore, suppose that there exists a vector x ∈ Rn for which x · fj = 0 for all j, 1 ≤ j ≤ k. Write x = t1 f1 + t2 f2 + · · · + tk fk for some t1, t2, . . . , tk ∈ R (this is possible because f1, f2, . . . , fk span Rn). Then || x||2 =

  • x ·

x =

  • x · (t1

f1 + t2 f2 + · · · + tk fk) =

  • x · (t1

f1) + x · (t2 f2) + · · · + x · (tk fk) = t1( x · f1) + t2( x · f2) + · · · + tk( x · fk) = t1(0) + t2(0) + · · · + tk(0) = 0. Since || x||2 = 0, || x|| = 0. By the previous theorem, || x|| = 0 if and only if

  • x =
  • 0n. Therefore,

x = 0n.

slide-22
SLIDE 22

Theorem (Cauchy Inequality)

If x, y ∈ Rn, then | x · y| ≤ || x|| || y|| with equality if and only if { x, y} is linearly dependent.

slide-23
SLIDE 23

Theorem (Cauchy Inequality)

If x, y ∈ Rn, then | x · y| ≤ || x|| || y|| with equality if and only if { x, y} is linearly dependent.

Proof.

Let x, y ∈ Rn and t ∈ R. Then

0 ≤ ||t x + y||2 = (t x + y) · (t x + y) = t2 x · x + 2t x · y + y · y = t2|| x||2 + 2t( x · y) + || y||2. The quadratic t2|| x||2 + 2t( x · y) + || y||2 in t is always nonnegative, so it does not have distinct real roots. Thus, if we use the quadratic formula to solve for t, the discriminant must be nonpositive, i.e., (2 x · y)2 − 4|| x||2|| y||2 ≤ 0. Therefore, (2 x · y)2 ≤ 4|| x||2|| y||2 Since both sides of the inequality are nonnegative, we can take (positive) square roots of both sides: |2 x · y| ≤ 2|| x|| || y|| Therefore, | x · y| ≤ || x|| || y||. What remains is to show that | x · y| = || x|| || y|| if and

  • nly if {

x, y} is linearly dependent.

slide-24
SLIDE 24

Proof (continued).

First suppose that { x, y} is dependent. Then by symmetry (of x and y),

  • x = k

y for some k ∈ R. Hence

| x · y| = |(k y) · y| = |k| | y · y| = |k| || y||2, and || x|| || y|| = ||k y|| || y|| = |k| || y||2,

so | x · y| = || x|| || y||. Conversely, suppose { x, y} is independent; then t x + y = 0n for all t ∈ R, so ||t x + y||2 > 0 for all t ∈ R. Thus the quadratic t2|| x||2 + 2t( x · y) + || y||2 > 0 so has no real roots. It follows that the the discriminant is negative, i.e., (2 x · y)2 − 4|| x||2|| y||2 < 0. Therefore, (2 x · y)2 < 4|| x||2|| y||2; taking square roots of both sides (they are both nonnegative) and dividing by two gives us | x · y| < || x|| || y||, showing that equality is impossible.

slide-25
SLIDE 25

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||. by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

slide-26
SLIDE 26

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

slide-27
SLIDE 27

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

  • x ·

x + 2 x · y + y · y by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

slide-28
SLIDE 28

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

  • x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

slide-29
SLIDE 29

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

  • x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 ≤ || x||2 + 2|| x|| || y|| + || y||2 by the Cauchy Inequality Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

slide-30
SLIDE 30

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

  • x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 ≤ || x||2 + 2|| x|| || y|| + || y||2 by the Cauchy Inequality = (|| x|| + || y||)2. Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides:

slide-31
SLIDE 31

Corollary (Triangle Inequality I )

If x, y ∈ Rn, then || x + y|| ≤ || x|| + || y||.

Proof.

|| x + y||2 = ( x + y) · ( x + y) =

  • x ·

x + 2 x · y + y · y = || x||2 + 2 x · y + || y||2 ≤ || x||2 + 2|| x|| || y|| + || y||2 by the Cauchy Inequality = (|| x|| + || y||)2. Since both sides of the inequality are nonnegative, we take (positive) square roots of both sides: || x + y|| ≤ || x|| + || y||.

slide-32
SLIDE 32

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||. by Triangle Inequality I

slide-33
SLIDE 33

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then by Triangle Inequality I

slide-34
SLIDE 34

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0. by Triangle Inequality I

slide-35
SLIDE 35

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0.

  • 2. d(

x, y) = 0 if and only if x = y. by Triangle Inequality I

slide-36
SLIDE 36

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0.

  • 2. d(

x, y) = 0 if and only if x = y.

  • 3. d(

x, y) = d( y, x). by Triangle Inequality I

slide-37
SLIDE 37

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0.

  • 2. d(

x, y) = 0 if and only if x = y.

  • 3. d(

x, y) = d( y, x).

  • 4. d(

x, z) ≤ d(

  • x,

y) + d( y, z) (Triangle Inequality II). by Triangle Inequality I

slide-38
SLIDE 38

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0.

  • 2. d(

x, y) = 0 if and only if x = y.

  • 3. d(

x, y) = d( y, x).

  • 4. d(

x, z) ≤ d(

  • x,

y) + d( y, z) (Triangle Inequality II).

Proof of the Triangle Inequality II.

d( x, z) = || x − z|| = ||( x − y) + ( y − z)|| by Triangle Inequality I

slide-39
SLIDE 39

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0.

  • 2. d(

x, y) = 0 if and only if x = y.

  • 3. d(

x, y) = d( y, x).

  • 4. d(

x, z) ≤ d(

  • x,

y) + d( y, z) (Triangle Inequality II).

Proof of the Triangle Inequality II.

d( x, z) = || x − z|| = ||( x − y) + ( y − z)|| ≤ || x − y|| + || y − z|| by Triangle Inequality I

slide-40
SLIDE 40

Definition

If x, y ∈ Rn, then the distance between x and y is defined as d( x, y) = || x − y||.

Theorem (Properties of the distance function)

Let x, y, z ∈ Rn. Then

  • 1. d(

x, y) ≥ 0.

  • 2. d(

x, y) = 0 if and only if x = y.

  • 3. d(

x, y) = d( y, x).

  • 4. d(

x, z) ≤ d(

  • x,

y) + d( y, z) (Triangle Inequality II).

Proof of the Triangle Inequality II.

d( x, z) = || x − z|| = ||( x − y) + ( y − z)|| ≤ || x − y|| + || y − z|| by Triangle Inequality I = d( x, y) + d( y, z).

slide-41
SLIDE 41

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. Let and suppose that and are orthogonal. Then it is not necessarily the case that is an orthogonal set.

slide-42
SLIDE 42

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

  • xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
  • xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. Let and suppose that and are orthogonal. Then it is not necessarily the case that is an orthogonal set.

slide-43
SLIDE 43

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

  • xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
  • xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. ◮ A set X = { x1, x2, . . . , xk} ⊆ Rn is an orthonormal set if X is an

  • rthogonal set of unit vectors, i.e., ||

xi|| = 1 for all i, 1 ≤ i ≤ k. Let and suppose that and are orthogonal. Then it is not necessarily the case that is an orthogonal set.

slide-44
SLIDE 44

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

  • xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
  • xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. ◮ A set X = { x1, x2, . . . , xk} ⊆ Rn is an orthonormal set if X is an

  • rthogonal set of unit vectors, i.e., ||

xi|| = 1 for all i, 1 ≤ i ≤ k.

Subtlety in the Definitions!

Let x, y ∈ Rn and suppose that x and y are orthogonal. Then it is not necessarily the case that { x, y} is an orthogonal set.

slide-45
SLIDE 45

Orthogonality

Definitions

◮ Let x, y ∈ Rn. We say the x and y are orthogonal if x · y = 0. ◮ More generally, X = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set if each

  • xi is nonzero, and every pair of distinct vectors of X is orthogonal, i.e.,
  • xi ·

xj = 0 for all i = j, 1 ≤ i, j ≤ k. ◮ A set X = { x1, x2, . . . , xk} ⊆ Rn is an orthonormal set if X is an

  • rthogonal set of unit vectors, i.e., ||

xi|| = 1 for all i, 1 ≤ i ≤ k.

Subtlety in the Definitions!

Let x, y ∈ Rn and suppose that x and y are orthogonal. Then it is not necessarily the case that { x, y} is an orthogonal set. Why?

slide-46
SLIDE 46

Examples

slide-47
SLIDE 47

Examples

  • 1. The standard basis of Rn is an orthonormal set (and hence an
  • rthogonal set).
slide-48
SLIDE 48

Examples

  • 1. The standard basis of Rn is an orthonormal set (and hence an
  • rthogonal set).

2.            1 1 1 1     ,     1 1 −1 −1     ,     1 −1 1 −1            is an orthogonal (but not orthonormal) subset of R4.

slide-49
SLIDE 49

Examples

  • 1. The standard basis of Rn is an orthonormal set (and hence an
  • rthogonal set).

2.            1 1 1 1     ,     1 1 −1 −1     ,     1 −1 1 −1            is an orthogonal (but not orthonormal) subset of R4.

  • 3. If {

x1, x2, . . . , xk} is an orthogonal subset of Rn and p = 0, then {p x1, p x2, . . . , p xk} is an orthogonal subset of Rn.

slide-50
SLIDE 50

Examples

  • 1. The standard basis of Rn is an orthonormal set (and hence an
  • rthogonal set).

2.            1 1 1 1     ,     1 1 −1 −1     ,     1 −1 1 −1            is an orthogonal (but not orthonormal) subset of R4.

  • 3. If {

x1, x2, . . . , xk} is an orthogonal subset of Rn and p = 0, then {p x1, p x2, . . . , p xk} is an orthogonal subset of Rn. 4.        1 2     1 1 1 1     , 1 2     1 1 −1 −1     , 1 2     1 −1 1 −1            is an orthonormal subset of R4.

slide-51
SLIDE 51

Definition

Normalizing an orthogonal set is the process of turning an orthogonal (but not orthonormal) set into an orthonormal set. If { x1, x2, . . . , xk} is an

  • rthogonal subset of Rn, then
  • 1

|| x1|| x1, 1 || x2|| x2, . . . , 1 || xk|| xk

  • is an orthonormal set.
slide-52
SLIDE 52

Definition

Normalizing an orthogonal set is the process of turning an orthogonal (but not orthonormal) set into an orthonormal set. If { x1, x2, . . . , xk} is an

  • rthogonal subset of Rn, then
  • 1

|| x1|| x1, 1 || x2|| x2, . . . , 1 || xk|| xk

  • is an orthonormal set.

Problem

Verify that      1 −1 2   ,   2 1   ,   5 1 −2      is an orthogonal set, and normalize this set.

slide-53
SLIDE 53

Solution

  1 −1 2   ·   2 1   = 0 − 2 + 2 = 0,   2 1   ·   5 1 −2   = 0 + 2 − 2 = 0,   1 −1 2   ·   5 1 −2   = 5 − 1 − 4 = 0, proving that the set is orthogonal. Normalizing gives us the orthonormal set    1 √ 6   1 −1 2   , 1 √ 5   2 1   , 1 √ 30   5 1 −2      .

slide-54
SLIDE 54

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2. Start with

. . . . . . . . . The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

slide-55
SLIDE 55

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) . . . . . . . . . The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

slide-56
SLIDE 56

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) = ( x1 · x1 + x1 · x2 + · · · + x1 · xk) +( x2 · x1 + x2 · x2 + · · · + x2 · xk) . . . . . . . . . +( xk · x1 + xk · x2 + · · · + xk · xk) =

  • x1 ·

x1 + x2 · x2 + · · · + xk · xk The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

slide-57
SLIDE 57

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) = ( x1 · x1 + x1 · x2 + · · · + x1 · xk) +( x2 · x1 + x2 · x2 + · · · + x2 · xk) . . . . . . . . . +( xk · x1 + xk · x2 + · · · + xk · xk) =

  • x1 ·

x1 + x2 · x2 + · · · + xk · xk = || x1||2 + || x2||2 + · · · + || xk||2. The second last equality follows from the fact that the set is orthogonal, so for all and , and , . Thus, the only nonzero terms are the ones of the form , .

slide-58
SLIDE 58

Theorem (Pythagoras’ Theorem)

If { x1, x2, . . . , xk} ⊆ Rn is orthogonal, then || x1 + x2 + · · · + xk||2 = || x1||2 + || x2||2 + · · · + || xk||2.

Proof.

Start with

|| x1 + x2 + · · · + xk||2 = ( x1 + x2 + · · · + xk) · ( x1 + x2 + · · · + xk) = ( x1 · x1 + x1 · x2 + · · · + x1 · xk) +( x2 · x1 + x2 · x2 + · · · + x2 · xk) . . . . . . . . . +( xk · x1 + xk · x2 + · · · + xk · xk) =

  • x1 ·

x1 + x2 · x2 + · · · + xk · xk = || x1||2 + || x2||2 + · · · + || xk||2. The second last equality follows from the fact that the set is orthogonal, so for all i and j, i = j and 1 ≤ i, j ≤ k, xi · xj = 0. Thus, the only nonzero terms are the ones of the form xi · xi, 1 ≤ i ≤ k.

slide-59
SLIDE 59

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent. Let for some , and suppose . Then for all , , since for all , where . Since and , it follows that for all , . Therefore, is linearly independent.

slide-60
SLIDE 60

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent.

Proof.

Let x = t1 x1 + t2 x2 + · · · + tk xk for some t1, t2, . . . , tk ∈ R, and suppose

  • x =

0n. Then for all , , since for all , where . Since and , it follows that for all , . Therefore, is linearly independent.

slide-61
SLIDE 61

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent.

Proof.

Let x = t1 x1 + t2 x2 + · · · + tk xk for some t1, t2, . . . , tk ∈ R, and suppose

  • x =
  • 0n. Then for all i, 1 ≤ i ≤ k,

0 = x · xi = (t1 x1 + t2 x2 + · · · + tk xk) · xi = ti xi · xi = ti|| xi||2, since tj xj · xi = 0 for all j, 1 ≤ j ≤ k where j = i. Since and , it follows that for all , . Therefore, is linearly independent.

slide-62
SLIDE 62

Theorem

If S = { x1, x2, . . . , xk} ⊆ Rn is an orthogonal set, then S is independent.

Proof.

Let x = t1 x1 + t2 x2 + · · · + tk xk for some t1, t2, . . . , tk ∈ R, and suppose

  • x =
  • 0n. Then for all i, 1 ≤ i ≤ k,

0 = x · xi = (t1 x1 + t2 x2 + · · · + tk xk) · xi = ti xi · xi = ti|| xi||2, since tj xj · xi = 0 for all j, 1 ≤ j ≤ k where j = i. Since xi = 0n and ti|| xi||2 = 0, it follows that ti = 0 for all i, 1 ≤ i ≤ k. Therefore, S is linearly independent.

slide-63
SLIDE 63

Example

Given an arbitrary vector

  • x =

     a1 a2 . . . an      ∈ Rn, it is trivial to express x as a linear combination of the standard basis vectors of Rn, { e1, e2, . . . , en}:

  • x = a1

e1 + a2 e2 + · · · + an en.

slide-64
SLIDE 64

Example

Given an arbitrary vector

  • x =

     a1 a2 . . . an      ∈ Rn, it is trivial to express x as a linear combination of the standard basis vectors of Rn, { e1, e2, . . . , en}:

  • x = a1

e1 + a2 e2 + · · · + an en.

Problem

Given any orthogonal basis B of Rn (so not necessarily the standard basis), and an arbitrary vector x ∈ Rn, how do we express x as a linear combination of the vectors in B?

slide-65
SLIDE 65

Theorem (Fourier Expansion)

Let { f1, f2, . . . , fm} be an orthogonal basis of a subspace U of Rn. Then for any x ∈ U,

  • x =
  • x ·

f1 || f1||2

  • f1 +
  • x ·

f2 || f2||2

  • f2 + · · · +
  • x ·

fm || fm||2

  • fm.

This expression is called the Fourier expansion of x, and

  • x ·

fj || fj||2 , j = 1, 2, . . . , m are the Fourier coefficients.

slide-66
SLIDE 66

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3.

slide-67
SLIDE 67

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?)

slide-68
SLIDE 68

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?) To express x as a linear combination of the vectors of B, apply the Fourier Expansion Theorem. Assume x = t1 f1 + t2 f2 + t3 f3.

slide-69
SLIDE 69

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?) To express x as a linear combination of the vectors of B, apply the Fourier Expansion Theorem. Assume x = t1 f1 + t2 f2 + t3

  • f3. Then

t1 = x · f1 || f1||2 = 2 6, t2 = x · f2 || f2||2 = 3 5, and t3 = x · f3 || f3||2 = 4 30.

slide-70
SLIDE 70

Example

Let f1 =   1 −1 2   , f2 =   2 1  , and f3 =   5 1 −2  , and let x =   1 1 1  . We saw in a Problem 12 that B = { f1, f2, f3} is an orthogonal subset of R3. It follows that B is an orthogonal basis of R3. (Why?) To express x as a linear combination of the vectors of B, apply the Fourier Expansion Theorem. Assume x = t1 f1 + t2 f2 + t3

  • f3. Then

t1 = x · f1 || f1||2 = 2 6, t2 = x · f2 || f2||2 = 3 5, and t3 = x · f3 || f3||2 = 4 30. Therefore,   1 1 1   = 1 3   1 −1 2   + 3 5   2 1   + 2 15   5 1 −2   .

slide-71
SLIDE 71

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any , , since is orthogonal Since is nonzero, we obtain The result now follows. If is an orthonormal basis, then the Fourier coeffjcients are simply , .

slide-72
SLIDE 72

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

  • x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi since is orthogonal Since is nonzero, we obtain The result now follows. If is an orthonormal basis, then the Fourier coeffjcients are simply , .

slide-73
SLIDE 73

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

  • x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi = ti fi · fi since { f1, f2, . . . , fm} is orthogonal = ti|| fi||2. Since is nonzero, we obtain The result now follows. If is an orthonormal basis, then the Fourier coeffjcients are simply , .

slide-74
SLIDE 74

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

  • x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi = ti fi · fi since { f1, f2, . . . , fm} is orthogonal = ti|| fi||2. Since fi is nonzero, we obtain ti = x · fi || fi||2 . The result now follows.

  • If

is an orthonormal basis, then the Fourier coeffjcients are simply , .

slide-75
SLIDE 75

Proof (Fourier Expansion).

Let x ∈ U. Since { f1, f2, . . . , fm} is a basis of U, x = t1 f1 + t2 f2 + · · · + tm fm for some t1, t2, . . . , tm ∈ R. Notice that for any i, 1 ≤ i ≤ m,

  • x ·

fi = (t1 f1 + t2 f2 + · · · + tm fm) · fi = ti fi · fi since { f1, f2, . . . , fm} is orthogonal = ti|| fi||2. Since fi is nonzero, we obtain ti = x · fi || fi||2 . The result now follows.

  • If {

f1, f2, . . . , fm} is an orthonormal basis, then the Fourier coeffjcients are simply tj = x · fj, j = 1, 2, . . . , m.

slide-76
SLIDE 76

Problem

Let

  • f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

  • f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

  • x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Computing for gives us so is an orthogonal set. It follows that is independent, and since dim , also spans . Therefore, is an orthogonal basis of . By the Fourier Expansion Theorem,

slide-77
SLIDE 77

Problem

Let

  • f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

  • f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

  • x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Solution

Computing fi · fj for 1 ≤ i < j ≤ 4 gives us

  • f1 ·

f2 = 0,

  • f1 ·

f3 = 0,

  • f1 ·

f4 = 0,

  • f2 ·

f3 = 0,

  • f2 ·

f4 = 0,

  • f3 ·

f4 = 0, so B is an orthogonal set. It follows that is independent, and since dim , also spans . Therefore, is an orthogonal basis of . By the Fourier Expansion Theorem,

slide-78
SLIDE 78

Problem

Let

  • f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

  • f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

  • x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Solution

Computing fi · fj for 1 ≤ i < j ≤ 4 gives us

  • f1 ·

f2 = 0,

  • f1 ·

f3 = 0,

  • f1 ·

f4 = 0,

  • f2 ·

f3 = 0,

  • f2 ·

f4 = 0,

  • f3 ·

f4 = 0, so B is an orthogonal set. It follows that B is independent, and since |B| = 4 = dim(R4), B also spans R4. Therefore, B is an orthogonal basis of R4. By the Fourier Expansion Theorem,

slide-79
SLIDE 79

Problem

Let

  • f1 =

   1 1    , f2 =    1 −1    , f3 =    1 1    , and

  • f4 =

   1 −1    . Show that B = { f1, f2, f3, f4} is an orthogonal basis of R4, and express

  • x =

a b c d T as a linear combination of f1, f2, f3 and f4.

Solution

Computing fi · fj for 1 ≤ i < j ≤ 4 gives us

  • f1 ·

f2 = 0,

  • f1 ·

f3 = 0,

  • f1 ·

f4 = 0,

  • f2 ·

f3 = 0,

  • f2 ·

f4 = 0,

  • f3 ·

f4 = 0, so B is an orthogonal set. It follows that B is independent, and since |B| = 4 = dim(R4), B also spans R4. Therefore, B is an orthogonal basis of

  • R4. By the Fourier Expansion Theorem,
  • x =

a + b 2

  • f1 +

a − b 2

  • f2 +

c + d 2

  • f3 +

c − d 2

  • f4.