SLIDE 1

Machine Learning for Signal Processing

Fundamentals of Linear Algebra - 2

Class 3, 8 Sep 2015. Instructor: Bhiksha Raj (11-755/18-797)

SLIDE 2

Overview

  • Vectors and matrices
  • Basic vector/matrix operations
  • Various matrix types
  • Projections
  • More on matrix types
  • Matrix determinants
  • Matrix inversion
  • Eigenanalysis
  • Singular value decomposition
  • Matrix Calculus

SLIDE 3

Orthogonal/Orthonormal vectors

  • Two vectors are orthogonal if they are perpendicular to one another

– A·B = 0
– A vector that is perpendicular to a plane is orthogonal to every vector on the plane

  • Two vectors are orthonormal if

– They are orthogonal
– The length of each vector is 1.0
– Orthogonal vectors can be made orthonormal by normalizing their lengths to 1.0

A = [x y z]T,   B = [u v w]T

A·B = xu + yv + zw
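A minimal NumPy sketch of these definitions (the vectors are made up for illustration): the dot product of orthogonal vectors is 0, and normalizing their lengths to 1.0 makes them orthonormal.

    import numpy as np

    A = np.array([1.0, 2.0, 2.0])   # illustrative vectors, not from the slides
    B = np.array([2.0, 1.0, -2.0])

    print(A @ B)                                              # 0.0 -> orthogonal
    An, Bn = A / np.linalg.norm(A), B / np.linalg.norm(B)
    print(An @ Bn, np.linalg.norm(An), np.linalg.norm(Bn))    # 0.0, 1.0, 1.0 -> orthonormal
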
SLIDE 4

Orthogonal matrices

  • Orthogonal Matrix : AAT = ATA = I

– The matrix is square
– All row vectors are orthonormal to one another

  • Every vector is perpendicular to the hyperplane formed by all other vectors

– All column vectors are also orthonormal to one another
– Observation: In an orthogonal matrix, if the length of the row vectors is 1.0, the length of the column vectors is also 1.0
– Observation: In an orthogonal matrix, no more than one row can have all entries with the same polarity (+ve or –ve)

(Figure: an example 3×3 orthogonal matrix whose three row vectors are all at 90° to one another; the entries are garbled in this extraction.)
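As a quick check of the definition AAT = ATA = I, here is a small NumPy sketch using a 2×2 rotation matrix (my own example; the slide's 3×3 example is not recoverable). It also shows that lengths are preserved.

    import numpy as np

    t = 0.6                                    # rotation angle; rotations are orthogonal
    A = np.array([[np.cos(t), -np.sin(t)],
                  [np.sin(t),  np.cos(t)]])

    print(np.allclose(A @ A.T, np.eye(2)))     # True: A A^T = I
    print(np.allclose(A.T @ A, np.eye(2)))     # True: A^T A = I

    x = np.array([3.0, 4.0])
    print(np.linalg.norm(x), np.linalg.norm(A @ x))   # both 5.0: length preserved
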
SLIDE 5

Orthogonal and Orthonormal Matrices

  • Orthogonal matrices will retain the lengths of, and relative angles between, transformed vectors

– Essentially, they are combinations of rotations, reflections and permutations
– Rotation matrices and permutation matrices are all orthonormal

(Figure: a vector x rotated through angle θ to Ax.)
SLIDE 6

Orthogonal and Orthonormal Matrices

  • If the vectors in the matrix are not unit length, it cannot be orthogonal

– AAT != I, ATA != I
– AAT = Diagonal or ATA = Diagonal, but not both
– If all the vectors are of the same length, we can get AAT = ATA = Diagonal, though

  • A non-square matrix cannot be orthogonal

– AAT=I or ATA = I, but not both

SLIDE 7

Matrix Rank and Rank-Deficient Matrices

  • Some matrices will eliminate one or more dimensions during transformation

– These are rank-deficient matrices
– The rank of the matrix is the dimensionality of the transformed version of a full-dimensional object

(Figure: P * Cone = a flattened, lower-dimensional version of the cone.)
SLIDE 8

Matrix Rank and Rank-Deficient Matrices

  • Some matrices will eliminate one or more dimensions during transformation

– These are rank-deficient matrices
– The rank of the matrix is the dimensionality of the transformed version of a full-dimensional object

(Figure: transformed objects of Rank = 2 and Rank = 1.)
SLIDE 9

Projections are often examples of rank-deficient transforms

  • P = W (WTW)-1 WT ;  Projected Spectrogram = P*M
  • The original spectrogram can never be recovered
  • P is rank deficient
  • P explains all vectors in the new spectrogram as a mixture of only the 4 vectors in W
  • There are a maximum of only 4 linearly independent bases
  • Rank of P is 4

(Figure: spectrogram M and the four note bases W.)
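A small sketch of this slide's claim, with a random tall matrix standing in for the four note bases W (hypothetical data, not the actual spectrogram): the projection matrix P = W (WTW)-1 WT has rank 4 and is idempotent.

    import numpy as np

    rng = np.random.default_rng(0)
    W = rng.standard_normal((100, 4))              # stand-in for the 4 note bases
    P = W @ np.linalg.inv(W.T @ W) @ W.T           # P = W (W^T W)^-1 W^T

    print(np.linalg.matrix_rank(P))                # 4: at most 4 independent bases
    print(np.allclose(P @ P, P))                   # True: projecting again changes nothing
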
SLIDE 10

Non-square Matrices

  • Non-square matrices add or subtract axes

– More rows than columns → add axes

  • But this does not increase the dimensionality of the data

– Fewer rows than columns → reduce axes

  • May reduce dimensionality of the data

X = [x1 x2 … xN ; y1 y2 … yN] : 2-D data
P = a 3×2 transform (entries garbled in this extraction)
PX = [x̂1 x̂2 … x̂N ; ŷ1 ŷ2 … ŷN ; ẑ1 ẑ2 … ẑN] : 3-D data, rank 2
SLIDE 11

Non-square Matrices

  • Non-square matrices add or subtract axes

– More rows than columns  add axes

  • But does not increase the dimensionality of the data

– Fewer rows than columns  reduce axes

  • May reduce dimensionality of the data

X = [x1 x2 … xN ; y1 y2 … yN ; z1 z2 … zN] : 3-D data, rank 3
P = a 2×3 transform (entries garbled in this extraction)
PX = [x̂1 x̂2 … x̂N ; ŷ1 ŷ2 … ŷN] : 2-D data, rank 2
SLIDE 12

The Rank of a Matrix

  • The matrix rank is the dimensionality of the transformation of a full-dimensional object in the original space

  • The matrix can never increase dimensions

– Cannot convert a circle to a sphere or a line to a circle

  • The rank of a matrix can never be greater than the lower of its two dimensions

SLIDE 13

The Rank of a Matrix

  • Projected Spectrogram = P * M
  • Every vector in it is a combination of only 4 bases
  • The rank of the matrix is the smallest number of bases required to describe the output
  • E.g. if note no. 4 in P could be expressed as a combination of notes 1, 2 and 3, it provides no additional information
  • Eliminating note no. 4 would give us the same projection
  • The rank of P would be 3!

(Figure: spectrogram M.)
SLIDE 14

Matrix rank is unchanged by transposition

  • If an N-dimensional object is compressed to a K-dimensional object by a matrix, it will also be compressed to a K-dimensional object by the transpose of the matrix

Example (rank 2): [0.9 0.5 0.8; 0.1 0.4 0.9; 0.42 0.44 0.86] and its transpose [0.9 0.1 0.42; 0.5 0.4 0.44; 0.8 0.9 0.86]
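A quick numerical check of this slide (the matrix entries below are read off the garbled example above, so treat them as illustrative): the rank is 2 and is unchanged by transposition.

    import numpy as np

    A = np.array([[0.90, 0.50, 0.80],
                  [0.10, 0.40, 0.90],
                  [0.42, 0.44, 0.86]])   # third row = 0.4*row1 + 0.6*row2

    print(np.linalg.matrix_rank(A))      # 2
    print(np.linalg.matrix_rank(A.T))    # 2: same rank for the transpose
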
SLIDE 15

Matrix Determinant

  • The determinant is the “volume” of a matrix
  • Actually the volume of a parallelepiped formed from its row vectors

– Also the volume of the parallelepiped formed from its column vectors

  • Standard formula for the determinant: see the textbook

(Figure: parallelograms formed by row vectors r1, r2 and r1+r2.)
SLIDE 16

Matrix Determinant: Another Perspective

  • The determinant is the ratio of N-volumes

– If V1 is the volume of an N-dimensional sphere “O” in N-dimensional space

  • O is the complete set of points or vertices that specify the object

– If V2 is the volume of the N-dimensional ellipsoid specified by A*O, where A is a matrix that transforms the space
– |A| = V2 / V1

(Figure: a sphere of volume V1 is transformed by a matrix A into an ellipsoid of volume V2; the example matrix entries are garbled in this extraction.)
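A small 2-D sketch of the volume-ratio view (my own example matrix): the unit square (V1 = 1) maps to a parallelogram whose area, computed with the shoelace formula, equals |A| = V2 / V1.

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [0.5, 1.5]])                     # illustrative 2x2 transform

    square = np.array([[0, 0], [1, 0], [1, 1], [0, 1]], dtype=float).T
    par = A @ square                               # image of the unit square

    x, y = par[0], par[1]                          # shoelace formula for the image's area
    area = 0.5 * abs(x @ np.roll(y, -1) - y @ np.roll(x, -1))
    print(area, np.linalg.det(A))                  # both 2.5: |A| = V2 / V1
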
SLIDE 17

Matrix Determinants

  • Matrix determinants are only defined for square matrices

– They characterize volumes in linearly transformed space of the same dimensionality as the vectors

  • Rank deficient matrices have determinant 0

– Since they compress full-volumed N-dimensional objects into zero-volume N-dimensional objects

  • E.g. a 3-D sphere into a 2-D ellipse: the ellipse has 0 volume (although it does have area)

  • Conversely, all matrices of determinant 0 are rank deficient

– Since they compress full-volumed N-dimensional objects into zero-volume objects

SLIDE 18

Multiplication properties

  • Properties of vector/matrix products

– Associative
– Distributive
– NOT commutative!!!

  • left multiplications ≠ right multiplications

– Transposition

A(BC) = (AB)C
AB ≠ BA
A(B + C) = AB + AC
(AB)T = BT AT
SLIDE 19

Determinant properties

  • Associative for square matrices

– Scaling the volume sequentially by several matrices is equal to scaling it once by the product of the matrices
  • Volume of sum != sum of Volumes
  • Commutative

– The order in which you scale the volume of an object is irrelevant

|A B C| = |A| |B| |C|
|A B| = |B A| = |A| |B|
|B + C| ≠ |B| + |C|
SLIDE 20

Matrix Inversion

  • A matrix transforms an N-dimensional object to a different N-dimensional object
  • What transforms the new object back to the original?

– The inverse transformation

  • The inverse transformation is called the matrix inverse

(Figure: a transform T maps the original object to a new one; the unknown inverse transform Q = T-1, with entries marked "?", maps it back. The example entries of T are garbled in this extraction.)
SLIDE 21

Matrix Inversion

  • The product of a matrix and its inverse is the identity matrix

– Transforming an object, and then inverse transforming it gives us back the original object

T-1 (T D) = D,   T-1 T = I
(T T-1) D = D,   T T-1 = I
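A minimal check of this property (example matrix chosen by me, not from the slides):

    import numpy as np

    T = np.array([[1.0, 0.5],
                  [0.2, 2.0]])                # an invertible transform
    Tinv = np.linalg.inv(T)

    D = np.array([3.0, -1.0])                 # an "object" (here just one vector)
    print(np.allclose(Tinv @ (T @ D), D))     # True: inverse-transforming undoes the transform
    print(np.allclose(T @ Tinv, np.eye(2)))   # True: T T^-1 = I
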
SLIDE 22

Inverting rank-deficient matrices

  • Rank deficient matrices “flatten” objects

– In the process, multiple points in the original object get mapped to the same point in the transformed object

  • It is not possible to go "back" from the flattened object to the original object

– Because of the many-to-one forward mapping

  • Rank deficient matrices have no inverse

(Example rank-deficient matrix omitted; entries garbled in this extraction.)
SLIDE 23

Rank Deficient Matrices

  • The projection matrix is rank deficient
  • You cannot recover the original spectrogram from the projected one

(Figure: the original spectrogram M and its projection.)
SLIDE 24

Revisiting Projections and Least Squares

  • Projection computes a least squared error estimate
  • For each vector V in the music spectrogram matrix

– Approximation: Vapprox = a*note1 + b*note2 + c*note3…
– Error vector: E = V – Vapprox
– Squared error energy for V: e(V) = norm(E)2

  • Projection computes Vapprox for all vectors such that the total error is minimized

  • But WHAT ARE “a” “b” and “c”?

T = [note1 note2 note3],   Vapprox = T [a b c]T
SLIDE 25

The Pseudo Inverse (PINV)

  • We are approximating spectral vectors V as the transformation of the vector [a b c]T

– Note – we’re viewing the collection of bases in T as a transformation

  • The solution is obtained using the pseudo inverse

– This gives us a LEAST SQUARES solution

  • If T were square and invertible Pinv(T) = T-1, and V=Vapprox

Vapprox = T [a b c]T ≈ V

[a b c]T = PINV(T) * V
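A sketch of the least-squares solution via the pseudo inverse, with random stand-ins for the three note bases and the spectral vector (hypothetical data): NumPy's pinv gives the same [a b c] as an explicit least-squares solve.

    import numpy as np

    rng = np.random.default_rng(1)
    T = rng.standard_normal((1025, 3))        # stand-in for [note1 note2 note3]
    V = rng.standard_normal(1025)             # a spectral vector to approximate

    abc = np.linalg.pinv(T) @ V               # [a b c]^T = PINV(T) * V
    Vapprox = T @ abc                         # least-squares approximation of V

    abc_ls = np.linalg.lstsq(T, V, rcond=None)[0]
    print(np.allclose(abc, abc_ls))           # True: pinv gives the least-squares weights
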
SLIDE 26

Explaining music with one note

  • Recap: P = W (WTW)-1 WT,  Projected Spectrogram = P*M
  • Approximation: M ≈ W*X
  • The amount of W in each vector: X = PINV(W)*M
  • W*Pinv(W)*M = Projected Spectrogram
  • W*Pinv(W) = Projection matrix!!    PINV(W) = (WTW)-1 WT

(Figure: spectrogram M, the single-note basis W, and the weights X = PINV(W)*M.)
SLIDE 27

Explanation with multiple notes

  • X = Pinv(W) * M;   Projected matrix = W*X = W*Pinv(W)*M

(Figure: spectrogram M, the multi-note basis W, and the weights X = PINV(W)*M.)
SLIDE 28

How about the other way?

  • W = M * Pinv(V);   U = W * V

(Figure: given the spectrogram M and the weights V, the unknown bases are W = M*Pinv(V), and U = W*V is the resulting approximation.)
SLIDE 29

Pseudo-inverse (PINV)

  • Pinv() applies to non-square matrices
  • Pinv( Pinv(A) ) = A
  • A*Pinv(A)= projection matrix!

– Projection onto the columns of A

  • If A is a K x N matrix and K > N, A projects N-D vectors into a higher-dimensional K-D space

– Pinv(A) = N x K matrix
– Pinv(A)*A = I in this case

  • Otherwise A * Pinv(A) = I

SLIDE 30

Matrix inversion (division)

  • The inverse of matrix multiplication

– Not element-wise division!!

  • Provides a way to “undo” a linear transformation

– Inverse of the unit matrix is itself
– Inverse of a diagonal matrix is diagonal
– Inverse of a rotation is a (counter)rotation (its transpose!)
– Inverse of a rank-deficient matrix does not exist!

  • But pseudoinverse exists
  • For square matrices: Pay attention to multiplication side!
  • If matrix is not square use a matrix pseudoinverse:

AB = C   →   A = C B-1,   B = A-1 C

AB = C   →   A = C Pinv(B),   B = Pinv(A) C
SLIDE 31

Eigenanalysis

  • If something can go through a process mostly unscathed in character, it is an eigen-something

– Sound example:

  • A vector that can undergo a matrix multiplication and keep pointing the same way is an eigenvector

– Its length can change though

  • How much its length changes is expressed by its corresponding eigenvalue

– Each eigenvector of a matrix has its eigenvalue

  • Finding these "eigenthings" is called eigenanalysis

SLIDE 32

EigenVectors and EigenValues

  • Vectors that do not change angle upon transformation

– They may change length
– V = eigenvector
– λ = eigenvalue

M V = λ V

M = [1.5 0.7; 0.7 1.0]

(Figure: the black vectors are the eigenvectors.)
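A quick check with NumPy (the entries of M are read off the garbled slide, so treat them as illustrative): each eigenvector keeps its direction, scaled by its eigenvalue.

    import numpy as np

    M = np.array([[1.5, 0.7],
                  [0.7, 1.0]])
    vals, vecs = np.linalg.eig(M)             # columns of `vecs` are the eigenvectors

    for lam, v in zip(vals, vecs.T):
        print(np.allclose(M @ v, lam * v))    # True: M V = lambda V
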
SLIDE 33

Eigen vector example

(Figure: eigenvector example.)
SLIDE 34

Matrix multiplication revisited

  • Matrix transformation “transforms” the space

– Warps the paper so that the normals to the two vectors now lie along the axes

A = [1.0 0.07; 1.1 1.2]
SLIDE 35

A stretching operation

  • Draw two lines
  • Stretch / shrink the paper along these lines by factors λ1 and λ2

– The factors could be negative – implies flipping the paper

  • The result is a transformation of the space

(Figure: stretching by factors 1.4 and 0.8.)
SLIDE 36

A stretching operation

  • Draw two lines
  • Stretch / shrink the paper along these lines by factors λ1 and λ2
  • The factors could be negative – implies flipping the paper
  • The result is a transformation of the space
SLIDE 37

A stretching operation

  • Draw two lines
  • Stretch / shrink the paper along these lines by factors λ1 and λ2
  • The factors could be negative – implies flipping the paper
  • The result is a transformation of the space
SLIDE 38

Physical interpretation of eigen vector

  • The result of the stretching is exactly the same as transformation by a matrix
  • The axes of stretching/shrinking are the eigenvectors

– The degree of stretching/shrinking are the corresponding eigenvalues

  • The EigenVectors and EigenValues convey all the information about the matrix
SLIDE 39

Physical interpretation of eigen vector

  • The result of the stretching is exactly the same as transformation by a matrix
  • The axes of stretching/shrinking are the eigenvectors

– The degree of stretching/shrinking are the corresponding eigenvalues

  • The EigenVectors and EigenValues convey all the information about the matrix

M [V1 V2] = [λ1V1 λ2V2]
SLIDE 40

Eigen Analysis

  • Not all square matrices have nice eigenvalues and eigenvectors

– E.g. consider a rotation matrix
– This rotates every vector in the plane

  • No vector remains unchanged
  • In these cases the Eigen vectors and values are complex

X = [x; y],   Xnew = [x'; y'] = Rθ X,   Rθ = [cos θ  -sin θ; sin θ  cos θ]
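A short check of this point: the eigenvalues of a 45° rotation matrix come out as a complex conjugate pair, reflecting that no real vector keeps its direction.

    import numpy as np

    t = np.pi / 4
    R = np.array([[np.cos(t), -np.sin(t)],
                  [np.sin(t),  np.cos(t)]])

    vals, vecs = np.linalg.eig(R)
    print(vals)     # approx 0.707+0.707j and 0.707-0.707j: a complex conjugate pair
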
SLIDE 41

Singular Value Decomposition

  • Matrix transformations convert circles to ellipses
  • Eigen vectors are vectors that do not change direction in the process
  • There is another key feature of the ellipse to the left that carries information about the transform

– Can you identify it?

A = [1.0 0.07; 1.1 1.2]
SLIDE 42

Singular Value Decomposition

  • The major and minor axes of the transformed ellipse define the ellipse

– They are at right angles

  • These are transformations of right-angled vectors on the original circle!

A = [1.0 0.07; 1.1 1.2]
SLIDE 43

Singular Value Decomposition

  • U and V are orthonormal matrices

– Columns are orthonormal vectors

  • S is a diagonal matrix
  • The right singular vectors in V are transformed to the left singular vectors in U

– And scaled by the singular values that are the diagonal entries of S

A = [1.0 0.07; 1.1 1.2]

matlab: [U,S,V] = svd(A);   A = U S VT

(Figure: the right singular vectors V1, V2 on the circle map to s1U1, s2U2 on the ellipse.)
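The NumPy analogue of the matlab call above (the entries of A are read off the garbled slide, so treat them as illustrative): A = U S VT, and A maps the first right singular vector to s1 times the first left singular vector.

    import numpy as np

    A = np.array([[1.0, 0.07],
                  [1.1, 1.2]])
    U, S, Vt = np.linalg.svd(A)                       # S holds the singular values

    print(np.allclose(A, U @ np.diag(S) @ Vt))        # True: A = U S V^T
    print(np.allclose(A @ Vt[0], S[0] * U[:, 0]))     # True: V1 is mapped to s1 U1
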
SLIDE 44

Singular Value Decomposition

  • The left and right singular vectors are not the same

– If A is not a square matrix, the left and right singular vectors will be of different dimensions

  • The singular values are always real
  • The largest singular value is the largest amount by which a vector is scaled by A

– Max (|Ax| / |x|) = smax

  • The smallest singular value is the smallest amount by which a vector is scaled by A

– Min (|Ax| / |x|) = smin
– This can be 0 (for low-rank or non-square matrices)

SLIDE 45

The Singular Values

  • Square matrices: the product of the singular values equals the determinant of the matrix (in magnitude)

– This is also the product of the eigen values
– I.e. there are two different sets of axes whose products give you the area of an ellipse

  • For any "broad" rectangular matrix A, the largest singular value of any square submatrix B cannot be larger than the largest singular value of A

– An analogous rule applies to the smallest singular value
– This property is utilized in various problems, such as compressive sensing

SLIDE 46

SVD vs. Eigen Analysis

  • Eigen analysis of a matrix A:

– Find two vectors such that their absolute directions are not changed by the transform

  • SVD of a matrix A:

– Find two vectors such that the angle between them is not changed by the transform

  • For one class of matrices, these two operations are the same

SLIDE 47

A matrix vs. its transpose

  • Multiplication by matrix A:

– Transforms right singular vectors in V to left singular vectors U

  • Multiplication by its transpose AT:

– Transforms left singular vectors U to right singular vector V

  • AT A (apply A, then AT): converts V to U, then brings it back to V

– Result: Only scaling

(Figure: a vector transformed by A and then by AT returns to its original direction, only scaled; the example matrix entries are garbled in this extraction.)
SLIDE 48

Symmetric Matrices

  • Matrices that do not change on transposition

– Row and column vectors are identical

  • The left and right singular vectors are identical

– U = V
– A = U S UT

  • They are identical to the Eigen vectors of the matrix
  • Symmetric matrices do not rotate the space

– Only scaling and, if Eigen values are negative, reflection

Example: [1.5 0.7; 0.7 1.0]
SLIDE 49

Symmetric Matrices

  • Matrices that do not change on transposition

– Row and column vectors are identical

  • Symmetric matrix: Eigen vectors and Eigen values are always real

  • Eigen vectors are always orthogonal

– At 90 degrees to one another

Example: [1.5 0.7; 0.7 1.0]
SLIDE 50

Symmetric Matrices

  • Eigen vectors point in the direction of the major and minor axes of the ellipsoid resulting from the transformation of a spheroid

– The eigen values are the lengths of the axes

SLIDE 51

Symmetric matrices

  • Eigen vectors Vi are orthonormal

– ViT Vi = 1
– ViT Vj = 0, i != j

  • Listing all eigen vectors in matrix form V

– VT = V-1
– VT V = I
– V VT = I

  • M Vi = λi Vi
  • In matrix form: M V = V Λ

– Λ is a diagonal matrix with all eigen values

  • M = V Λ VT

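A small NumPy check of these identities for a symmetric matrix (the same illustrative matrix as on the earlier slides); eigh is the routine for symmetric matrices.

    import numpy as np

    M = np.array([[1.5, 0.7],
                  [0.7, 1.0]])
    lam, V = np.linalg.eigh(M)                        # real eigenvalues, orthonormal eigenvectors

    print(np.allclose(V.T @ V, np.eye(2)))            # True: V^T V = I
    print(np.allclose(M, V @ np.diag(lam) @ V.T))     # True: M = V Lambda V^T
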
SLIDE 52

Square root of a symmetric matrix

C = V Λ VT   →   Sqrt(C) = V Sqrt(Λ) VT

Sqrt(C) Sqrt(C) = V Sqrt(Λ) VT V Sqrt(Λ) VT = V Sqrt(Λ) Sqrt(Λ) VT = V Λ VT = C
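A sketch of the construction above for a symmetric positive definite matrix (illustrative values):

    import numpy as np

    C = np.array([[1.5, 0.7],
                  [0.7, 1.0]])                        # symmetric, positive definite
    lam, V = np.linalg.eigh(C)

    sqrtC = V @ np.diag(np.sqrt(lam)) @ V.T           # Sqrt(C) = V Sqrt(Lambda) V^T
    print(np.allclose(sqrtC @ sqrtC, C))              # True: Sqrt(C) Sqrt(C) = C
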
SLIDE 53

Definiteness..

  • SVD: Singular values are always positive!
  • Eigen Analysis: Eigen values can be real or complex

– Real, positive Eigen values represent stretching of the space along the Eigen vector
– Real, negative Eigen values represent stretching and reflection (across the origin) of the Eigen vector
– Complex Eigen values occur in conjugate pairs

  • A square (symmetric) matrix is positive definite if all Eigen values are real and greater than 0

– Transformation can be explained as stretching and rotation
– If any Eigen value is zero (and the rest are non-negative), the matrix is positive semi-definite

SLIDE 54

Positive Definiteness..

  • Property of a positive definite matrix: defines inner product norms

– xTAx is always positive for any non-zero vector x if A is positive definite

  • Positive definiteness is a test for validity of Gram matrices

– Such as correlation and covariance matrices
– We will encounter these and other Gram matrices later

SLIDE 55

The Correlation and Covariance Matrices

  • Consider a set of column vectors ordered as a DxN matrix A
  • The correlation matrix is

– C = (1/N) AAT

– Represents the directions in which the “energy” in the signal lies

  • If the average (mean) of the vectors in A is subtracted out of all vectors, C is the covariance matrix

– correlation = covariance + mean * meanT

– Represents the directions in which the “spread” of the signal lies

  • Diagonal elements represent the energy/spread of individual components

– Off diagonal elements represent how two components are related

  • How much knowing one lets us guess the value of the other

(Figure: A AT = C, a D×D matrix whose (k,j)th entry is (1/N) Σi ak,i aj,i and whose kth diagonal entry is (1/N) Σi ak,i2.)
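A quick numerical check of the relation between the two matrices on synthetic data (random vectors with a non-zero mean; not the audio data from the slides):

    import numpy as np

    rng = np.random.default_rng(2)
    D, N = 3, 10000
    A = rng.standard_normal((D, N)) + np.array([[1.0], [2.0], [0.5]])   # D x N data

    corr = (A @ A.T) / N                              # correlation matrix
    mean = A.mean(axis=1, keepdims=True)
    cov = ((A - mean) @ (A - mean).T) / N             # covariance matrix (mean removed)

    print(np.allclose(corr, cov + mean @ mean.T))     # True: correlation = covariance + mean mean^T
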
SLIDE 56

Square root of the Covariance Matrix

  • The square root of the covariance matrix represents the elliptical scatter of the data
  • The Eigenvectors of the matrix represent the major and minor axes

– “Modes” in direction of scatter

SLIDE 57

The Correlation Matrix

  • Projections along the N Eigen vectors with the largest Eigen values represent the N greatest "energy-carrying" components of the matrix
  • Conversely, the N "bases" that result in the least squared error are the N best Eigen vectors

Any vector V = aV,1 * eigenvec1 + aV,2 * eigenvec2 + …      ΣV aV,i2 = eigenvalue(i)
SLIDE 58

An audio example

  • The spectrogram has 974 vectors of dimension 1025

  • The covariance matrix is size 1025 x 1025
  • There are 1025 eigenvectors

SLIDE 59

Eigen Reduction

  • Compute the Correlation
  • Compute Eigen vectors and values
  • Create a matrix from the 25 Eigen vectors corresponding to the 25 highest Eigen values
  • Compute the weights of the 25 eigenvectors
  • To reconstruct the spectrogram, compute the projection onto the 25 Eigen vectors

M = spectrogram                      (1025 x 1000)
C = M MT                             (1025 x 1025)
[V, L] = eig(C)                      (V = 1025 x 1025)
Vreduced = [V1 .. V25]               (1025 x 25)
Mlowdim = Pinv(Vreduced) M           (25 x 1000)
Mreconstructed = Vreduced Mlowdim    (1025 x 1000)
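A sketch of this pipeline in NumPy, with random data standing in for the spectrogram (the sizes mirror the slide). eigh returns eigenvalues in ascending order, so the last 25 columns of V are the ones kept.

    import numpy as np

    rng = np.random.default_rng(3)
    M = rng.standard_normal((1025, 1000))             # stand-in for the spectrogram

    C = (M @ M.T) / M.shape[1]                        # correlation matrix (1025 x 1025)
    lam, V = np.linalg.eigh(C)
    V_reduced = V[:, -25:]                            # 25 eigenvectors with the highest eigenvalues

    M_lowdim = np.linalg.pinv(V_reduced) @ M          # 25 x 1000 weights
    M_reconstructed = V_reduced @ M_lowdim            # 1025 x 1000 least-squares reconstruction
    print(M_lowdim.shape, M_reconstructed.shape)
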
SLIDE 60

Eigenvalues and Eigenvectors

  • Left panel: Matrix with 1025 eigen vectors
  • Right panel: Corresponding eigen values

– Most Eigen values are close to zero

  • The corresponding eigenvectors are “unimportant”

M = spectrogram,   C = M MT,   [V, L] = eig(C)
SLIDE 61

Eigenvalues and Eigenvectors

  • The vectors in the spectrogram are linear combinations of all 1025 Eigen vectors
  • The Eigen vectors with low Eigen values contribute very little

– The average value of ai is proportional to the square root of the Eigen value
– Ignoring these will not affect the composition of the spectrogram

Vec = a1 * eigenvec1 + a2 * eigenvec2 + a3 * eigenvec3 + …
SLIDE 62

An audio example

  • The same spectrogram projected down to the 25 eigen vectors with the highest eigen values

– Only the 25-dimensional weights are shown

  • The weights with which the 25 eigen vectors must be added to compose a least squares approximation to the spectrogram

Vreduced = [V1 .. V25],   Mlowdim = Pinv(Vreduced) M
SLIDE 63

An audio example

  • The same spectrogram constructed from only the 25 Eigen vectors with the highest Eigen values

– Looks similar

  • With 100 Eigen vectors, it would be indistinguishable from the original

– Sounds pretty close
– But now it is sufficient to store 25 numbers per vector (instead of 1024)

Mreconstructed = Vreduced Mlowdim
SLIDE 64

SVD vs. Eigen decomposition

  • SVD cannot in general be derived directly from the Eigen analysis and vice versa
  • But for matrices of the form M = DDT, the Eigen decomposition of M is related to the SVD of D

– SVD: D = U S VT
– DDT = U S VT V S UT = U S2 UT

  • The "left" singular vectors are the Eigen vectors of M

– Show the directions of greatest importance

  • The corresponding singular values of D are the square roots of the Eigen values of M

– Show the importance of the Eigen vector

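A short check of this relationship on a random rectangular matrix (illustrative data): the squared singular values of D match the non-zero eigenvalues of M = DDT.

    import numpy as np

    rng = np.random.default_rng(4)
    D = rng.standard_normal((6, 4))

    U, S, Vt = np.linalg.svd(D, full_matrices=False)   # thin SVD of D
    lam, V = np.linalg.eigh(D @ D.T)                   # eigen decomposition of M = D D^T

    print(np.allclose(np.sort(S**2), np.sort(lam[lam > 1e-10])))   # True: S^2 = non-zero eigenvalues
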
SLIDE 65

Thin SVD, compact SVD, reduced SVD

  • SVD can be computed much more efficiently than Eigen decomposition
  • Thin SVD: only compute the first N columns of U

– All that is required if N < M

  • Compact SVD: only the left and right singular vectors corresponding to non-zero singular values are computed

A (N x M) = U (N x N) · S (N x M) · VT (M x M)
SLIDE 66

Why bother with Eigens/SVD

  • Can provide a unique insight into the data

– Strong statistical grounding
– Can display complex interactions between the data
– Can uncover irrelevant parts of the data we can throw out

  • Can provide basis functions

– A set of elements to compactly describe our data
– Indispensable for performing compression and classification

  • Used over and over and they still perform amazingly well

(Figure: Eigenfaces – using a linear transform of the above "eigenvectors" we can compose various faces.)
SLIDE 67

Trace

  • The trace of a matrix is the sum of the diagonal entries

  • It is equal to the sum of the Eigen values!

A = [a11 a12 a13 a14; a21 a22 a23 a24; a31 a32 a33 a34; a41 a42 a43 a44]

Tr(A) = a11 + a22 + a33 + a44

Tr(A) = Σi ai,i = Σi λi
SLIDE 68

Trace

  • Often appears in Error formulae
  • Useful to know some properties..

D = [d11 d12 d13 d14; …; d41 d42 d43 d44],   C = [c11 c12 c13 c14; …; c41 c42 c43 c44]

E = D – C

error = Σi,j Ei,j2 = Tr(E ET)
SLIDE 69

Properties of a Trace

  • Linearity: Tr(A + B) = Tr(A) + Tr(B);   Tr(c·A) = c·Tr(A)
  • Cyclic invariance:

– Tr(ABCD) = Tr(DABC) = Tr(CDAB) = Tr(BCDA)
– Tr(AB) = Tr(BA)

  • Frobenius norm: F(A) = Σi,j aij2 = Tr(A AT)

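A quick numerical check of these properties on random matrices (illustrative):

    import numpy as np

    rng = np.random.default_rng(5)
    A = rng.standard_normal((4, 4))
    B = rng.standard_normal((4, 4))

    print(np.isclose(np.trace(A), np.linalg.eigvals(A).sum().real))   # trace = sum of eigenvalues
    print(np.isclose(np.trace(A @ B), np.trace(B @ A)))               # Tr(AB) = Tr(BA)
    print(np.isclose((A ** 2).sum(), np.trace(A @ A.T)))              # Frobenius norm^2 = Tr(A A^T)
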
SLIDE 70

Decompositions of matrices

  • Square A: LU decomposition

– Decompose A = L U
– L is a lower triangular matrix

  • All elements above the diagonal are 0

– U is an upper triangular matrix

  • All elements below the diagonal are zero

– Cholesky decomposition: A is symmetric, L = UT

  • QR decomposition: A = QR

– Q is orthogonal: QQT = I
– R is upper triangular

  • Generally used as tools to compute Eigen decompositions or least squares solutions

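A minimal sketch of the three factorizations (assuming SciPy is available for the LU routine; the example matrix is symmetric positive definite so Cholesky also applies):

    import numpy as np
    from scipy.linalg import lu                  # assumption: SciPy is available

    A = np.array([[4.0, 2.0, 1.0],
                  [2.0, 3.0, 0.5],
                  [1.0, 0.5, 2.0]])

    P, L, U = lu(A)                              # A = P L U (L lower, U upper triangular)
    Q, R = np.linalg.qr(A)                       # A = Q R (Q orthogonal, R upper triangular)
    Lc = np.linalg.cholesky(A)                   # A = Lc Lc^T (symmetric case)

    print(np.allclose(P @ L @ U, A), np.allclose(Q @ R, A), np.allclose(Lc @ Lc.T, A))
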
SLIDE 71

Calculus of Matrices

  • Derivative of scalar w.r.t. vector
  • For any scalar z that is a function of a vector x
  • The dimensions of dz / dx are the same as the dimensions of x

x = [x1 … xN]T   (N x 1 vector)

dz/dx = [dz/dx1 … dz/dxN]T   (N x 1 vector)
SLIDE 72

Calculus of Matrices

  • Derivative of scalar w.r.t. matrix
  • For any scalar z that is a function of a matrix X
  • The dimensions of dz / dX are the same as the dimensions of X

X = [x11 x12 x13; x21 x22 x23]   (N x M matrix)

dz/dX = [dz/dx11 dz/dx12 dz/dx13; dz/dx21 dz/dx22 dz/dx23]   (N x M matrix)
SLIDE 73

Calculus of Matrices

  • Derivative of vector w.r.t. vector
  • For any M x 1 vector y that is a function of an N x 1 vector x

  • dy / dx is an MxN matrix

x = [x1 … xN]T,   y = [y1 … yM]T

dy/dx = [dy1/dx1 … dy1/dxN; … ; dyM/dx1 … dyM/dxN]   (M x N matrix)
SLIDE 74

Calculus of Matrices

  • Derivative of vector w.r.t. matrix
  • For any M x 1 vector y that is a function of an N x L matrix X

  • dy / dX is an MxLxN tensor (note order)

y = [y1 … yM]T,   X = [x11 x12 x13; x21 x22 x23]

dy/dX is an M x 3 x 2 tensor whose (i,j,k)th element is dyi / dxk,j
SLIDE 75

Calculus of Matrices

  • Derivative of matrix w.r.t. matrix
  • For any M x K matrix Y that is a function of an N x L matrix X

  • dY / dX is an MxKxLxN tensor (note order)

Y = [y11 y12 y13; y21 y22 y23]

(i,j)th element (for the entry y11) = dy11 / dxj,i
SLIDE 76

In general

  • The derivative of an N1 x N2 x N3 x … tensor w.r.t. an M1 x M2 x M3 x … tensor
  • Is an N1 x N2 x N3 x … x ML x ML-1 x … x M1 tensor

SLIDE 77

Compound Formulae

  • Let Y = f ( g ( h ( X ) ) )
  • Chain rule (note order of multiplication)
  • The # represents a transposition operation

– That is appropriate for the tensor

dY/dX = (dh(X)/dX)# · (dg(h(X))/dh(X))# · df(g(h(X)))/dg(h(X))
SLIDE 78

Example

  • y is N x 1
  • x is M x 1
  • A is N x M
  • Compute dz/dA

– On board

z = ||y – Ax||2
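The slide works this example on the board; below is a small numerical sketch, assuming the intended scalar is z = ||y - Ax||2, whose standard gradient is dz/dA = -2 (y - Ax) xT (an N x M matrix, matching the dimensions of A):

    import numpy as np

    rng = np.random.default_rng(6)
    N, M = 4, 3
    A = rng.standard_normal((N, M))
    x = rng.standard_normal(M)
    y = rng.standard_normal(N)

    z = lambda A_: np.sum((y - A_ @ x) ** 2)      # z = ||y - A x||^2

    analytic = -2.0 * np.outer(y - A @ x, x)      # dz/dA = -2 (y - A x) x^T

    numeric = np.zeros_like(A)                    # central finite differences, entry by entry
    eps = 1e-6
    for i in range(N):
        for j in range(M):
            dA = np.zeros_like(A)
            dA[i, j] = eps
            numeric[i, j] = (z(A + dA) - z(A - dA)) / (2 * eps)

    print(np.allclose(analytic, numeric, atol=1e-5))   # True
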