Chapter 7 Norms and Distance Measures Chapter 7 Vector Norms - PowerPoint PPT Presentation

Chapter 7 Norms and Distance Measures Chapter 7

Vector Norms Norms are functions which measure the magnitude or length of a vector. They are commonly used to determine similarities between observations by measuring the distance between them. Find groups of similar observations/customers/products. Classify new objects into known groups. There are many ways to define both distance and similarity between vectors and matrices! Chapter 7

Norms in General A Norm, or distance metric, is a function that takes a vector as input and returns a scalar quantity ( f : R n → R ). A vector norm is typically denoted by two vertical bars surrounding the input vector, � x � , to signify that it is not just any function, but one that satisfies the following criteria: If c is a scalar, then 1 � c x � = | c |� x � The triangle inequality: 2 � x + y � ≤ � x � + � y � � x � = 0 if and only if x = 0. 3 � x � ≥ 0 for any vector x 4 Chapter 7

Euclidean Norm (Euclidean Distance) The Euclidean Norm , also known as the 2 -norm simply measures the Euclidean length of a vector (i.e. a point’s distance from the origin). Let x = ( x 1 , x 2 , . . . , x n ) . Then, � x 2 1 + x 2 2 + · · · + x 2 � x � 2 = n √ x T x . � x � 2 = Often write � ⋆ � rather than � ⋆ � 2 to denote the 2-norm, as it is by far the most commonly used norm. This is merely the “distance formula” from undergraduate mathematics, measuring the distance between the point x and the origin. Chapter 7

Euclidean Norm (Euclidean Distance) ( ( a x= b ||x||=√( a 2 + b2 ) b a Chapter 7

Length vs. Distance Why do we care about the length of a vector? Two Reasons We will often want to make all vectors the same length (A form of standardization). The length of the vector x − y gives the distance between x and y . Chapter 7

Euclidean Distance y | | x -y| | x Chapter 7

Euclidean Distance   x 1 − y 1 x 2 − y 2   x − y =  .  .   .   x n − y n � ( x 1 − y 1 ) 2 + ( x 2 − y 2 ) 2 + · · · + ( x n − y n ) 2 � x − y � = Square Root Sum of Squared Differences between the two vectors. Chapter 7

Suppose I have two vectors in 3-space: x = ( 1 , 1 , 1 ) and y = ( 1 , 0 , 0 ) Then the magnitude of x (i.e. its length or distance from the origin) is √ � 1 2 + 1 2 + 1 2 = � x � 2 = 3 and the magnitude of y is � 1 2 + 0 2 + 0 2 = 1 � y � 2 = and the distance between point x and point y is √ � ( 1 − 1 ) 2 + ( 1 − 0 ) 2 + ( 1 − 0 ) 2 = � x − y � 2 = 2 . Chapter 7

Unit Vectors In this course, we will regularly make use of vectors with length/magnitude equal to 1. These vectors are called unit vectors . For example,       1 0 0 e 1 = 0  , e 2 = 1  , e 3 = 0     0 0 1 are all unit vectors because � e 1 � = � e 2 � = � e 3 � = 1 . Simple enough! Chapter 7

Creating a unit vector If we have some random vector, x , we can always transform it into a unit vector by dividing every element by � x � . For example, take � 3 � x = 4 √ √ 3 2 + 4 2 = Then, � x � = 25 = 5. The new vector, � 3 � u = 1 5 4 is a unit vector: �� 3 � 2 � 2 � � 4 25 + 16 9 � u � = + = 25 = 1 5 5 Note that this implies u T u = 1 Chapter 7

How else can we measure distance? � ⋆ � 1 (1-norm) a.k.a. Taxicab metric, Manhattan Distance, City block distance � ⋆ � ∞ ( ∞ -norm) a.k.a Max norm, Supremum norm, Uniform Norm Mahalanobis Distance (A probabilistic distance that accounts for the variance of variables) Chapter 7

1-norm, � ⋆ � 1 � x � 1 = | x 1 | + | x 2 | + | x 3 | + · · · + | x n | This is often called the city block norm because it measures the distance between points along a rectangular grid (as a taxicab must travel on the streets of Manhattan). Chapter 7

1-norm, � ⋆ � 1 � x � 1 = | x 1 | + | x 2 | + | x 3 | + · · · + | x n | This is often called the city block norm because it measures the distance between points along a rectangular grid (as a taxicab must travel on the streets of Manhattan). So the 1 norm distance between two observations/vectors would be � x − y � 1 = | x 1 − y 1 | + | x 2 − y 2 | + · · · + | x n − y n | Chapter 7

∞ -norm, � ⋆ � ∞ The infinity norm is sometimes called "max distance": � x � ∞ = max {| x 1 | , | x 2 | , | x 3 | , . . . , | x n |} So the max distance between points/vectors x and y would be max {| x 1 − y 1 | , | x 2 − y 2 | , | x 3 − y 3 | , . . . , | x n − y n |} Chapter 7

Mahalanobis Distance Takes into account the distribution of the data, often times comparing distributions of different groups. ? Chapter 7

YES, this stuff is useful! Let’s take a quick look at an application, which we will probably explore for ourselves later. MovieLens is a website devoted to Non-commercial, personalized movie recommendations: https://movielens.org As part of a massive open source project in recommendation system development, this website releases large amounts of it’s data to the public to play with. Chapter 7

User-Rating Matrix User Movie 1 Movie 2 Movie 3 Movie 4 1 5 1 2 2 5 3 3 5 4 5 5 LOTS OF MISSING VALUES!! Chapter 7

What can we do with distance alone? http://lifeislinear.davidson.edu/movieV1.html Chapter 7

Chapter 7 Norms and Distance Measures Chapter 7 Vector Norms - PowerPoint PPT Presentation

Chapter 7 Norms and Distance Measures Chapter 7 Vector Norms Norms are functions which measure the magnitude or length of a vector. They are commonly used to determine similarities between observations by measuring the distance between them.

Topics 11/13/2006 Chapter 11, start Chapter 12 11/20/2006 Chapter 12 11/27/2006 Chapter 13

Topics 11/13/2006 Chapter 11, start Chapter 12 11/20/2006 Chapter 12 Inheritance Concepts

Chapter 13 Chapter 13 1 What is this? Chapter 13 2 What is this? Chapter 13 3 What is

CHAPTER CHAPTER VII CHAPTER CHAPTER VII VII VII MANAGEMENT AND MANAGEMENT AND

Appendix A Chapter 9 versus Chapter 1 1 at a Glance Chapter 9 Chapter 1 1 ( I n) voluntary Cannot

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

Pushdown Automata Chapter 5 Chapter 5 Chapter 5 Chapter 5

Chapter 6 Programme design and development Lets Recap Chapter 2: Chapter 3: Chapter 1:

OWASP London Chapter Meeting 27th July 2017 London Chapter Chapter Leaders: Sam

Constraint Satisfaction Problem s C t i t S ti f ti P bl Reading: Chapter 6 (3 rd ed );

Chapter 3 Chapter 3 Data Description McGraw-Hill, Bluman, 7 th ed, Chapter 3 1 Ch Chapter 3

OWASP London Chapter Meeting 23rd November 2017 London Chapter Chapter Leaders: Sam

A.I.S. Class 22: Outline I Learning Objectives for Chapter 8 I Chapter 8 Quiz I New ACCESS Features

A.I.S. Class 27: Outline I Learning Objectives for Chapter 8 I Chapter 8 Quiz I New ACCESS Features

Chapters for the Final Exam Chapter 20: Electric forces and fields (Conceptual Questions) Chapter

Chapter: 9 9 9 9 Chapter: Chapter: Chapter: High-Speed Downlink High-Speed Downlink Packet

Machine Learning for Computational Linguistics A refresher on linear algebra ar ltekin

Pyramid Vector Quantization for Video Coding Jean-Marc Valin Daala Coding Party Sep 2013

CS 4 7 3 1 / 5 4 3 : Com puter Graphics Lecture 3 ( Part I I I ) : 3 D Modeling: Polygonal Meshes

AM 205: lecture 6 Last time: finished the data fitting topic Todays lecture: numerical

(Un)decidability results on real vector spaces arising from the formalization of mathematics John

Computer Graphics (CS 543) Lecture 3 (Part 1): Linear Algebra for Graphics (Points, Scalars,

Verification Process Training Child Nutrition 2020-2021 1 10/5/2020 Who Do You Call???

Learning objectives View the big picture'' of software quality in the context of a