Background Material
DS-GA 1013 / MATH-GA 2824 Mathematical Tools for Data Science
https://cims.nyu.edu/~cfgranda/pages/MTDS_spring20/index.html
Sreyas Mohan and Carlos Fernandez-Granda

Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Vector space
Consists of:
◮ A set V
◮ A scalar field (usually R or C)
◮ Two operations + and ·
Properties
◮ For any x, y ∈ V, x + y belongs to V
◮ For any x ∈ V and any scalar α, α · x ∈ V
◮ There exists a zero vector 0 such that x + 0 = x for any x ∈ V
◮ For any x ∈ V there exists an additive inverse y such that x + y = 0, usually denoted by −x
Properties
◮ The vector sum is commutative and associative, i.e. for all x, y, z ∈ V

  x + y = y + x,  (x + y) + z = x + (y + z)

◮ Scalar multiplication is associative: for any scalars α and β and any x ∈ V

  α · (β · x) = (α β) · x

◮ Scalar and vector sums are both distributive, i.e. for any scalars α and β and any x, y ∈ V

  (α + β) · x = α · x + β · x,  α · (x + y) = α · x + α · y
Concept Check
Let V = {x | x ∈ R, x ≥ 0}. Define addition for x, y ∈ V as the usual addition x + y, and scalar multiplication for x ∈ V and α ∈ R as the usual scaling α · x. Is V a vector space?
Subspaces
A subspace of a vector space V is any subset of V that is also itself a vector space
Linear dependence/independence
A set of m vectors x_1, x_2, . . . , x_m is linearly dependent if there exist m scalar coefficients α_1, α_2, . . . , α_m which are not all equal to zero and

∑_{i=1}^m α_i x_i = 0

Equivalently, at least one vector in a linearly dependent set can be expressed as a linear combination of the rest
Span
The span of {x_1, . . . , x_m} is the set of all possible linear combinations

span (x_1, . . . , x_m) := { y | y = ∑_{i=1}^m α_i x_i for some scalars α_1, α_2, . . . , α_m }

The span of any set of vectors in V is a subspace of V
Basis and dimension
A basis of a vector space V is a set of linearly independent vectors {x_1, . . . , x_m} such that V = span (x_1, . . . , x_m)

If V has a basis with finite cardinality, then every basis contains the same number of vectors

The dimension dim (V) of V is the cardinality of any of its bases

Equivalently, the dimension is the number of linearly independent vectors that span V
Standard basis
e_1 := (1, 0, . . . , 0),  e_2 := (0, 1, 0, . . . , 0),  . . . ,  e_n := (0, . . . , 0, 1)

The dimension of R^n is n
Concept Check
◮ (True/False) If S is a subset of a vector space V, then span(S) contains the intersection of all subspaces of V that contain S.
◮ The set of all n × n matrices with zero trace forms a subspace W of the space of n × n matrices. Find a basis for W and calculate its dimension.
Concept Check - Answers
◮ True.
◮ We need to enforce that the sum of diagonal entries is zero, i.e. that A_11 + A_22 + · · · + A_nn = 0. A basis is {E_ij}_{i≠j} ∪ {E_ii − E_nn}_{i=1,2,...,n−1}, where E_ij denotes the matrix with a one in entry (i, j) and zeros elsewhere. The dimension of W is n² − 1.
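A quick numerical sanity check (a sketch in NumPy; the variable names and the choice n = 3 are ours): build the proposed basis and verify that its elements are trace-zero and linearly independent.

```python
import numpy as np

n = 3
basis = []
# Off-diagonal matrices E_ij with i != j
for i in range(n):
    for j in range(n):
        if i != j:
            E = np.zeros((n, n))
            E[i, j] = 1.0
            basis.append(E)
# Diagonal matrices E_ii - E_nn for i = 1, ..., n-1
for i in range(n - 1):
    E = np.zeros((n, n))
    E[i, i], E[n - 1, n - 1] = 1.0, -1.0
    basis.append(E)

# Every basis element has zero trace
assert all(np.isclose(np.trace(B), 0) for B in basis)
# Flatten each matrix into a row; the rank counts independent elements
M = np.stack([B.flatten() for B in basis])
print(len(basis), np.linalg.matrix_rank(M))  # both equal n**2 - 1 = 8
```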
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Inner product
Operation ⟨·, ·⟩ that maps a pair of vectors to a scalar
Properties
◮ If the scalar field is R, it is symmetric: for any x, y ∈ V

  ⟨x, y⟩ = ⟨y, x⟩

  If the scalar field is C, then for any x, y ∈ V

  ⟨x, y⟩ = conj(⟨y, x⟩),

  where for any α ∈ C, conj(α) denotes the complex conjugate of α
Properties
◮ It is linear in the first argument, i.e. for any α ∈ R and any x, y, z ∈ V

  ⟨α x, y⟩ = α ⟨x, y⟩,  ⟨x + y, z⟩ = ⟨x, z⟩ + ⟨y, z⟩.

  If the scalar field is R, it is also linear in the second argument
◮ It is positive definite: ⟨x, x⟩ is nonnegative for all x ∈ V, and if ⟨x, x⟩ = 0 then x = 0
Dot product
Inner product between x, y ∈ R^n

x · y := ∑_i x[i] y[i]

R^n endowed with the dot product is usually called a Euclidean space of dimension n

If x, y ∈ C^n

x · y := ∑_i x[i] conj(y[i])
Matrix inner product
The inner product between two m × n matrices A and B is

⟨A, B⟩ := tr (A^T B) = ∑_{i=1}^m ∑_{j=1}^n A_ij B_ij

where the trace of an n × n matrix is defined as the sum of its diagonal

tr (M) := ∑_{i=1}^n M_ii

For any pair of m × n matrices A and B

tr (B^T A) = tr (A B^T)
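A short NumPy sketch of these identities (the random test matrices are ours): the trace formulation, the entrywise sum, and tr(BᵀA) = tr(ABᵀ) can all be checked numerically.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3))
B = rng.standard_normal((4, 3))

# <A, B> = tr(A^T B) equals the sum of entrywise products
ip_trace = np.trace(A.T @ B)
ip_entrywise = np.sum(A * B)
print(np.isclose(ip_trace, ip_entrywise))                 # True

# tr(B^T A) = tr(A B^T): both equal the same inner product
print(np.isclose(np.trace(B.T @ A), np.trace(A @ B.T)))   # True
```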
Function inner product
The inner product between two complex-valued square-integrable functions f, g defined on an interval [a, b] of the real line is

⟨f, g⟩ := ∫_a^b f(x) conj(g(x)) dx
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Norms
Let V be a vector space. A norm is a function ||·|| from V to R with the following properties
◮ It is homogeneous: for any scalar α and any x ∈ V

  ||α x|| = |α| ||x||

◮ It satisfies the triangle inequality

  ||x + y|| ≤ ||x|| + ||y||

  In particular, ||x|| ≥ 0
◮ ||x|| = 0 implies x = 0
Inner-product norm
Square root of the inner product of a vector with itself

||x||_{⟨·,·⟩} := √⟨x, x⟩
Inner-product norm
◮ Vectors in R^n or C^n: the ℓ_2 norm

  ||x||_2 := √(x · x) = √(∑_{i=1}^n |x[i]|²)

◮ Matrices in R^{m×n} or C^{m×n}: the Frobenius norm

  ||A||_F := √(tr (A^T A)) = √(∑_{i=1}^m ∑_{j=1}^n A_ij²)

◮ Square-integrable complex-valued functions: the L_2 norm

  ||f||_{L_2} := √⟨f, f⟩ = √(∫_a^b |f(x)|² dx)
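A minimal NumPy check (our own example vector and matrix) that the first two norms agree with their inner-product definitions:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(5)
A = rng.standard_normal((4, 3))

# l2 norm as the square root of the dot product of x with itself
print(np.isclose(np.linalg.norm(x), np.sqrt(x @ x)))        # True

# Frobenius norm as the square root of tr(A^T A)
print(np.isclose(np.linalg.norm(A, "fro"),
                 np.sqrt(np.trace(A.T @ A))))               # True
```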
Cauchy-Schwarz inequality
For any two vectors x and y in an inner-product space

|⟨x, y⟩| ≤ ||x||_{⟨·,·⟩} ||y||_{⟨·,·⟩}

Assume ||x||_{⟨·,·⟩} ≠ 0. Then

⟨x, y⟩ = − ||x||_{⟨·,·⟩} ||y||_{⟨·,·⟩}  ⟺  y = − (||y||_{⟨·,·⟩} / ||x||_{⟨·,·⟩}) x

⟨x, y⟩ = ||x||_{⟨·,·⟩} ||y||_{⟨·,·⟩}  ⟺  y = (||y||_{⟨·,·⟩} / ||x||_{⟨·,·⟩}) x
ℓ1 and ℓ∞ norms
Norms in R^n or C^n not induced by an inner product

||x||_1 := ∑_{i=1}^n |x[i]|

||x||_∞ := max_i |x[i]|
Norm balls
[Figure: unit norm balls of the ℓ_1, ℓ_2 and ℓ_∞ norms]
Distance
The distance between two vectors x and y induced by a norm ||·|| is d ( x, y) := || x − y||
Classification
Aim: Assign a signal to one of k predefined classes

Training data: n pairs of signals (represented as vectors) and labels: {x_1, l_1}, . . . , {x_n, l_n}
Nearest-neighbor classification

To classify a new signal y, find its nearest neighbor in the training set and assign that signal's label:

i* := arg min_{1 ≤ i ≤ n} d (y, x_i)

[Figure: nearest-neighbor classification of points in the plane]
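A minimal sketch of the rule, assuming signals are stored as the rows of a NumPy array; the function name and the toy data are ours:

```python
import numpy as np

def nearest_neighbor(train_X, train_labels, y):
    """Assign y the label of the closest training vector in l2 distance.

    train_X: (n, d) array of training signals, one per row.
    """
    dists = np.linalg.norm(train_X - y, axis=1)
    return train_labels[np.argmin(dists)]

# Toy usage: two classes of 2-D points
train_X = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.2, 4.9]])
train_labels = np.array([0, 0, 1, 1])
print(nearest_neighbor(train_X, train_labels, np.array([4.8, 5.1])))  # 1
```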
Face recognition
Training set: 360 64 × 64 images from 40 different subjects (9 each)

Test set: 1 new image from each subject

We model each image as a vector in R^4096 and use the ℓ_2-norm distance
Face recognition

[Figure: the training set images]

Nearest-neighbor classification

Errors: 4 / 40

[Figure: each test image next to its closest training image]
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Mean, Variance and Correlation
◮ Consider real-valued data corresponding to a single quantity or feature. We model such data as a scalar continuous random variable.
◮ In reality we usually have access to a finite number of data points, not to a continuous distribution.
◮ The mean of a random variable is the point that minimizes the expected squared distance to the random variable.
◮ Intuitively, it is the center of mass of the probability density, and hence of the dataset.
Mean
Lemma: For any random variable ã with mean E(ã),

E (ã) = arg min_{c ∈ R} E ((c − ã)²).
Proof
Let g(c) := E ((c − ã)²) = c² − 2c E (ã) + E (ã²). We have

g′(c) = 2(c − E(ã)),  g′′(c) = 2.

The function is strictly convex and has a minimum where the derivative equals zero, i.e. when c is equal to the mean.
Variance
The variance of a random variable ã,

Var(ã) := E ((ã − E(ã))²),

quantifies how much it fluctuates around its mean. The standard deviation, defined as the square root of the variance, is therefore a measure of how spread out the dataset is around its center.
Covariance
◮ Consider data containing two features, each represented by a random variable.
◮ The covariance of two random variables ã and b̃ quantifies their joint fluctuations around their respective means:

  Cov(ã, b̃) := E ((ã − E(ã))(b̃ − E(b̃)))
Concept Check: Zero Mean RVs
◮ The space of zero-mean random variables forms a vector space. Why?
◮ What will be the origin (zero vector) of the space?
◮ Does Cov(ã, b̃) define a valid inner product in this space?
Vector Space of Zero Mean RVs
◮ Zero-mean random variables form a vector space because linear combinations of zero-mean random variables are also zero mean.
◮ The origin of the vector space (the zero vector) is the random variable equal to zero with probability one.
◮ The covariance is a valid inner product because it is (1) symmetric, (2) linear in its first argument, i.e. for any α ∈ R, E(α ã b̃) = α E(ã b̃), and (3) positive definite, i.e. E(ã²) > 0 if ã ≠ 0 and E(ã²) = 0 if and only if ã = 0 with probability one. To prove this last property, we use a fundamental inequality in probability theory.
Markov’s Inequality
Theorem (Markov’s inequality)
Let r̃ be a nonnegative random variable. For any positive constant c > 0,

P(r̃ ≥ c) ≤ E(r̃) / c.
Proof

Consider the indicator variable 1_{r̃ ≥ c}. We have

r̃ − c 1_{r̃ ≥ c} ≥ 0.

By linearity of expectation and the fact that 1_{r̃ ≥ c} is a Bernoulli random variable with expectation P(r̃ ≥ c), we have

E(r̃) ≥ c E (1_{r̃ ≥ c}) = c P(r̃ ≥ c).
Corollary

If the mean square E (ã²) of a random variable ã equals zero, then P(ã ≠ 0) = 0.

Proof:
◮ If P(ã ≠ 0) ≠ 0 then there exists an ε > 0 such that P(ã² ≥ ε) ≠ 0.
◮ This is impossible.
◮ Applying Markov's inequality to the nonnegative random variable ã², we have

  P(ã² ≥ ε) ≤ E (ã²) / ε = 0.
Correlation Coefficient

◮ When comparing two vectors, a natural measure of their similarity is the cosine of the angle between them, which ranges from −1 to 1.
◮ The cosine equals the inner product between the vectors normalized by their norms.
◮ In the vector space of zero-mean random variables this quantity is called the correlation coefficient,

  ρ_{ã,b̃} := Cov(ã, b̃) / √(Var(ã) Var(b̃))

◮ −1 ≤ ρ_{ã,b̃} ≤ 1. Why?
Cauchy-Schwarz inequality for random variables
Theorem (Cauchy-Schwarz inequality for random variables)
Let ã and b̃ be two random variables. Their correlation coefficient satisfies

−1 ≤ ρ_{ã,b̃} ≤ 1,

with equality if and only if b̃ is a linear function of ã with probability one.
Proof

Consider the standardized random variables (centered and normalized),

s(ã) := (ã − E(ã)) / √Var(ã),  s(b̃) := (b̃ − E(b̃)) / √Var(b̃).

The mean square distance between them equals

E ((s(b̃) − s(ã))²) = E (s(ã)²) + E (s(b̃)²) − 2 E (s(ã) s(b̃))
                   = 2 (1 − E (s(ã) s(b̃)))
                   = 2 (1 − ρ_{ã,b̃}).

This implies that ρ_{ã,b̃} ≤ 1. Why? (The left-hand side is nonnegative.)
Proof

◮ E ((s(b̃) − s(ã))²) = 2 (1 − ρ_{ã,b̃})
◮ Recall that if the mean square E (ã²) of a random variable ã equals zero, then P(ã ≠ 0) = 0.
◮ When ρ_{ã,b̃} = 1, E ((s(b̃) − s(ã))²) = 0. This means that s(ã) = s(b̃) with probability one, which implies the linear relationship.
◮ Similarly, using

  E ((s(b̃) − (−s(ã)))²) = 2 (1 + ρ_{ã,b̃}),

  the same argument applies when ρ_{ã,b̃} = −1.
Geometric Interpretation of Correlation Coefficient
[Figure: s(b̃) decomposed as its component ρ_{ã,b̃} s(ã) along s(ã) plus the orthogonal residual s(b̃) − ρ_{ã,b̃} s(ã)]
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Sample mean, variance and correlation
◮ When analyzing data we do not have access to a probability distribution, but rather to a set of points.
◮ We adapt our previous analysis to this setting.
◮ Main idea: approximate expectations by averaging over the data.
Sample mean, variance and correlation
◮ Consider a dataset containing n data points with two real-valued features, (a_1, b_1), . . . , (a_n, b_n). Let A := {a_1, . . . , a_n} and B := {b_1, . . . , b_n}.
◮ Sample mean:

  av (A) := (1/n) ∑_{i=1}^n a_i

◮ Sample covariance:

  cov (A, B) := (1/n) ∑_{i=1}^n (a_i − av(A))(b_i − av(B))

◮ Sample variance:

  var (A) := (1/n) ∑_{i=1}^n (a_i − av(A))²
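A sketch of these estimators in NumPy (the function and the synthetic data are ours). Note that the slides use the 1/n normalization, so we compute everything by hand rather than relying on library defaults (np.var uses 1/n, but np.cov uses 1/(n−1)).

```python
import numpy as np

def sample_stats(a, b):
    """Sample mean, variance, covariance and correlation coefficient."""
    av_a, av_b = np.mean(a), np.mean(b)
    var_a = np.mean((a - av_a) ** 2)
    var_b = np.mean((b - av_b) ** 2)
    cov_ab = np.mean((a - av_a) * (b - av_b))
    rho = cov_ab / np.sqrt(var_a * var_b)
    return av_a, var_a, cov_ab, rho

rng = np.random.default_rng(0)
a = rng.standard_normal(1000)
b = 0.8 * a + 0.6 * rng.standard_normal(1000)  # correlated with a
print(sample_stats(a, b))  # rho should be close to 0.8
```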
Sample mean converges to true mean
Theorem (Sample mean converges to true mean)
Let Ã_n contain n iid copies ã_1, . . . , ã_n of a random variable ã with finite variance. Then

lim_{n→∞} E ((av(Ã_n) − E(ã))²) = 0.
Proof

By linearity of expectation,

E (av(Ã_n)) = (1/n) ∑_{i=1}^n E(ã_i) = E(ã),

which implies

E ((av(Ã_n) − E(ã))²) = Var (av(Ã_n))
                      = (1/n²) ∑_{i=1}^n Var(ã_i)   (by independence)
                      = Var(ã) / n.
The same proof can be applied to the sample variance and the sample covariance, under the assumption that higher-order moments of the distribution are bounded.
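A Monte Carlo sketch of this rate (the distribution and sample sizes are our own choices): the mean square error of the sample mean should track Var(ã)/n.

```python
import numpy as np

rng = np.random.default_rng(0)
var_a = 4.0  # variance of a ~ N(1, 4)
for n in [10, 100, 1000]:
    # 2000 independent datasets of n iid samples each
    samples = rng.normal(loc=1.0, scale=2.0, size=(2000, n))
    mse = np.mean((samples.mean(axis=1) - 1.0) ** 2)
    print(n, mse, var_a / n)  # mse stays close to Var(a)/n
```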
Sample Mean is the Center
Lemma (The sample mean is the center)
For any set of real numbers A := {a_1, . . . , a_n},

av (A) = arg min_{c ∈ R} ∑_{i=1}^n (c − a_i)².
Proof
Let f (c) := ∑_{i=1}^n (c − a_i)². We have

f ′(c) = 2 ∑_{i=1}^n (c − a_i) = 2 (nc − ∑_{i=1}^n a_i),

f ′′(c) = 2n.

The function is strictly convex and has a minimum where the derivative equals zero, i.e. when c is equal to the sample mean.
Proof
◮ Note that the proof is essentially the same as that of the probabilistic setting.
◮ The reason is that both the expectation and the averaging operator are linear.
◮ Analogously to the probabilistic setting, we can show that the sample covariance is a valid inner product between centered sets of samples, and the sample standard deviation, defined as the square root of the sample variance, is its corresponding norm. The sample correlation coefficient is

  ρ_{A,B} := cov(A, B) / √(var(A) var(B))
Correlation coefficient
[Figure: scatterplots of two-feature datasets with sample correlation coefficients ρ_{A,B} = 0.50, 0.90, 0.99 (top row) and 0.00, −0.90, −0.99 (bottom row)]
Oxford Data
ρ = 0.962  ρ = 0.019  ρ = −0.468

[Figure: scatterplots of Oxford weather data, raw (top) and standardized (bottom): maximum vs. minimum temperature, maximum temperature vs. rain, and August temperature vs. August rain]
Oxford Data - Takeaways
◮ The maximum temperature is highly correlated with the minimum temperature (ρ = 0.962).
◮ Rainfall is almost uncorrelated with the maximum temperature (ρ = 0.019), but this does not mean that the two quantities are not related; the relation is just not linear.
◮ When we only consider the rain and temperature in August, the two quantities are linearly related to some extent. Their correlation is negative (ρ = −0.468): when it is warmer it tends to rain less.
◮ If the relationship between a pair of features were perfectly linear, the points would lie on the dashed red diagonal lines.
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Orthogonality
Two vectors x and y are orthogonal if and only if

⟨x, y⟩ = 0

A vector x is orthogonal to a set S if

⟨x, s⟩ = 0  for all s ∈ S

Two sets S_1, S_2 are orthogonal if for any x ∈ S_1, y ∈ S_2

⟨x, y⟩ = 0

The orthogonal complement of a subspace S is

S^⊥ := { x | ⟨x, y⟩ = 0 for all y ∈ S }
Pythagorean theorem
If x and y are orthogonal,

||x + y||²_{⟨·,·⟩} = ||x||²_{⟨·,·⟩} + ||y||²_{⟨·,·⟩}
Orthonormal basis
A basis of mutually orthogonal vectors with inner-product norm equal to one

If {u_1, . . . , u_n} is an orthonormal basis of a vector space V, then for any x ∈ V

x = ∑_{i=1}^n ⟨u_i, x⟩ u_i
Gram-Schmidt
Builds an orthonormal basis from a set of linearly independent vectors x_1, . . . , x_m in R^n

1. Set u_1 := x_1 / ||x_1||_2
2. For i = 2, . . . , m, compute

   v_i := x_i − ∑_{j=1}^{i−1} ⟨u_j, x_i⟩ u_j

   and set u_i := v_i / ||v_i||_2
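A minimal NumPy implementation of the procedure (the function name is ours; it assumes the columns of X are linearly independent):

```python
import numpy as np

def gram_schmidt(X):
    """Orthonormalize the linearly independent columns of X.

    Returns U with orthonormal columns spanning the same subspace.
    """
    U = np.zeros_like(X, dtype=float)
    for i in range(X.shape[1]):
        # Subtract the projections onto the previously built u_1, ..., u_{i-1}
        v = X[:, i] - U[:, :i] @ (U[:, :i].T @ X[:, i])
        U[:, i] = v / np.linalg.norm(v)
    return U

X = np.array([[1.0, 1.0], [1.0, 0.0], [0.0, 1.0]])
U = gram_schmidt(X)
print(np.allclose(U.T @ U, np.eye(2)))  # True: columns are orthonormal
```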
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Orthogonal projection
The orthogonal projection of x onto a subspace S is a vector, denoted by P_S x, such that P_S x ∈ S and

x − P_S x ∈ S^⊥

The orthogonal projection is unique
Orthogonal projection

Any vector x can be decomposed into

x = P_S x + P_{S^⊥} x.

For any orthonormal basis b_1, . . . , b_m of S,

P_S x = ∑_{i=1}^m ⟨x, b_i⟩ b_i

The orthogonal projection is a linear operation: for any x and y,

P_S (x + y) = P_S x + P_S y
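A sketch of the basis-expansion formula in NumPy, assuming the orthonormal basis vectors are stored as the columns of a matrix U (the helper name is ours):

```python
import numpy as np

def project(U, x):
    """Project x onto the span of the orthonormal columns of U.

    Implements P_S x = sum_i <b_i, x> b_i with the b_i as columns of U.
    """
    return U @ (U.T @ x)

# S = span of the first two standard basis vectors of R^3
U = np.eye(3)[:, :2]
x = np.array([1.0, 2.0, 3.0])
p = project(U, x)
print(p)                                   # [1. 2. 0.]
print(np.allclose(U.T @ (x - p), 0))       # True: residual lies in S-perp
```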
Orthogonal projection is closest
The orthogonal projection P_S x of a vector x onto a subspace S is the solution to the optimization problem

minimize_u  ||x − u||_{⟨·,·⟩}
subject to  u ∈ S
Proof

Take any point s ∈ S such that s ≠ P_S x. Then

||x − s||²_{⟨·,·⟩} = ||x − P_S x + P_S x − s||²_{⟨·,·⟩}
                  = ||x − P_S x||²_{⟨·,·⟩} + ||P_S x − s||²_{⟨·,·⟩}
                  > ||x − P_S x||²_{⟨·,·⟩}

The second equality is the Pythagorean theorem, since x − P_S x ∈ S^⊥ and P_S x − s ∈ S; the strict inequality holds because s ≠ P_S x.
Vector spaces Inner product Norms Mean, Variance and Correlation Sample mean, variance and correlation Orthogonality Orthogonal projection Denoising
Denoising
Aim: Estimate a signal from perturbed measurements

If the noise is additive, the data are modeled as the sum of the signal x and a perturbation z:

y := x + z

The goal is to estimate x from y

Assumptions about the signal and noise structure are necessary
Denoising via orthogonal projection
Assumption: The signal is well approximated as belonging to a predefined subspace S

Estimate: P_S y, the orthogonal projection of the noisy data onto S

Error:

||x − P_S y||²_2 = ||P_{S^⊥} x||²_2 + ||P_S z||²_2
Proof

x − P_S y = x − P_S x − P_S z = P_{S^⊥} x − P_S z

The two terms are orthogonal (P_{S^⊥} x ∈ S^⊥ and P_S z ∈ S), so the error formula follows from the Pythagorean theorem.
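A numerical sketch of this error decomposition (the subspace, signal, and noise below are synthetic choices of ours):

```python
import numpy as np

rng = np.random.default_rng(0)
d, k = 50, 5
U = np.linalg.qr(rng.standard_normal((d, k)))[0]  # orthonormal basis of S

x = U @ rng.standard_normal(k) + 0.1 * rng.standard_normal(d)  # almost in S
z = 0.2 * rng.standard_normal(d)                               # additive noise
y = x + z

P = U @ U.T                       # orthogonal projector onto S
err = np.sum((x - P @ y) ** 2)    # squared estimation error
decomp = np.sum(((np.eye(d) - P) @ x) ** 2) + np.sum((P @ z) ** 2)
# True: ||x - P_S y||^2 = ||(I - P) x||^2 + ||P z||^2
print(np.isclose(err, decomp))
```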
Error
[Figure: geometry of the error: the data y = x + z, its projection P_S y onto S, and the error components P_{S^⊥} x and P_S z]
Face denoising
Training set: 360 64 × 64 images from 40 different subjects (9 each)

Noise: iid Gaussian noise with

SNR := ||x||_2 / ||z||_2 = 6.67

We model each image as a vector in R^4096
Face denoising
We denoise by projecting onto:
◮ S_1: the span of the 9 images from the same subject
◮ S_2: the span of the 360 images in the training set

Test error:

||x − P_{S_1} y||_2 / ||x||_2 = 0.114
||x − P_{S_2} y||_2 / ||x||_2 = 0.078
S_1

S_1 := span of the 9 training images from the same subject [figure: the 9 images]
Denoising via projection onto S_1

[Figure: decompositions into projections onto S_1 and S_1^⊥, with norms relative to ||x||_2: the signal x has components of relative norm 0.993 (onto S_1) and 0.114 (onto S_1^⊥); the noise z has components 0.007 (onto S_1) and 0.150 (onto S_1^⊥); the data y = x + z and the resulting estimate P_{S_1} y are shown alongside]
S_2

S_2 := span of all 360 images in the training set [figure: a sample of the images]

Denoising via projection onto S_2

[Figure: decompositions into projections onto S_2 and S_2^⊥, with norms relative to ||x||_2: the signal x has components of relative norm 0.998 (onto S_2) and 0.063 (onto S_2^⊥); the noise z has components 0.043 (onto S_2) and 0.144 (onto S_2^⊥); the data y = x + z and the resulting estimate P_{S_2} y are shown alongside]
P_{S_1} z and P_{S_2} z

[Figure: the noise projections P_{S_1} z and P_{S_2} z]

0.007 = ||P_{S_1} z||_2 / ||x||_2 < ||P_{S_2} z||_2 / ||x||_2 = 0.043

0.043 / 0.007 = 6.14 ≈ √(dim(S_2) / dim(S_1))   (not a coincidence)
P_{S_1^⊥} x and P_{S_2^⊥} x

[Figure: the signal components P_{S_1^⊥} x and P_{S_2^⊥} x]

0.063 = ||P_{S_2^⊥} x||_2 / ||x||_2 < ||P_{S_1^⊥} x||_2 / ||x||_2 = 0.190
P_{S_1} y and P_{S_2} y

[Figure: the original image x next to the estimates P_{S_1} y and P_{S_2} y]