mathematical background
play

Mathematical Background Lijun Zhang zlj@nju.edu.cn - PowerPoint PPT Presentation

Mathematical Background Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Norms Analysis Functions Derivatives Linear Algebra Inner product Inner product on


  1. Mathematical Background Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj

  2. Outline  Norms  Analysis  Functions  Derivatives  Linear Algebra

  3. Inner product �  Inner product on � � � � � ���  Euclidean norm, or � -norm � � �/� � �/� � � � �  Cauchy-Schwartz inequality � � � � �  Angle between nonzero vectors 𝑦 � 𝑧 ∠�𝑦, 𝑧� � cos �� , 𝑦, 𝑧 ∈ 𝐒 � 𝑦 � 𝑧 �

  4. Inner product ��� , ���  Inner product on � � ⟨𝑌, 𝑍⟩ � tr 𝑌 � 𝑍 � � � 𝑌 �� 𝑍 �� ��� ��� Here tr� � denotes trace of a matrix. ���  Frobenius norm of a matrix �/� � � �/� � � 𝑌 � � tr 𝑌 � 𝑌 � � 𝑌 �� ��� ��� �  Inner product on � � � ⟨𝑌, 𝑍⟩ � tr�𝑌𝑍� � � � 𝑌 �� 𝑍 �� � � 𝑌 �� 𝑍 �� � 2 � 𝑌 �� 𝑍 �� ��� ��� ��� ���

  5. Norms  A function 𝑔: 𝐒 � → 𝐒 with dom 𝑔 � 𝐒 � is called a norm if  𝑔 is nonnegative: 𝑔�𝑦� � 0 for all 𝑦 ∈ 𝐒 �  𝑔 is definite: 𝑔�𝑦� � 0 only if 𝑦 � 0  𝑔 is homogeneous: 𝑔�𝑢𝑦� � |𝑢|𝑔�𝑦�, for all 𝑦 ∈ 𝐒 � and 𝑢 ∈ 𝐒  𝑔 satisfies the triangle inequality: 𝑔�𝑦 � 𝑧� � 𝑔�𝑦� � 𝑔�𝑧�, for all 𝑦, 𝑧 ∈ 𝐒 �  Distance  Between vectors 𝑦 and 𝑧 as the length of their difference, i.e., dist�𝑦, 𝑧� � 𝑦 � 𝑧

  6. Norms  Unit ball  The set of all vectors with norm less than or equal to one, ℬ � 𝑦 ∈ 𝐒 � | 𝑦 � 1 is called the unit ball of the norm ∥⋅∥ .  The unit ball satisfies the following properties:  ℬ is symmetric about the origin, i.e., 𝑦 ∈ ℬ if and only if � 𝑦 ∈ ℬ  ℬ is convex  ℬ is closed, bounded, and has nonempty interior Conversely, if 𝐷 ⊆ 𝐒 � is any set satisfying these  three conditions, the it is the unit ball of a norm: � �sup 𝑢 � 0 𝑢𝑦 ∈ 𝐷 � �� 𝑦

  7. Norms �  Some common norms on  Sum-absolute-value, or � -norm � � � �  Chebyshev or � -norm � � �  � -norm � � � � �/� � � , -quadratic norm is  For �� � �/� �/� � �

  8. Norms ���  Some common norms on  Sum-absolute-value norm � � ��� �� ��� ���  Maximum-absolute-value norm ��� ��

  9. Norms  Equivalence of norms  Suppose that � and � are norms � , there exist positive constants on � and , for all � � � � , then there  If is any norm on exists a quadratic norm � for which � � holds for all .

  10. Norms  Operator norms �  Suppose � and � are norms on � , respectively. Operator norm of and ��� induced by � and � is �,� � �  When � and � are Euclidean norms, the operator norm of is its maximum singular value, and is denoted � �/� 𝑌 � � 𝜏 ��� �𝑌� � 𝜇 ��� 𝑌 � 𝑌  Spectral norm or ℓ � -norm

  11. Norms  Operator norms  The norm induced by the ℓ � -norm on 𝐒 � and 𝐒 � , denoted 𝑌 � , is the max-row-sum norm, � 𝑌 � � sup 𝑌𝑣 � | 𝑣 � � 1 � max ���,…,� � 𝑌 �� ���  The norm induced by the ℓ � -norm on 𝐒 � and 𝐒 � , denoted 𝑌 � , is the max-column-sum norm, � 𝑌 � � max ���,…,� ∑ 𝑌 �� ���

  12. Norms  Dual norm � .  Let be a norm on  The associated dual norm, denoted ∗ , is defined as � ∗  We have the inequality � ∗  The dual of Euclidean norm sup 𝑨 � 𝑦| 𝑦 � � 1 � 𝑨 �  The dual of the � -norm sup 𝑨 � 𝑦| 𝑦 � � 1 � 𝑨 �

  13. Norms  Dual Norm  The dual of � -norm is the � -norm such that ��� is the  The dual of the � -norm on nuclear norm � �∗ � � �/� � �

  14. Outline  Norms  Analysis  Functions  Derivatives  Linear Algebra

  15. Analysis  Interior and Open Set  An element 𝑦 ∈ 𝐷 ⊆ 𝐒 � is called an interior point of 𝐷 if there exists an 𝜗 � 0 for which 𝑧 𝑧 � 𝑦 � � 𝜗� ⊆ 𝐷 i.e., there exists a ball centered at 𝑦 that lies entirely in 𝐷 .  The set of all points interior to 𝐷 is called the interior of 𝐷 and is denoted int 𝐷 .  A set is open if

  16. Analysis  Closed Set and Boundary �  A set is closed if its complement is open 𝐒 � ∖ 𝐷 � 𝑦 ∈ 𝐒 � |𝑦 ∉ 𝐷  The closure of a set 𝐷 is defined as cl 𝐷 � 𝐒 � ∖ int�𝐒 𝐨 ∖ 𝐷�  The boundary of the set 𝐷 is defined as bd 𝐷 � cl 𝐷 ∖ int 𝐷  𝐷 is closed if it contains its boundary. It is open if it contains no boundary points.

  17. Analysis  Supremum and infimum  The least upper bound or supremum of the set is denoted .  The greatest lower bound or infimum of the set is denoted .

  18. Outline  Norms  Analysis  Functions  Derivatives  Linear Algebra

  19. �  An example �� � Functions  Notation  

  20. Functions  Continuity � is continuous at �  A function if for all there exists a with , such that � �  Closed functions �  A function is closed if, for each , the sublevel set is closed. This is equivalent to ���

  21. Outline  Norms  Analysis  Functions  Derivatives  Linear Algebra

  22. Derivatives  Definition � and �  Suppose . The function is differentiable at if there ��� that satisfies exists a matrix 𝑔 𝑨 � 𝑔 𝑦 � 𝐸𝑔 𝑦 𝑨 � 𝑦 � lim � 0 𝑨 � 𝑦 � �∈��� �, ���, �→� in which case we refer to as the derivative (or Jacobian) of at .

  23. Derivatives  Definition  The affine function of given by is called the first-order approximation of at (or near) . � �� �

  24. Derivatives  Gradient When 𝑔 is real-valued (i.e., 𝑔: 𝐒 � → 𝐒 ) the  derivative 𝐸𝑔�𝑦� is a 1 � 𝑜 matrix (it is a row vector). Its transpose is called the gradient of the function: 𝛼𝑔�𝑦� � 𝐸𝑔�𝑦� � which is a column vector (in 𝐒 � ). Its components are the partial derivatives of 𝑔 : 𝛼𝑔�𝑦� � � 𝜖𝑔�𝑦� , 𝑗 � 1, ⋯ , 𝑜 𝜖𝑦 � The first-order approximation of 𝑔 at a point 𝑦 ∈  int dom 𝑔 can be expressed as (the affine function of 𝑨 ) 𝑔�𝑦� � 𝛼𝑔�𝑦� � �𝑨 � 𝑦�

  25. Derivatives  Examples 𝑔 𝑦 � 1 2 𝑦 � 𝑄𝑦 � 𝑟 � 𝑦 � 𝑠 𝛼𝑔 𝑦 � 𝑄𝑦 � 𝑟 � 𝑔 𝑌 � log det 𝑌 , dom 𝑔 � 𝐓 �� 𝛼𝑔 𝑌 � 𝑌 ��

  26. Derivatives  Chain rule Suppose 𝑔: 𝐒 � → 𝐒 � is differentiable at 𝑦 ∈ int  dom 𝑔 and 𝑕: 𝐒 � → 𝐒 � is differentiable at 𝑔�𝑦� ∈ int dom 𝑕 . Define the composition ℎ: 𝐒 � → 𝐒 � by ℎ�𝑨� � 𝑕�𝑔�𝑨�� . Then ℎ is differentiable at 𝑦 , with derivate 𝐸ℎ�𝑦� � 𝐸𝑕�𝑔�𝑦��𝐸𝑔�𝑦� Suppose 𝑔: 𝐒 � → 𝐒 , 𝑕: 𝐒 → 𝐒 , and ℎ 𝑦 � 𝑕�𝑔 𝑦 �  𝛼ℎ 𝑦 � 𝑕 � 𝑔 𝑦 𝛼𝑔�𝑦�

  27. Derivatives  Composition of Affine Function 𝑕 𝑦 � 𝑔�𝐵𝑦 � 𝑐� 𝛼𝑕 𝑦 � 𝐵 � 𝛼𝑔�𝐵𝑦 � 𝑐� 𝑔: 𝐒 � → 𝐒, 𝑕: 𝐒 → 𝐒 𝑦, 𝑤 ∈ 𝐒 � 𝑕 𝑢 � 𝑔 𝑦 � 𝑢𝑤 , 𝑕′ 𝑢 � 𝑤 � 𝛼𝑔 𝑦 � 𝑢𝑤

  28. Example 1 �  Consider the function � � 𝑦 � 𝑐 � � 𝑔 𝑦 � log � exp �𝑏 � ��� �  where � �  � 𝑕 𝑧 � log � exp �𝑧 � � ��� exp 𝑧 � 1 ⋮ 𝛼𝑕 𝑧 � � ∑ exp 𝑧 � exp 𝑧 � ���

  29. Example 1 �  Consider the function � � 𝑦 � 𝑐 � � 𝑔 𝑦 � log � exp �𝑏 � ��� �  where � �  1 𝛼𝑔 𝑦 � 𝐵 � 𝛼𝑕 𝐵𝑦 � 𝑐 � 1 � 𝑨 𝐵 � 𝑨 � 𝑦 � 𝑐 � exp 𝑏 � 𝑨 � ⋮ � 𝑦 � 𝑐 � exp 𝑏 �

  30. Example 2  Consider the function � � � � � �  where � �  � � � � � �� � � � �� � �� �

  31. Second Derivative  Definition  Suppose 𝑔: 𝐒 � → 𝐒 . The second derivative or Hessian matrix of 𝑔 at 𝑦 ∈ int dom 𝑔 , denoted 𝛼 � 𝑔�𝑦� , is given by 𝛼 � 𝑔�𝑦� �� � 𝜖 � 𝑔�𝑦� , 𝑗 � 1, ⋯ , 𝑜, 𝑘 � 1, ⋯ , 𝑜. 𝜖𝑦 � 𝜖𝑦 �  Second-order Approximation 𝑔�𝑦� � 𝛼𝑔�𝑦� � 𝑨 � 𝑦 � 1 2 𝑨 � 𝑦 � 𝛼 � 𝑔�𝑦��𝑨 � 𝑦�

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend