Low rank SDP extreme points and Applications Mohit Singh Georgia - PowerPoint PPT Presentation

Low rank SDP extreme points and Applications Mohit Singh Georgia Tech

SDP extreme points • Pataki, Gábor. "On the rank of extreme matrices in semidefinite programs and the multiplicity of optimal eigenvalues." Math of OR ‘98 . • Barvinok, Alexander. "Problems of distance geometry and convex properties of quadratic maps." Discrete & Computational Geometry ‘95 . • Applications. • S-lemma [Yakubovich’ 71] • Fair Dimensionality Reduction in Data Analysis [Tantipongpipat, Samadi, Morgernstern, Singh, Vempala ‘18,’19]

LP extreme points • Consider the linear program where �×� , � min 𝑑 � 𝑦 Theorem: Every extreme point has at most non-zero variables. Therefore, there exists an optimal solution that has at most non-zero variables. Numerous generalization, applications. [Neil’s Talk tomorrow]

Semi-definite Programming • A symmetric matrix is positive semi-definite (definite), i.e. if • has non-negative(positive) eigenvalues. � for some �×� with orthogonal columns and �×� symmetric, • diagonal with positive entries. (r=n). • � � . ( � � for all for all ) • Let � �� • • min〈𝐷, 𝑌〉 � �

Main Result • � � • Theorem[Barvinok’95, Pataki’98]: Every extreme point of the above SDP has � �� rank at most where � � � • Corollary 1: SDP has an optimal solution with rank at most � � • Corollary 2: If m=2, then there is always a rank 1 optimal solution.

Proof • Suppose not! • Let be rank where We find such that and are feasible. Contradiction. � where �×� with orthonormal columns, �×� diagonal • with positive entries. � are feasible. �×� , s.t. � , • Search for � Want: � � �

Proof (Contd) • Claim: It is enough to ensure � • � • symmetric. Proof: • � � � � � � � • Eigenvalues of are same as eigenvalues of • But and therefore, if will ensure . � �� constraints overs � variables. But the above are � If , then there is always a non-trivial solution.

Main Result • � � • Theorem[Barvinok’95, Pataki’98]: Every extreme point of the above SDP has � �� rank at most where � � � • Corollary 1: SDP (I) has an optimal solution with rank at most � � • Corollary 2: If m=2, then there is always a rank 1 optimal solution.

Farkas Lemmas: Linear and Quadratic. � such that � � � Farkas Lemma: Suppose � � Then, � some non-negative � � � � � � �×� symmetric matrices such that � S-Lemma (Yakubovich ‘79). Suppose � Then for some Proof: Consider the SDP. Dual : Max � . But then objective is at least 0. This Observe that primal optimal is rank 1. Thus implies dual is at least 0.

SDP extreme points • Pataki, Gábor. "On the rank of extreme matrices in semidefinite programs and the multiplicity of optimal eigenvalues." Math of OR ‘98 . • Barvinok, Alexander. "Problems of distance geometry and convex properties of quadratic maps." Discrete & Computational Geometry ‘95 . • Applications. • S-lemma [Yakubovich’ 71] • Fair Dimensionality Reduction in Data Analysis [Tantipongpipat, Samadi, Morgernstern, Singh, Vempala ‘18,’19]

Dimensionality Reduction • Data is usually represented in high dimensions. • There are few relevant directions. • Dimensionality reduction leads to representation in relevant directions. • Also computationally useful for any data analysis or algorithms.

PCA (Principle Component Analysis) �×� of • Data data points. Want to reduce from to dimensions. • Minimize reconstruction error: � � �:�� is the Frobenius norm, sum of square of error. �

PCA (Principle Component Analysis) �×� of • Data data points. Want to reduce from to dimensions. • Minimize reconstruction error: � � �:�� is the Frobenius norm, sum of square of error. � • Easily solved by SVD (Singular Value Decomposition). The optimal solution has a form where is projection matrix on top singular vectors of .

PCA objective � = 𝑈𝑠(𝐸 � 𝐸) − m𝑏𝑦 � = min �∈𝒬 � 𝐸 � 𝐸 ⋅ 𝑄 �:�� 𝐸 − 𝑉 � min �∈𝒬 � 𝐸 − 𝐸𝑄 � 𝒬 � = {𝑄 ∈ 𝑆 �×� : 𝑄 𝑡𝑧𝑛𝑛𝑓𝑢𝑠𝑗𝑑, 𝑠𝑏𝑜𝑙 𝑄 = 𝑒, 𝑄 � = 𝑄}

Applications and History of PCA • Pearson’1901 and Hotelling’1931. • Standard tool in data analysis. • Widely used in sciences, humanities, finance, image recognition. [Sirovich, Kirby’87, Turk, Pentland’91] • Fossil Teeth Data: Kuehneotherium and Morganucodon Species. [Gill et al, Nature 2014] • Random Projection to lower dimensions. • Johnson-Lindenstrauss Lemma ‘1984: All distances are preserved up to a small error.

Unfairness of the PCA problem Data belongs to users of different groups, say images of men and women. PCA ensures average error in projection is small. Errors for two different groups are different. Standard PCA on face data LFW of Equalizing male and female weight male and female. before PCA

Fair PCA � � ×� for • Given data matrices and a projection matrix � �×� . � � � • � � � � � � � � • Given target dimension . • Fair PCA: Find a projection matrix of rank at most d that minimizes the maximum error. Fair PCA as rank constrained SDP. Fair PCA:= � �∈𝒬 � �∈[�] 𝒬 � = {𝑄 ∈ 𝑆 �×� : 𝑄 𝑡𝑧𝑛𝑛𝑓𝑢𝑠𝑗𝑑, 𝑠𝑏𝑜𝑙 𝑄 = 𝑒, 𝑄 � = 𝑄} � 𝐸 � − 𝐸 � � 𝐸 � ⋅ 𝑄 ∀𝑗 = 1, … , 𝑙 𝑨 ≥ 𝑈𝑠 𝐸 � 𝑠𝑏𝑜𝑙 𝑄 = 𝑒 0 ≼ 𝑄 ≼ 𝐽

Fair Dimensionality Reduction • More generally, we are given utility functions � that measure the � utility of each group. � • Moreover, we are given a function that combines these utilities. Fair DR �∈𝒬 � 𝑕 𝑣 � 𝑄 , 𝑣 � 𝑄 , … , 𝑣 � 𝑄 max 𝒬 � = {𝑄 ∈ 𝑆 �×� : 𝑄 𝑡𝑧𝑛𝑛𝑓𝑢𝑠𝑗𝑑, 𝑠𝑏𝑜𝑙 𝑄 = 𝑒, 𝑄 � = 𝑄} Fair PCA: Special case with 𝑣 � 𝑄 = −𝐹𝑠𝑠 𝐸 � , 𝑄 and 𝑕 . = min � − 𝐸 � − 𝐸 � 𝑄 � ∗ � 𝑀𝑝𝑡𝑡 � 𝑄 = 𝐸 � − 𝐸 � 𝑄 � � ∗ is the best rank 𝑒 projection for group 𝑗. where 𝑄 � • Loss for being part of the other groups.

Related Work • Rank Constrained SDPs are widely used. • Signal processing [Davies and Eldar’12, Ahmed and Romberg’15] • Distance Matrices: Localization sensors [So and Ye’07] , nuclear magnetic resonance spectroscopy [Singer’08] • Item Response Data, Recommendation Systems[Goldberg et al’93] • Machine Learning: Multi-task Learning [Obozinski, Taskar, Jordan’10] , Natural Language Processing [Blei’12] • Survey by [Davenport, Romberg’2016] • Work by Barvinok’95, Pataki’98 on characterizations of extreme points of SDPs. • Algorithmic work by [Burer, Monteiro’03]

Our Results � Fair PCA:= min �∈𝒬 � max �∈[�] 𝐹𝑠𝑠 𝐸 � , 𝑄 ≔ 𝐸 � − 𝐸 � 𝑄 � 𝒬 � = {𝑄 ∈ 𝑆 �×� : 𝑄 𝑡𝑧𝑛𝑛𝑓𝑢𝑠𝑗𝑑, 𝑠𝑏𝑜𝑙 𝑄 = 𝑒, 𝑄 � = 𝑄} • Theorem 1: The Fair PCA problem is polynomial time solvable for k=2. • “Integrality” of SDPs. • Theorem 2: The Fair PCA problem is polynomial time solvable for constant k and d. • Algorithmic theory of quadratic maps. [Grigoriev and Pasechnik ’05] • Problem is NP-hard for general k, d=1. • Results generalize to Fair Dimensionality reduction when � is linear and is concave. Fair DR �∈𝒬 � 𝑕 𝑣 � 𝑄 , 𝑣 � 𝑄 , … , 𝑣 � 𝑄 max 𝒬 � = {𝑄 ∈ 𝑆 �×� : 𝑄 𝑡𝑧𝑛𝑛𝑓𝑢𝑠𝑗𝑑, 𝑠𝑏𝑜𝑙 𝑄 = 𝑒, 𝑄 � = 𝑄}

� Our Results: Approximation Fair PCA:= min �∈𝒬 � max �∈[�] 𝐹𝑠𝑠 𝐸 � , 𝑄 ≔ 𝐸 � − 𝐸 � 𝑄 � 𝒬 � = {𝑄 ∈ 𝑆 �×� : 𝑄 𝑡𝑧𝑛𝑛𝑓𝑢𝑠𝑗𝑑, 𝑠𝑏𝑜𝑙 𝑄 = 𝑒, 𝑄 � = 𝑄} • Theorem 3: There is a polynomial time algorithm for the Fair PCA problem that returns a rank � � at most � whose objective is better than the optimum. � • Extreme Points of SDPs. • Theorem 4: There is a polynomial time algorithm for the Fair PCA problem that returns a rank at most whose objective is at most , � � � � 𝐸 where 𝜏 � 𝐶 is the 𝑗 �� largest singular value of B. where Δ ≔ max �⊆[�] ∑ 𝜏 � � ∑ 𝐸 � �∈� � �� • Iterative Rounding Framework for SDPs.

SDP extreme points Theorem 1: Every extreme point of the SDP-Relaxation has rank at most Thus the objective of the two programs are identical. Rank-Constrained SDP SDP-Relaxation min 𝐷 ⋅ 𝑌 min 𝐷 ⋅ 𝑌 𝐵 ⋅ 𝑌 ≤ 𝑐 𝐵 ⋅ 𝑌 ≤ 𝑐 𝑠𝑏𝑜𝑙 𝑌 ≤ 𝑒 𝑈𝑠𝑏𝑑𝑓 𝑌 ≤ 𝑒 0 ≼ 𝑌 ≼ 𝐽 0 ≼ 𝑌 ≼ 𝐽 Corollary: Fair PCA is polynomial time solvable for 2 groups. Related: Barvinok’95, Pataki’98. S-Lemma [Yakubovich’71].

Low rank SDP extreme points and Applications Mohit Singh Georgia - PowerPoint PPT Presentation

Low rank SDP extreme points and Applications Mohit Singh Georgia Tech SDP extreme points Pataki, Gbor. "On the rank of extreme matrices in semidefinite programs and the multiplicity of optimal eigenvalues." Math of OR 98 .

2 3 4 5 8 9 MINNEAPOLIS MILWAUKEE MSA RANK #16 MSA RANK #39 CHICAGO MSA RANK #3

Alternate Offers / Capabilities in Alternate Offers / Capabilities in SIP/SDP SIP/SDP

SDP and eigenvalue bounds for the graph partition problem Renata Sotirov and Edwin van Dam

http://www.stanford.edu/yyye Joint work with Anthony So and Jiawei Zhang SDP Rank Reduction

Parallel Numerical Algorithms Chapter 6 Matrix Models Section 6.2 Low Rank Approximation

SDP and Moray Council Welcome to the Supplier Development Programme SDP is a business initiative

SDP Capability Negotiation draft-ietf-mmusic-sdp-capability-negotiation-12.txt Flemming Andreasen

STUN, TURN, ICE, SDP, SIP, WebSockets ... STUN, TURN, ICE, SDP, SIP, WebSockets ... This Talk

School Distinctive Programme [SDP] 2016 P1 & P2 What is SDP? It is one of the key

The SDP Content Attribute draft-ietf-mmusic-sdp-media-content-02.txt Jani.Hautakorpi@ericsson.com

LP/SDP LP/SDP Algorithms Algorithms Claire Mathieu Brown University Many optimization

Connection-Oriented Media Transport in SDP draft-ietf-mmusic-sdp-comedia-01.txt David Yon

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

Extreme Heat Preparedness Objectives What is extreme heat ? How does it impact SF? What are the

2014: Extreme territories 2 2015: Extreme territories 3 2016: Extreme territories 4 2018:

1 SVD applications: rank, column, row, and null spaces Rank : the rank of a matrix is equal to:

Microeconomics 3200/4200: Part 1 P. Piacquadio p.g.piacquadio@econ.uio.no September 14, 2017

Lectures 67: Monotone Comparative Statics, with Applications to Producer Theory Alexander

Tensor Canonical Correlation Analysis and It Its applications Presenter: Yong LUO The work is

Characterization of Conditional Independence and Weak Realizations of Multivariate Gaussian

and Deep Reconstruction Dr. Uwe Kruger Department of Biomedical Engineering Jonsson Engineering

Empirical Study of a Two-Step Approach to Estimate Translation Quality J. Gonz alez-Rubio,

Regime for Exploration in India Anupama Sen Senior Research Fellow, Oxford Institute for Energy

Introduction to Java 1 / 10 Java Developed for home appliances - cross-platform VM a key

Low rank SDP extreme points and Applications Mohit Singh Georgia - PowerPoint PPT Presentation

Low rank SDP extreme points and Applications Mohit Singh Georgia Tech SDP extreme points Pataki, Gbor. "On the rank of extreme matrices in semidefinite programs and the multiplicity of optimal eigenvalues." Math of OR 98 .

2 3 4 5 8 9 MINNEAPOLIS MILWAUKEE MSA RANK #16 MSA RANK #39 CHICAGO MSA RANK #3

Alternate Offers / Capabilities in Alternate Offers / Capabilities in SIP/SDP SIP/SDP

SDP and eigenvalue bounds for the graph partition problem Renata Sotirov and Edwin van Dam

http://www.stanford.edu/yyye Joint work with Anthony So and Jiawei Zhang SDP Rank Reduction

Parallel Numerical Algorithms Chapter 6 Matrix Models Section 6.2 Low Rank Approximation

SDP and Moray Council Welcome to the Supplier Development Programme SDP is a business initiative

SDP Capability Negotiation draft-ietf-mmusic-sdp-capability-negotiation-12.txt Flemming Andreasen

STUN, TURN, ICE, SDP, SIP, WebSockets ... STUN, TURN, ICE, SDP, SIP, WebSockets ... This Talk

School Distinctive Programme [SDP] 2016 P1 &amp; P2 What is SDP? It is one of the key

The SDP Content Attribute draft-ietf-mmusic-sdp-media-content-02.txt Jani.Hautakorpi@ericsson.com

LP/SDP LP/SDP Algorithms Algorithms Claire Mathieu Brown University Many optimization

Connection-Oriented Media Transport in SDP draft-ietf-mmusic-sdp-comedia-01.txt David Yon

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

Extreme Heat Preparedness Objectives What is extreme heat ? How does it impact SF? What are the

2014: Extreme territories 2 2015: Extreme territories 3 2016: Extreme territories 4 2018:

1 SVD applications: rank, column, row, and null spaces Rank : the rank of a matrix is equal to:

Microeconomics 3200/4200: Part 1 P. Piacquadio p.g.piacquadio@econ.uio.no September 14, 2017

Lectures 67: Monotone Comparative Statics, with Applications to Producer Theory Alexander

Tensor Canonical Correlation Analysis and It Its applications Presenter: Yong LUO The work is

Characterization of Conditional Independence and Weak Realizations of Multivariate Gaussian

and Deep Reconstruction Dr. Uwe Kruger Department of Biomedical Engineering Jonsson Engineering

Empirical Study of a Two-Step Approach to Estimate Translation Quality J. Gonz alez-Rubio,

Regime for Exploration in India Anupama Sen Senior Research Fellow, Oxford Institute for Energy

Introduction to Java 1 / 10 Java Developed for home appliances - cross-platform VM a key

School Distinctive Programme [SDP] 2016 P1 & P2 What is SDP? It is one of the key