Sketching and Streaming Matrix Norms David Woodruff IBM Almaden - PowerPoint PPT Presentation

Sketching and Streaming Matrix Norms David Woodruff IBM Almaden Based on joint works with Yi Li and Huy Nguyen

Turnstile Streaming Model � Underlying n-dimensional vector x initialized to 0 n � Long stream of updates x i ← x i + Δ i for Δ i in {-1,1} � At end of the stream, x is promised to be in the set {-M, -M+1, …, M-1, M} n for some bound M ≤ poly(n) � Output an approximation to f(x) whp � Goal: use as little space (in bits) as possible

Example Problem: Norms p = Ʃ i=1 n |x i | p � Suppose you want |x| p p ≤ Z ≤ (1+ Ɛ ) |x| p � Want Z for which (1- Ɛ ) |x| p p � p = 1 is Manhattan norm � Distances between distributions, network monitoring � p = 2 is (squared) Euclidean norm � Geometry, linear algebra � p = ∞ is max norm:|x| p = max x � � � denial of service attacks, etc.

Space Complexity of Norms � For 1 ≤ p ≤ 2 and constant approximation, can get log n space �� (� � For p > 2, the space is Θ � ) � Lower bound: k-party disjointness � k vectors x � , … , x � ∈ 0,1 � which have disjoint supports or uniquely intersect � x = ∑ x � presented in the stream in the following order: x � , … , x � � � x = (0, 1, 0, 0, 1, 0, 0, 0, 0, 1, 1, 1, 0, 1, 0, 0), or � x = (0, 1, 0, 0, 1, 0, k, 0, 0, 1, 1, 1, 0, 1, 0, 0) � Set k = 2 n �/� . Disjointness Ω( � � ) communication bound gives Ω( � � � ) stream memory bound

Matrix Norms � We understand vector norms very well � Recent interest in estimating matrix norms � Stream of updates to an n x n matrix A � A initialized to 0 � � , see updates A i,j ← A i,j + Δ i,j for Δ i,j in {-1,1} � Entries of A bounded in absolute value by poly(n) � Every matrix A = U ΣV & in its singular value decomposition, where U, V have orthonormal columns and Σ is a non-negative diagonal matrix � = ∑ σ � � where σ � = Σ �,� � Schatten p-norm A � �

Matrix Norms � = ∑ σ � � where σ � = Σ �,� � Schatten p-norm A � � � p = 0 is the rank � p = 1 is the trace norm ∑ σ � � ) � p = 2 is the Frobenius norm ∑ A �,( �,( , � � p = ∞ is the operator norm sup � � = ∑ σ � � up to a constant factor? � What is the complexity of approximating A � � � For one value of p, this is easy… � p = 2 norm can be estimated in log n bits of space � What about other values of p?

Matrix Norm Results � Thoughts? Conjectures? � An important special case: suppose A is sparse, i.e., has O(1) non-zero entries per row and per column �(n) upper bound for every 0 ≤ p ≤ ∞ � There is an O � Anything better for p ≠ 2?

What about even integers p? [LW16] �� (n � Show an O 0 ) upper bound for every even integer p � Matches the lower bound for vectors � The even integer p-norms are the only norms with non-trivial space!

Upper Bound Intuition for p = 4 ) = ∑ 1 = AA & < A � , A ( > ) � A 1 , where A � are the rows of A �,( 2 ) ≤ max ) ⋅ A ( ) � < A � , A ( > ) ≤ A � ) < A � , A � > ) �,( ) = 1 for all i, then � If A � ) (1) < A � , A ( > ) ≤ 1 for all i and j < A � , A ( > ) ≥ ϵ ∑ < A � , A � > ) (2) if ∑ ≥ ϵn �8( � � n terms < A � , A ( > ) for i ≠ j suffices � Implies uniformly sampling O < A � , A ( > ) for estimating ∑ �8(

1 < A � , A ( > ) ≤ 1 for all i,j 1 ;� 1 1 2 < < A � , A ( > ) ≥ ϵn 1 �8( ;� 1 These conditions imply uniformly �(n) entries works 1 sampling O � n entries, we sample O �(√n ) rows in their entirety (can To sample O � approximately do this in a stream) �(√n ) space given O(1) non-zero Can store all sampled rows using O � entries per row � Estimate (2) using all pairwise inner products in the sampled rows (some slight dependence issues) ) � When A � ) ≠ 1 for all i, instead sample rows proportional to A � )

Beyond p = 4 � For even integers p, let q = p/2. Then, � = ∑ � A � ∏ < A � ? , A � ?@A > , where i CF� = i � �D � A ,� � ,…,� E D � (B�,…,C �� (n � Sample O 0 ) rows in their entirety proportional to their squared norm � Approximate above sum by summing over all q-tuples from your sample � exists! � For non-even integers p and p = 0, no such expression for A �

��

… � 2n nodes � Create a t-clique for each hyperedge in Bob’s input � Add ‘tentacles’ according to Alice’s input x � Determine whether all cliques have an even or odd number of tentacles � Maximum matching size different by a constant factor in the cases H�I � If clique size is t, then with r tentacles, block matching size is r + ⌊ ) ⌋ � Matching size is 3n/4 if r are all even, Matching size is 3n/4-n/(2t) if r are all odd

Connection with Matrices � Consider the Tutte matrix A of the graph � A �,( = 0 if {i,j} is not an edge � A �,( = y �,( if {i,j} is an edge and i < j � A �,( = −y �,( if {i,j} is an edge and j < i � rank(A), under random assignment to the y �,( , is twice the maximum matching size, with high probability � Ω(n �� A � M ) lower bound for (1 + Θ H ) -approximation

Distributional BHH Problem � Distributional BHH [VY11]: Alice get a uniformly random x in 0,1 � , and Bob an independent, uniformly random perfect t-hyper-matching M on the n coordinates and a binary string w in 0,1 �/H . Promise: Mx ⊕ w = 1 �/H or Mx ⊕ w = 0 �/H � Let t be even. Distributional BHH problem [BS15]: � Replace x with new input x ← (x, x R ) � For i-th set S = x � A , … , x � M ∈ M, � if w � = 0, include x � A , … , x � M and x � A , … x � M in new input M � if w � = 1 , include {x � A , x � � , … , x � M } and {x � A , x � � , x � U , … , x � M } in the new input M � Correctness is preserved, and Mx = 1 �/H or Mx = 0 �/H � In graph, can partition t-cliques into pairs: in each pair number of tentacles is q and t-q, for a binomially distributed odd (even) integer q if Mx = 1 �/H (if Mx = 0 �/H )

Distributional BHH Problem � Consider Tutte matrix A with diagonal 0 and indeterminates equal to 1 � After permuting rows and columns, A is block-diagonal � Each block is (2t) x (2t) and corresponds to a clique with tentacles � t = 4 and the three possible blocks for an even number of tentacles: 1 0 0 0 0 1 1 1 0 1 1 1 1 0 0 0 0 1 1 1 0 0 0 0 0 1 0 0 -1 0 1 1 0 1 0 0 -1 0 1 1 -1 0 1 1 0 0 0 0 -1 -1 0 1 0 0 1 0 -1 -1 0 1 0 0 0 0 B ) = B X = -1 -1 0 1 B 1 = 0 0 0 0 -1 -1 -1 0 -1 -1 -1 0 0 0 0 1 0 0 0 0 -1 -1 -1 0 0 0 0 0 0 0 0 0 -1 0 0 0 0 0 0 0 0 0 0 0 -1 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 0 0 0 0 0 0 0 0 0 0 -1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Distribution of Singular Values � = ∑ � � A � B � YZ[\�] ^ �� , � ≠ E C∼b H [ B C � � ] � Suppose E C∼a(H) B C � � E(t) is distribution on even integers q with Pr[q = i] = {t choose i}/ 2 H�� O(t) is distribution on odd integers q with Pr[q = i] = {t choose i}/ 2 H�� Since blocks B are of constant size, and pairs of blocks are independent, by Hoeffding � differs by a constant factor if Mx = 1 �/H or if Mx = 0 �/H bounds A � � ≠ E C∼b H [ B C � � ] ! � Suffices to show E C∼a(H) B C �

n ��e f Lower Bound for p not an Even Integer � ≠ E C∼b H [ B C � � ] � Just need to show E C∼a(H) B C � � Change the definition of blocks B C to make analysis tractable � Singular values are either 1 or roots of a quadratic equation depending on q � Analysis uses power series expansion of the roots and hypergeometric polynomials

Conclusions and Future Directions � Nearly tight bounds for sparse matrices for matrix norms for every p � For dense matrices, for p = 0 there is an n )�e f lower bound [AKL17] � Nothing better known for other values of p for dense matrices � When the streaming algorithm is a linear sketch: � Not clear if these lower bounds imply lower bounds for streams (though would be surprising if not) � n )�1/� bound for every p ≥ 2 , tight for even integers [LNW14,LW16] � For p not an even integer, conjecture an n )�e f lower bound

Sketching and Streaming Matrix Norms David Woodruff IBM Almaden - PowerPoint PPT Presentation

Sketching and Streaming Matrix Norms David Woodruff IBM Almaden Based on joint works with Yi Li and Huy Nguyen Turnstile Streaming Model Underlying n-dimensional vector x initialized to 0 n Long stream of updates x i x i + i for

Iterative Sketching Agile Arizona 2017 Agenda Who am I? The Power of Sketching When

Chapter 7 Norms and Distance Measures Chapter 7 Vector Norms Norms are functions which measure

Free Form Sketching System for Free Form Sketching System for Product Design Using Virtual

Curve Sketching Michael Freeze MAT 151 UNC Wilmington Summer 2013 1 / 10 Section 5.4 :: Curve

New matrix norms for structured matrix estimation Jean-Philippe Vert Optimization and Statistical

Norms Norm is a measure of size of a vector or matrix. Typical vector norms: Let v = [ v 1 , v

[3] The Matrix What is a matrix? Traditional answer Neo: What is the Matrix? Trinity: The answer

Matrix Multiplication Matrix Multiplication via Matrix-Vector Mult Defn. If matrix A is m n

Enhancing compliance Why do people comply? Adriaan Denkers FIOD-ECD Why do people obey the law?

Training Presentation Web Streaming Introduction What is Web Streaming? Who is Streaming?

20 STREAMING AGREEMENT 19 16 OCTOBER US$145 million Streaming Agreement US$145 million

2 Workloa d? 3 OLTP 4 OLAP OLTP 4 OLAP OLTP Streaming 4 Scan- OLAP OLTP Streaming

Introductory Matrix Operations Matrix Entries Defn. For matrix A , notation a ij means the en-

Building an IoT Platform with Matrix matthew@matrix.org http://www.matrix.org What is Matrix?

Liberating Communication with Matrix matthew@matrix.org http://www.matrix.org What is Matrix?

Gov 2000: 10. Multiple Regression in Matrix Form Matthew Blackwell Fall 2016 1 / 64 1. Matrix

Managerial Capital and Productivity: Evidence from a Training Program in the Bangladeshi Garment

THE SECOND YEAR AT EMORY PROGRAM: ASSESSING A RESIDENTIAL SECOND-YEAR INITIATIVE Ben Perlman

ROBERT W. WOODRUFF DEANS ACHIEVEMENT SCHOLARS INFORMATION SESSION 2019 AGENDA 1 OVERVIEW OF

Nevada Department of Wildlife Predator Management Plan Fiscal Year 2021 Summary on Plans and

Renewables Integration Study Next Steps Mark Rothleder Director, Market Analysis and Development

How much is water entering the Apalachicola River through Jim Woodruff Dam influenced by climate

Hy Hydr drolo logic ic M Modeling deling ABSI CAB 01/08/2020 Dr. Steve Leitman Florida

Numerical Linear Algebra in the Streaming Model David Woodruff IBM Almaden Data Streams A