SLIDE 1

Data Streams & Communication Complexity

Lecture 3: Communication Complexity and Lower Bounds
Andrew McGregor, UMass Amherst


SLIDES 2–6

Basic Communication Complexity

◮ Three friends Alice, Bob, and Charlie each have some information x, y, z and Charlie wants to compute some function P(x, y, z).

[Diagram: Alice holds x, Bob holds y, Charlie holds z; Alice sends m1 to Bob, Bob sends m2 to Charlie, and Charlie produces the output.]

◮ To help Charlie, Alice sends a message m1 to Bob, and then Bob sends a message m2 to Charlie.

◮ Question: How large must the total length of the messages be for Charlie to evaluate P(x, y, z) correctly?

◮ Deterministic: m1(x), m2(m1, y), out(m2, z) = P(x, y, z) (a toy sketch follows this slide)

◮ Random: m1(x, r), m2(m1, y, r), out(m2, z, r) where r is a public random string. Require Pr[out(m2, z, r) = P(x, y, z)] ≥ 9/10.
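
To make the deterministic model concrete, here is a minimal Python sketch. The predicate P(x, y, z) = parity of all the bits is an assumed toy example (not from the lecture, and deliberately easy: two 1-bit messages suffice).

```python
# One-way model: Alice -> Bob -> Charlie. Each message depends only on
# what its sender has seen. Toy predicate: parity of all input bits.

def P(x, y, z):
    return (sum(x) + sum(y) + sum(z)) % 2

def m1(x):                  # Alice's 1-bit message: parity of her bits
    return sum(x) % 2

def m2(msg1, y):            # Bob's 1-bit message: running parity
    return (msg1 + sum(y)) % 2

def out(msg2, z):           # Charlie's output
    return (msg2 + sum(z)) % 2

x, y, z = [0, 1, 1], [1, 0, 0], [1, 1, 0]
assert out(m2(m1(x), y), z) == P(x, y, z)   # 2 bits of communication total
```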

SLIDES 7–13

Stream Algorithms Yield Communication Protocols

◮ Let Q be some stream problem. Suppose there’s a reduction x → S1, y → S2, z → S3 such that knowing Q(S1 ◦ S2 ◦ S3) solves P(x, y, z).

[Diagram: as before, but Alice, Bob, and Charlie now hold the streams S1, S2, S3.]

◮ An s-bit stream algorithm A for Q yields a 2s-bit protocol for P (sketch below): Alice runs A on S1; sends the memory state to Bob; Bob instantiates A with this state and runs it on S2; sends the state to Charlie, who finishes running A on S3 and infers P(x, y, z) from Q(S1 ◦ S2 ◦ S3).
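
A minimal sketch of this simulation, assuming a toy stream algorithm (a running sum standing in for an arbitrary s-bit algorithm A): the only thing that ever crosses between players is the algorithm's memory state.

```python
# Simulating a stream algorithm as a communication protocol: Alice runs the
# algorithm on S1 and sends its memory state; Bob resumes from that state on
# S2; Charlie resumes again on S3 and reads off the answer.

class StreamAlg:
    """Toy s-bit algorithm: its entire memory is a running sum."""
    def __init__(self, state=0):
        self.state = state
    def process(self, stream):
        for item in stream:
            self.state += item      # stand-in for an arbitrary update rule
        return self.state

S1, S2, S3 = [1, 2], [3], [4, 5]
msg1 = StreamAlg().process(S1)          # Alice's message = state after S1
msg2 = StreamAlg(msg1).process(S2)      # Bob resumes, sends state after S2
answer = StreamAlg(msg2).process(S3)    # Charlie finishes the run
assert answer == sum(S1 + S2 + S3)
```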

SLIDES 14–15

Communication Lower Bounds Imply Stream Lower Bounds

◮ Had there been t players, the s-bit stream algorithm for Q would have led to a (t − 1)s-bit protocol for P.

◮ Hence, a lower bound of L on the communication required for P implies that s ≥ L/(t − 1) bits of space are required to solve Q.

SLIDE 16

Outline of Lecture

Classic Problems and Reductions
Information Statistics Approach
Hamming Approximation

SLIDE 17

Outline: Classic Problems and Reductions

SLIDES 18–21

Indexing

◮ Consider a binary string x ∈ {0, 1}^n and j ∈ [n], e.g., x = (0, 1, 0, 1, 1, 0) and j = 3, and define Index(x, j) = xj.

◮ Suppose Alice knows x and Bob knows j.

◮ How many bits need to be sent by Alice for Bob to determine Index(x, j) with probability 9/10? Ω(n)

SLIDES 22–25

Application: Median Finding

◮ Thm: Any algorithm that returns the exact median of a length-(2n − 1) stream requires Ω(n) memory.

◮ Reduction from Index (sketch below): On input x ∈ {0, 1}^n, Alice generates S1 = {2i + xi : i ∈ [n]}. On input j ∈ [n], Bob generates S2 = {n − j copies of 0 and j − 1 copies of 2n + 2}. E.g., x = (0, 1, 0, 1, 1, 0) → {2, 5, 6, 9, 11, 12} and j = 3 → {0, 0, 0, 14, 14}.

◮ Then median(S1 ∪ S2) = 2j + xj and this determines Index(x, j).

◮ An s-space algorithm implies an s-bit protocol, so s = Ω(n) by the communication complexity of Indexing.
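
The reduction can be verified exhaustively for small n; a sketch (the stream construction is exactly the one on this slide, and `median_low` just picks the middle element of the odd-length stream):

```python
# Verify median(S1 ∪ S2) = 2j + x_j for every x in {0,1}^6 and j in [6].
import itertools
import statistics

def alice_stream(x):
    n = len(x)
    return [2 * i + x[i - 1] for i in range(1, n + 1)]   # S1 = {2i + x_i}

def bob_stream(n, j):
    return [0] * (n - j) + [2 * n + 2] * (j - 1)         # S2

n = 6
for x in itertools.product([0, 1], repeat=n):
    for j in range(1, n + 1):
        S = alice_stream(x) + bob_stream(n, j)           # length 2n - 1
        assert statistics.median_low(S) == 2 * j + x[j - 1]
print("reduction verified: the median reveals Index(x, j)")
```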

SLIDES 26–29

Multi-Party Set-Disjointness

◮ Consider a t × n matrix where each column has weight 0, 1, or t, e.g.,

C = [ 0 0 0 1 0
      1 0 0 1 0
      0 1 0 1 1
      0 0 0 1 0 ]

and let Disjt(C) = 1 if there is an all-1s column and 0 otherwise.

◮ Consider t players where Pi knows the i-th row of C.

◮ How many bits need to be communicated between the players to determine Disjt(C)? Ω(n/t)

SLIDES 30–36

Application: Frequency Moments

◮ Thm: A 2-approximation algorithm for Fk needs Ω(n^(1−2/k)) space.

◮ Reduction from Set-Disjointness (sketch below): The i-th player generates the set Si = {j : Cij = 1}; e.g., the matrix C above yields the stream {4, 1, 4, 5, 2, 4, 4}.

◮ If all columns have weight 0 or 1: Fk(S) ≤ n

◮ If there’s a column of weight t: Fk(S) ≥ t^k

◮ If t > 2^(1/k) n^(1/k), then a 2-approximation of Fk(S) distinguishes the two cases.

◮ An s-space 2-approximation implies an s(t − 1)-bit protocol, so s = Ω(n/t²) = Ω(n^(1−2/k)) by the communication complexity of Set-Disjointness.
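
A sketch of the two cases on an assumed tiny instance (t = 4, n = 5, k = 2, so t^k = 16 exceeds 2n = 10 and the cases are separated):

```python
# F_k(S) = sum of f_j^k over item frequencies f_j. If every column of C has
# weight 0 or 1 the stream items are distinct; an all-ones column makes one
# item appear t times.
from collections import Counter

def F(k, stream):
    return sum(f ** k for f in Counter(stream).values())

t, n, k = 4, 5, 2

disjoint_stream = [1, 2, 4, 5]            # all column weights 0 or 1
assert F(k, disjoint_stream) <= n         # F_k(S) <= n

intersect_stream = [4] * t + [1, 2, 5]    # column 4 has weight t
assert F(k, intersect_stream) >= t ** k   # F_k(S) >= t^k
```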

SLIDES 37–40

Hamming Approximation

◮ Consider two binary vectors x, y ∈ {0, 1}^n, e.g., x = (0, 1, 0, 1, 1, 0) and y = (1, 1, 0, 0, 1, 1), and define the Hamming distance ∆(x, y) = |{i : xi ≠ yi}|.

◮ Suppose Alice knows x and Bob knows y.

◮ How many bits need to be communicated to estimate ∆(x, y) up to an additive √n error? Ω(n) bits.

SLIDES 41–46

Application: Distinct Elements

◮ Thm: A (1 + ε)-approximation algorithm for F0 needs Ω(1/ε²) space.

◮ Reduction from Hamming Approximation: On input x, y ∈ {0, 1}^n, the players form S1 = {j : xj = 1} and S2 = {j : yj = 1}, e.g., x = (0, 1, 0, 1, 1, 0) and y = (1, 1, 0, 0, 1, 1) → {2, 4, 5, 1, 2, 5, 6}.

◮ Note that 2F0(S) = |x| + |y| + ∆(x, y): an index where xj = yj = 1 is counted twice by |x| + |y|, while an index where exactly one of xj, yj equals 1 is counted once by |x| + |y| and once by ∆(x, y). (A sanity check follows this slide.)

◮ We may assume |x| and |y| are known to Bob. Hence, a (1 + ε)-approximation of F0 yields an additive approximation of ∆(x, y) to within ε(|x| + |y| + ∆(x, y))/2 ≤ nε.

◮ This is less than √n if ε < 1/√n.

◮ An s-space (1 + ε)-approximation implies an s-bit protocol, so s = Ω(n) = Ω(1/ε²) by the communication complexity of approximating Hamming distance.
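
The identity 2F0(S) = |x| + |y| + ∆(x, y) is easy to sanity-check; a sketch:

```python
# Check 2*F0 = |x| + |y| + Δ(x, y) on random binary vectors.
import random

n = 20
for _ in range(1000):
    x = [random.randint(0, 1) for _ in range(n)]
    y = [random.randint(0, 1) for _ in range(n)]
    S1 = {j for j in range(n) if x[j] == 1}
    S2 = {j for j in range(n) if y[j] == 1}
    F0 = len(S1 | S2)                       # distinct elements of S1 ∘ S2
    delta = sum(a != b for a, b in zip(x, y))
    assert 2 * F0 == sum(x) + sum(y) + delta
```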

SLIDE 47

Outline: Information Statistics Approach

SLIDES 48–50

Information Statistics Approach

◮ The information statistics approach is based on analyzing the “information revealed” about the input by the messages.

◮ It is useful for proving bounds on complicated functions in terms of simpler problems, e.g., proving a bound on Disjt(M) = ⋁j∈[n] Andt(M1,j, . . . , Mt,j) by first establishing a bound on Andt.

◮ We’ll first give some definitions and then run through an example.

SLIDES 51–59

Information Theory Definitions

◮ Let X and Y be random variables.

◮ Entropy: H(X) := Σi −P[X = i] lg P[X = i]

◮ Conditional Entropy: H(X|Y) := Ey∼Y[H(X|Y = y)] ≤ H(X)

◮ Mutual Information: I(X : Y) = H(X) − H(X|Y) (a numeric sketch follows this slide)

[Venn diagram: H(X) and H(Y) drawn as overlapping regions; the overlap is I(X : Y) and the remainders are H(X|Y) and H(Y|X).]

◮ Useful Facts:
◮ If X takes at most 2^ℓ values, then H(X) ≤ ℓ.
◮ If X and Y are independent, then I(XY : Z) ≥ I(X : Z) + I(Y : Z).
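
A small sketch that evaluates these definitions on an assumed toy joint distribution of (X, Y), just to make H, H(X|Y), and I(X : Y) concrete:

```python
from math import log2

p = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}  # joint p(x, y)

def H(dist):                      # entropy of a distribution given as a dict
    return -sum(q * log2(q) for q in dist.values() if q > 0)

pX = {x: sum(q for (a, _), q in p.items() if a == x) for x in (0, 1)}
pY = {y: sum(q for (_, b), q in p.items() if b == y) for y in (0, 1)}

# H(X|Y) = E_{y~Y}[ H(X | Y = y) ]
HXgY = sum(pY[y] * H({x: p[(x, y)] / pY[y] for x in (0, 1)}) for y in (0, 1))
I = H(pX) - HXgY                  # mutual information I(X : Y)

print(f"H(X) = {H(pX):.3f}, H(X|Y) = {HXgY:.3f}, I(X:Y) = {I:.3f}")
assert HXgY <= H(pX) + 1e-12      # conditioning never increases entropy
```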

SLIDES 60–62

Information Cost

◮ Suppose you have a protocol Π for a two-party communication problem P in which Alice and Bob have random inputs X and Y.

◮ Let M be the (random) message sent by Alice and define:

cost(Π) = max |M|   and   icost(Π) = I(M : X)

◮ Note that icost(Π) = I(M : X) ≤ H(M) ≤ cost(Π).

SLIDES 63–70

Example: Indexing

◮ We’ll prove a lower bound on the information cost of Index, where X ∈R {0, 1}^n, in terms of a simpler problem, “Echo.”

◮ Echo: Alice has a single bit B ∈R {0, 1} and Bob wants to output B with probability at least 1 − δ.

◮ A protocol ΠIndex for Index yields a protocol ΠEcho,i for Echo where i is hard-coded into the protocol (sketch below):

1. Given B, Alice picks Xj ∈R {0, 1} for each j ≠ i and generates X = (X1, X2, . . . , Xi−1, B, Xi+1, . . . , Xn).
2. She sends the message M she’d have sent in ΠIndex if she’d had X.
3. Bob receives M and outputs the value he’d have returned in ΠIndex had his input been i.

◮ Note 1: If ΠIndex is correct with probability 1 − δ, then ΠEcho,i is also correct with probability 1 − δ.

◮ Note 2: The message in ΠIndex on input X ∈R {0, 1}^n is distributed identically to the message in ΠEcho,i on input B ∈R {0, 1}.
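
A sketch of the embedding, with a placeholder pair alice_msg/bob_out standing in for an arbitrary ΠIndex (here, the trivial protocol that sends all of x, used only to exercise the construction):

```python
import random

def alice_msg(x):          # hypothetical message function of Π_Index
    return tuple(x)

def bob_out(msg, j):       # hypothetical output function of Π_Index
    return msg[j]

def echo_protocol(B, i, n):
    X = [random.randint(0, 1) for _ in range(n)]
    X[i] = B               # Alice embeds her bit B at the hard-coded index i
    M = alice_msg(X)       # ...and sends Π_Index's message on X
    return bob_out(M, i)   # Bob answers as if his Index input were i

assert echo_protocol(1, 3, 8) == 1   # correct whenever Π_Index is correct
```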

SLIDES 71–76

Relating Information Cost of Index and Echo

◮ Since X1, X2, . . . , Xn are independent:

cost(ΠIndex) ≥ icost(ΠIndex)
             = I(X1 X2 . . . Xn : M)
             ≥ I(X1 : M) + I(X2 : M) + . . . + I(Xn : M)
             = icost(ΠEcho,1) + icost(ΠEcho,2) + . . . + icost(ΠEcho,n)

◮ By Fano’s inequality, solving Echo with probability > 1 − δ requires icost(ΠEcho,i) = H(B) − H(B|M) ≥ 1 − H2(δ), where H2(p) = −p lg p − (1 − p) lg(1 − p).

◮ Hence, cost(ΠIndex) ≥ (1 − H2(δ))n. (This bound is evaluated numerically below.)
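
A sketch evaluating the resulting bound (1 − H2(δ))n for a few error rates δ:

```python
from math import log2

def H2(p):                                   # binary entropy
    return -p * log2(p) - (1 - p) * log2(1 - p)

n = 1000
for delta in (0.1, 0.25, 0.4):
    print(f"delta = {delta}: cost >= {(1 - H2(delta)) * n:.0f} bits")
# e.g. delta = 0.1 gives 1 - H2(0.1) ≈ 0.531, a lower bound of about 0.53·n.
```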

SLIDES 77–81

Outline for Disjt Lower Bound

◮ Express Disjt in terms of Andt, where Andt(x1, . . . , xt) = ∧i xi:

Disjt(C) = ⋁j∈[n] Andt(C1,j, . . . , Ct,j)

◮ Define a random input C as follows: for each column j, pick Dj ∈R [t] and set CDj,j ∈R {0, 1}; all other entries are 0.

◮ Let M = (M1, . . . , Mt−1) be the messages sent in a t-party protocol and define the information cost of a protocol as icost(Π|D) = I(C : M|D) where D = (D1, . . . , Dn).

◮ A protocol for Disjt yields n different protocols ΠAndt,i for Andt:

icost(ΠDisjt|D) ≥ Σi∈[n] icost(ΠAndt,i|D)

◮ The result follows by showing icost(ΠAndt,i|D) = Ω(1/t).

SLIDE 82

Outline: Hamming Approximation

SLIDE 83

Hamming Approximation Lower Bound

Some communication results can be proved via a reduction from other communication results.

Theorem. If Alice and Bob have x, y ∈ {0, 1}^n and Bob wants to determine ∆(x, y) up to ±√n with probability 9/10, then Alice must send Ω(n) bits.

SLIDES 84–90

Hamming Approximation Lower Bound

◮ Reduction from the Index problem: Alice knows z ∈ {0, 1}^t and Bob knows j ∈ [t]. Let’s assume |z| = t/2 with t/2 odd (so r·z below is a sum of an odd number of ±1s and is never 0).

◮ Alice and Bob pick r ∈R {−1, 1}^t using public random bits.

◮ Alice computes sign(r·z) and Bob computes sign(rj).

◮ Lemma (simulated below): For some constant c > 0,

P[sign(r·z) = sign(rj)] = 1/2          if zj = 0
                        = 1/2 + c/√t   if zj = 1

◮ Repeat n = 25t/c² times, with a fresh r each time, to construct xi = I[sign(r·z) = +] and yi = I[sign(rj) = +].

◮ Note that zj = 0 ⇒ E[∆(x, y)] = n/2 and zj = 1 ⇒ E[∆(x, y)] = n/2 − 5√n, and by Chernoff bounds P[|∆(x, y) − E[∆(x, y)]| ≥ 2√n] < 1/10.

◮ Hence, a ±√n approximation of ∆(x, y) determines zj with probability > 9/10.
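
A Monte Carlo sketch of the lemma; t = 26 is an assumed value (so t/2 = 13 is odd and sign(r·z) is always defined), and j is taken to be the first coordinate:

```python
import random

def agree_rate(t, zj, trials=100_000):
    z = [0] * t
    for idx in random.sample(range(1, t), t // 2 - zj):
        z[idx] = 1                       # |z| = t/2 once z[0] is set
    z[0] = zj                            # coordinate j = 0 carries z_j
    hits = 0
    for _ in range(trials):
        r = [random.choice((-1, 1)) for _ in range(t)]
        dot = sum(ri * zi for ri, zi in zip(r, z))   # r·z, never 0 here
        hits += (dot > 0) == (r[0] > 0)  # sign(r·z) vs sign(r_j)
    return hits / trials

t = 26
print("z_j = 0:", agree_rate(t, 0))      # ≈ 1/2
print("z_j = 1:", agree_rate(t, 1))      # ≈ 1/2 + c/√t (≈ 0.61 for t = 26)
```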

SLIDES 91–98

Proof of Lemma

Claim. Let A be the event A = {sign(r·z) = rj}. For some constant c > 0,

P[A] = 1/2          if zj = 0
     = 1/2 + c/√t   if zj = 1

◮ If zj = 0: sign(r·z) and rj are independent, so P[A] = 1/2.

◮ If zj = 1: Let s = r·z − rj, the sum of an even number (ℓ = t/2 − 1) of independent ±1 values. Then

P[A] = P[A | s = 0] P[s = 0] + P[A | s ≠ 0] P[s ≠ 0]

◮ P[s = 0] = (ℓ choose ℓ/2)/2^ℓ = 2c/√t for some constant c > 0.

◮ P[A | s = 0] = 1 since s = 0 ⇒ r·z = rj ⇒ A.

◮ P[A | s ≠ 0] = 1/2 since s ≠ 0 ⇒ s ∈ {. . . , −4, −2, 2, 4, . . .}; as |s| ≥ 2 and rj = ±1, sign(r·z) = sign(s), which is independent of rj.

◮ So P[A] = P[s = 0] + P[s ≠ 0]/2 = 1/2 + P[s = 0]/2 = 1/2 + c/√t. (Computed exactly below.)
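
The decomposition can also be evaluated exactly; a sketch, with c read off as c = √t · P[s = 0]/2 (t = 26 is an assumed value with t/2 odd):

```python
from math import comb, sqrt

t = 26
ell = t // 2 - 1                          # even, since t/2 is odd
p_s0 = comb(ell, ell // 2) / 2 ** ell     # P[s = 0] = C(ell, ell/2) / 2^ell
pA = 1.0 * p_s0 + 0.5 * (1 - p_s0)        # P[A|s=0] = 1, P[A|s≠0] = 1/2
c = sqrt(t) * p_s0 / 2
print(f"P[s=0] = {p_s0:.4f}, P[A] = {pA:.4f} = 1/2 + {c:.4f}/sqrt(t)")
assert abs(pA - (0.5 + c / sqrt(t))) < 1e-12
```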