Algorithms for Big Data (IX)

Chihao Zhang

Shanghai Jiao Tong University

Nov. 15, 2019


Review

Last week, we saw the communication model. In this model, Alice and Bob collaborate to compute some function f(x, y). Alice has x and Bob has y; the complexity is measured by the number of bits communicated between them. We focus on the special one-way communication model.

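As a minimal sketch (the interface and names are ours, not the lecture's): a one-way protocol is just a pair of functions, Alice's message map and Bob's output map, and its cost is the length of Alice's message.

```python
# A sketch of the one-way model: Alice sends one message, Bob must output
# f(x, y). The cost is the length of Alice's message in bits.
def run_one_way(alice_msg, bob_out, x, y):
    msg = alice_msg(x)                 # the only communication, Alice -> Bob
    return bob_out(msg, y), len(msg)   # (protocol output, bits sent)

# Trivial protocol for EQ: Alice sends x verbatim (n bits), Bob compares.
out, bits = run_one_way(lambda x: x, lambda m, y: int(m == y), "0110", "0110")
assert (out, bits) == (1, 4)
```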

Randomness

We allow randomness in our communication protocols. In this course, we consider protocols using public coins. In this model, we assume there exists some random source in the environment that both Alice and Bob can see. Our lower bounds apply to protocols using public coins, and hence also apply to protocols using private coins.


Problems

For x ∈ {0, 1}ⁿ, we sometimes view it as the indicator vector of a subset of [n], namely S(x) = {i ∈ [n] : xᵢ = 1}.

EQ(x, y) ≜ 1[x = y]

▶ We showed by a counting argument that any deterministic protocol computing EQ needs n bits of one-way communication.
▶ There exists a randomized protocol using O(log n) bits of communication.

DISJ(x, y) ≜ 1[S(x) ∩ S(y) = ∅]

▶ We showed that any randomized protocol requires Ω(log n) bits of communication.

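For intuition, here is a sketch of a public-coin protocol for EQ (our illustration, not the lecture's construction): with shared randomness, k parity bits give one-sided error 2⁻ᵏ; the O(log n) figure above can be achieved even with private coins via fingerprinting.

```python
import random

def eq_protocol(x, y, k=20, seed=0):
    """Public-coin one-way protocol for EQ with one-sided error 2^-k.
    Both parties see the shared random vectors r_1, ..., r_k; Alice sends
    only the k parity bits <x, r_j> mod 2."""
    n = len(x)
    rng = random.Random(seed)  # stands in for the shared public coins
    rs = [[rng.randint(0, 1) for _ in range(n)] for _ in range(k)]
    alice_msg = [sum(xi & ri for xi, ri in zip(x, r)) % 2 for r in rs]
    bob_bits  = [sum(yi & ri for yi, ri in zip(y, r)) % 2 for r in rs]
    return int(alice_msg == bob_bits)  # if x != y, each bit agrees w.p. 1/2

x = [1, 0, 1, 1]
assert eq_protocol(x, x) == 1                  # never errs when x = y
assert eq_protocol(x, [1, 0, 1, 0]) in (0, 1)  # wrong only w.p. 2^-20
```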

Yao’s Lemma

The main tool to prove lower bounds for randomized protocols is Yao's lemma.

Lemma

If there exists some distribution D over {0, 1}ᵃ × {0, 1}ᵇ such that any deterministic one-way communication protocol P with

Pr(x,y)∼D [P is wrong on (x, y)] ≤ ε

costs at least k bits, then any randomized one-way protocol with error at most ε on any input also costs at least k bits of one-way communication.

We remark that "costs at least k bits" for a randomized protocol refers to the worst input with the worst random bits.


Proof of Yao’s Lemma

Let Q be a randomized protocol that costs less than k bits in the worst case. We can first toss all coins and then run the protocol based on the outcomes. Therefore, we can view Q as a distribution over t deterministic protocols P₁, …, Pₜ. By the definition of the cost of a randomized protocol, each Pᵢ costs less than k bits. Thus, by the assumption, there exists a distribution D such that ∀i ∈ [t], Pr(x,y)∼D [Pᵢ is wrong on (x, y)] > ε. This is sufficient to imply that for some (x, y), Pr [Q is wrong on (x, y)] > ε. We remark that the converse of Yao's lemma is also correct.

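The last implication is an averaging argument; spelled out (with qᵢ denoting the probability that Q runs Pᵢ):

```latex
% Averaging step in the proof of Yao's lemma; q_i = Pr[Q runs P_i].
\Pr_{(x,y)\sim D,\; Q}\bigl[\,Q \text{ is wrong on } (x,y)\,\bigr]
  \;=\; \sum_{i=1}^{t} q_i \,\Pr_{(x,y)\sim D}\bigl[\,P_i \text{ is wrong on } (x,y)\,\bigr]
  \;>\; \varepsilon .
% Hence some fixed (x, y) has Pr_Q[Q wrong on (x, y)] > eps, contradicting
% that Q errs with probability at most eps on every input.
```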

Reduction

We prove lower bounds for streaming problems from communication complexity via reductions. The argument looks like this: suppose we have an efficient streaming algorithm for problem A; then we can design an efficient protocol to solve communication problem B. For example, we can

▶ view the input x (and y) as a stream of elements in S(x) (and S(y));
▶ let Alice solve the streaming problem on S(x) and send the snapshot of the current memory to Bob;
▶ let Bob continue to solve the streaming problem on S(x) ⊕ S(y).

We successfully proved a lower bound for computing F∞ using the above strategy, via the complexity of DISJ. A generic sketch of this state handoff appears below.

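A minimal sketch of the reduction template, assuming a hypothetical one-pass interface with process() and result() (our names, not from the lecture); Alice's message is simply a serialized snapshot of the algorithm's memory.

```python
import pickle

def protocol_from_streaming(algo_factory, alice_stream, bob_stream):
    """Turn a one-pass streaming algorithm into a one-way protocol:
    Alice's message is a snapshot of the algorithm's memory."""
    algo = algo_factory()
    for item in alice_stream:          # Alice runs the algorithm on her half
        algo.process(item)
    message = pickle.dumps(algo)       # snapshot; len(message) bounds the cost
    algo = pickle.loads(message)       # Bob restores the state...
    for item in bob_stream:            # ...and continues on his half
        algo.process(item)
    return algo.result()

# Toy instance of the interface: exact distinct-element counting.
class Distinct:
    def __init__(self): self.seen = set()
    def process(self, item): self.seen.add(item)
    def result(self): return len(self.seen)

assert protocol_from_streaming(Distinct, [1, 3, 5], [3, 7]) == 4
```

In particular, a streaming algorithm using M bits of memory yields a one-way protocol sending about M bits, so a communication lower bound transfers to a memory lower bound.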

Lower Bounds for st-connectivity

We are given a graph G = (V, E) where V = {v₁, …, vₙ}. Determine whether s = v₁ and t = v₂ are connected. We now prove that any deterministic streaming algorithm for this problem requires Ω(n) bits of memory, via communication complexity.

Given an instance (x, y) of DISJ, Alice forms her edge stream {{s, vᵢ} : xᵢ = 1 ∧ i > 2} and Bob forms his edge stream {{t, vⱼ} : yⱼ = 1 ∧ j > 2}. Alice first sends x₁ and x₂ to Bob. If these two bits already determine DISJ(x, y), then output so. Otherwise DISJ(x, y) = 1 iff s and t are not connected.

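To make the reduction concrete, here is an illustrative form of the construction (our sketch; it simulates the combined graph directly rather than running a streaming algorithm on the edge streams):

```python
def disj_via_connectivity(x, y):
    """Decide DISJ from st-connectivity of the combined graph. Vertices are
    s = v1, t = v2, and v3, ..., vn; x, y are 0-indexed lists for x_1..x_n."""
    # Alice's edge stream {{s, v_i} : x_i = 1 and i > 2}:
    alice = [(1, i + 1) for i in range(2, len(x)) if x[i] == 1]
    # Bob's edge stream {{t, v_j} : y_j = 1 and j > 2}:
    bob = [(2, j + 1) for j in range(2, len(y)) if y[j] == 1]
    # Alice's two extra bits x_1, x_2 settle indices 1 and 2 directly:
    if (x[0] and y[0]) or (x[1] and y[1]):
        return 0                            # sets intersect, so DISJ = 0
    # Otherwise s and t are connected iff some v_i (i > 2) touches both.
    connected = bool({v for _, v in alice} & {v for _, v in bob})
    return 0 if connected else 1            # DISJ = 1 iff s, t not connected

assert disj_via_connectivity([0, 0, 1, 0], [0, 0, 1, 1]) == 0
assert disj_via_connectivity([0, 0, 1, 0], [0, 0, 0, 1]) == 1
```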

Lower Bounds for F0

Recall that with the BJKST algorithm, we can estimate F₀ within (1 ± ε) using O((1/ε²) log n) bits of memory. We now show that the dependency on ε, i.e., the factor 1/ε², is tight.

We will show in class that when ε = 1/√n, any protocol approximating F₀ within (1 ± ε) requires at least Ω(n) bits. Can we prove this using the strategy we demonstrated in the previous example, via a reduction from DISJ?


We shall reduce from the problem of computing the Hamming distance between two strings. Alice and Bob want to compute dH(x, y), which is exactly |S(x) \ S(y)| + |S(y) \ S(x)|. Since the F₀ of the stream S(x) ⊕ S(y) is |S(x) ∪ S(y)|, we have

|S(x) \ S(y)| = F₀ − |S(y)|,  |S(y) \ S(x)| = F₀ − |S(x)|.

This implies that dH(x, y) = 2F₀ − |S(x)| − |S(y)|. A multiplicative error of (1 ± 1/√n) in F₀ gives at most O(√n) additive error in dH(x, y). We only need to prove that O(√n) additive error for dH(x, y) is hard.

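A quick numerical check of the identity (our illustration), with F₀ computed as the number of distinct elements in the combined stream:

```python
import random

def check_identity(trials=1000, n=30, seed=1):
    """Check d_H(x, y) = 2*F0 - |S(x)| - |S(y)| on random bit strings,
    where F0 counts distinct elements in the stream S(x) followed by S(y)."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.randint(0, 1) for _ in range(n)]
        y = [rng.randint(0, 1) for _ in range(n)]
        Sx = {i for i, b in enumerate(x) if b}
        Sy = {i for i, b in enumerate(y) if b}
        dH = sum(xi != yi for xi, yi in zip(x, y))
        F0 = len(Sx | Sy)                      # distinct elements seen
        assert dH == 2 * F0 - len(Sx) - len(Sy)

check_identity()
```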

Gap-Hamming

We call the following problem Gap-Hamming:

Gap-Hamc(x, y) = 1 if dH(x, y) ≤ n/2 − c√n; 0 if dH(x, y) ≥ n/2 + c√n; undefined otherwise.

We will show that solving Gap-Hamming needs Ω(n) bits of one-way communication even if randomness is permitted.

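As a sketch of the promise structure (illustrative only): inside the gap the function is undefined, and a protocol may answer arbitrarily there.

```python
from math import sqrt

def gap_ham(x, y, c=1.0):
    """Gap-Ham_c as a promise problem: None means the answer is undefined."""
    n = len(x)
    d = sum(a != b for a, b in zip(x, y))    # Hamming distance d_H(x, y)
    if d <= n / 2 - c * sqrt(n):
        return 1
    if d >= n / 2 + c * sqrt(n):
        return 0
    return None                              # inside the gap: no promise

assert gap_ham([0] * 100, [0] * 100) == 1    # d_H = 0, below n/2 - c*sqrt(n)
assert gap_ham([0] * 100, [1] * 100) == 0    # d_H = n, above n/2 + c*sqrt(n)
```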

We will reduce from INDEX. Recall that we assume Alice holds a string x ∈ {0, 1}ⁿ and Bob holds an index i ∈ [n]; INDEX(x, i) = xᵢ. Alice and Bob generate an instance (x′, y′) of Gap-Hamming without any communication, and then try to deduce the value of xᵢ from it… This seems to be impossible… Let us see the power of randomness. We generate (x′, y′) bit by bit.


The idea is that, for each j, we want to generate x′ⱼ, y′ⱼ in a way such that the event x′ⱼ = y′ⱼ is correlated with the value of xᵢ.

Assume the public random string is r (of length n), which can be seen by both Alice and Bob. Recall that i is the index that Bob holds. He always generates rᵢ as the current bit of y′ (denoted by b). Alice generates 1 if dH(x, r) < n/2 and 0 if dH(x, r) > n/2 (this bit is denoted by a).

The key observation is that:

▶ if xᵢ = 1, then a = b is more likely to happen;
▶ if xᵢ = 0, then a ≠ b is more likely to happen.

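A small Monte Carlo sketch of one round of this generation (our illustration; with n odd, dH(x, r) = n/2 cannot occur, so Alice's bit is always defined):

```python
import random

def one_round(x, i, rng):
    """One round of the public-coin generation: both parties see r."""
    n = len(x)
    r = [rng.randint(0, 1) for _ in range(n)]
    d = sum(xj != rj for xj, rj in zip(x, r))
    a = 1 if d < n / 2 else 0    # Alice's bit (n odd, so d != n/2)
    b = r[i]                     # Bob's bit
    return a, b

def agreement_rate(x, i, rounds=200_000, seed=7):
    rng = random.Random(seed)
    return sum(a == b for a, b in (one_round(x, i, rng)
                                   for _ in range(rounds))) / rounds

n = 25                           # odd, so Alice's bit is always defined
rng = random.Random(0)
x = [rng.randint(0, 1) for _ in range(n)]
i = next(j for j in range(n) if x[j] == 1)   # pick an index with x_i = 1
print(agreement_rate(x, i))      # clearly above 1/2, by about c/(2*sqrt(n))
```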

We assume n is odd. Suppose Alice generates a and Bob generates b. Conditioning on the random bits r₋ᵢ, we define two events E₁ and E₂.

Event E₁: the value of a is already determined by r₋ᵢ; then Pr [a = b | E₁] = 1/2.

Event E₂: dH(x₋ᵢ, r₋ᵢ) = (n−1)/2, so the value of a is determined by whether rᵢ = xᵢ. Conditional on E₂, if xᵢ = 1, then a = b; if xᵢ = 0, then a ≠ b. Therefore

Pr [a = b] = Pr [E₁] · Pr [a = b | E₁] + Pr [E₂] · Pr [a = b | E₂] = 1/2 + Pr [E₂]/2 if xᵢ = 1, and 1/2 − Pr [E₂]/2 if xᵢ = 0.


The probability of E₂ is C(n−1, (n−1)/2) · 2^(1−n) = c/√n for some constant c, by Stirling's formula (n! ∼ √(2πn) · (n/e)ⁿ). Therefore, we have

Pr [a = b] = 1/2 + c/(2√n) if xᵢ = 1, and 1/2 − c/(2√n) if xᵢ = 0.

Using the Chernoff bound, we can generate Θ(n) such bits and use a protocol for Gap-Hamming to solve INDEX, which requires Ω(n) bits of one-way communication.

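As a numerical sanity check of the Stirling estimate (our illustration): Pr[E₂] = C(n−1, (n−1)/2) · 2^(1−n) ≈ √(2/(π(n−1))), so the ratio below tends to 1.

```python
from math import comb, pi, sqrt

def pr_E2(n):
    """Exact Pr[E2] = C(n-1, (n-1)/2) / 2^(n-1), for odd n."""
    assert n % 2 == 1
    return comb(n - 1, (n - 1) // 2) / 2 ** (n - 1)  # big-int division

# Stirling gives Pr[E2] ~ sqrt(2 / (pi * (n - 1))); the ratio tends to 1.
for n in (11, 101, 1001, 10001):
    print(n, pr_E2(n), pr_E2(n) / sqrt(2 / (pi * (n - 1))))
```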