Random Local Exploration Techniques for Sublinear-Time Algorithms - PowerPoint PPT Presentation

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex ? To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 .41 .65 ? .60 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 .41 .65 ? .60 .11 .36 ? .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 .41 .65 ? .60 ? .11 .36 ? .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 .41 .65 ? .60 .11 .36 ? .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 .41 .65 ? .60 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 ? .41 .65 ? .60 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .79 .41 .65 ? .60 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .22 ? .79 .41 .65 ? .60 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .22 ? ? .79 .41 .65 ? .60 .27 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .22 ? .79 .41 .65 ? .60 .27 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .22 .79 .41 .65 ? .60 .27 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .22 .79 .41 .65 .60 .27 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I

Local MIS Simulation Nguyen, O. (2008) Main idea: select maximal independent set greedily consider vertices in random order Random order ≡ random numbers r ( v ) assigned to each vertex .43 .22 .79 .41 .65 .60 .27 .11 .36 .75 To check if v ∈ I recursively check if neighbors w s.t. r ( w ) < r ( v ) are in I v ∈ I ⇐ ⇒ none of them in I E[ #visited vertices ] and query complexity of order 2 O ( d )

Bounding Expected Query Complexity Pr[ a given path of length k is explored ] = 1 / ( k + 1)!

Bounding Expected Query Complexity Pr[ a given path of length k is explored ] = 1 / ( k + 1)! ( number of vertices at distance k ) ≤ d k

Bounding Expected Query Complexity Pr[ a given path of length k is explored ] = 1 / ( k + 1)! ( number of vertices at distance k ) ≤ d k E [ number of vertices explored at distance k ] ≤ d k / ( k + 1)!

Bounding Expected Query Complexity Pr[ a given path of length k is explored ] = 1 / ( k + 1)! ( number of vertices at distance k ) ≤ d k E [ number of vertices explored at distance k ] ≤ d k / ( k + 1)! E [ number of explored vertices ] ≤ � ∞ k =0 d k / ( k + 1)! ≤ e d /d

Bounding Expected Query Complexity Pr[ a given path of length k is explored ] = 1 / ( k + 1)! ( number of vertices at distance k ) ≤ d k E [ number of vertices explored at distance k ] ≤ d k / ( k + 1)! E [ number of explored vertices ] ≤ � ∞ k =0 d k / ( k + 1)! ≤ e d /d Expected query complexity = O ( d ) · e d /d = O ( e d )

Improvement Heuristic: Consider neighbors w of v in ascending order of r ( w ) Once you find w ∈ I , v �∈ I (i.e., don’t check other neighbors) .82 .42 .91 .55 ? .60 .36 .77

Improvement Heuristic: Consider neighbors w of v in ascending order of r ( w ) Once you find w ∈ I , v �∈ I (i.e., don’t check other neighbors) .82 .42 .91 .55 ? .60 .11 .36 .77

Improvement Heuristic: Consider neighbors w of v in ascending order of r ( w ) Once you find w ∈ I , v �∈ I (i.e., don’t check other neighbors) .82 .42 .91 .55 .60 .11 .36 .77

Improvement Heuristic: Consider neighbors w of v in ascending order of r ( w ) Once you find w ∈ I , v �∈ I (i.e., don’t check other neighbors) .82 .42 .91 .55 .60 .11 .36 .77 Yoshida, Yamamoto, Ito (STOC 2009): E permutations , start vertex [ #recursive calls ] ≤ 1 + m n

Improvement Heuristic: Consider neighbors w of v in ascending order of r ( w ) Once you find w ∈ I , v �∈ I (i.e., don’t check other neighbors) .82 .42 .91 .55 .60 .11 .36 .77 Yoshida, Yamamoto, Ito (STOC 2009): E permutations , start vertex [ #recursive calls ] ≤ 1 + m n Which gives: expected query complexity for random vertex = O ( d 2 )

Algorithm for Vertex Cover

Vertex Cover Goal: find smallest set S of nodes such that each edge has endpoint in S Classical 2 -approximation algorithm [Gavril & Yannakakis]: Greedily find a maximal matching M Output the set of nodes matched in M

Sublinear-Time Algorithm General Idea: construct oracle O that answers queries: Is e ∈ E in M ? for a fixed maximal matching M

Sublinear-Time Algorithm General Idea: construct oracle O that answers queries: Is e ∈ E in M ? for a fixed maximal matching M maximal matching ≡ maximal independent set in the line graph

Sublinear-Time Algorithm General Idea: construct oracle O that answers queries: Is e ∈ E in M ? for a fixed maximal matching M maximal matching ≡ maximal independent set in the line graph approximate the number of vertices matched in M up to ± ǫn by checking for O (1 /ǫ 2 ) vertices if they are matched #queries to O = ( #tested nodes ) · ( max-degree ) = O ( d/ǫ 2 )

Sublinear-Time Algorithm General Idea: construct oracle O that answers queries: Is e ∈ E in M ? for a fixed maximal matching M maximal matching ≡ maximal independent set in the line graph approximate the number of vertices matched in M up to ± ǫn by checking for O (1 /ǫ 2 ) vertices if they are matched #queries to O = ( #tested nodes ) · ( max-degree ) = O ( d/ǫ 2 ) This gives: VC − ǫn ≤ ( computed value ) ≤ 2 · VC + ǫn

Sublinear-Time Algorithm General Idea: construct oracle O that answers queries: Is e ∈ E in M ? for a fixed maximal matching M maximal matching ≡ maximal independent set in the line graph approximate the number of vertices matched in M up to ± ǫn by checking for O (1 /ǫ 2 ) vertices if they are matched #queries to O = ( #tested nodes ) · ( max-degree ) = O ( d/ǫ 2 ) This gives: VC − ǫn ≤ ( computed value ) ≤ 2 · VC + ǫn Running time: 2 O ( d ) /ǫ 2

VC − ǫn ≤ output ≤ 2 · VC + ǫn Parnas, Ron (2007): d O (log( d ) /ǫ 3 ) queries via simulation of local distributed algorithms Marko, Ron (2007): d O (log( d/ǫ )) queries via Luby’s algorithm Nguyen, O. (2008): 2 O ( d ) /ǫ 2 queries the algorithm and proof presented here Yoshida, Yamamoto, Ito (2009): O ( d 4 /ǫ 2 ) queries the Nguyen, O. algorithm + analysis of the heuristic O., Ron, Rosen, Rubinfeld (2012): ˜ O ( d/ǫ 3 ) queries further refinements of NO and YYI sampling from the neighbor sets near optimal: Ω( d ) lower bound due to Parnas, Ron (2007)

Better Approximation for Maximum Matching

Maximum Matching Goal: find a set of disjoint edges of maximum cardinality

Review of Properties Augmenting Path: a path that improves matching

Review of Properties Augmenting Path: a path that improves matching M = matching, M ∗ = maximum matching Fact: There are | M ∗ | − | M | disjoint augmenting paths for M

Review of Properties Augmenting Path: a path that improves matching M = matching, M ∗ = maximum matching Fact: There are | M ∗ | − | M | disjoint augmenting paths for M Fact: k No augmenting paths of length < 2 k + 1 ⇒ | M | ≥ k +1 | M ∗ |

Review of Properties Augmenting Path: a path that improves matching M = matching, M ∗ = maximum matching Fact: There are | M ∗ | − | M | disjoint augmenting paths for M Fact: k No augmenting paths of length < 2 k + 1 ⇒ | M | ≥ k +1 | M ∗ | To get (1 + ǫ ) -approximation, set k = ⌈ 1 /ǫ ⌉

Standard Algorithm Lemma [Hopcroft, Karp 1973]: M = matching with no augmenting paths of length < t P = maximal set of vertex-disjoint augmenting paths of length t for M M ′ = M with all paths in P applied Claim: M ′ has only augmenting paths of length > t

Standard Algorithm Lemma [Hopcroft, Karp 1973]: M = matching with no augmenting paths of length < t P = maximal set of vertex-disjoint augmenting paths of length t for M M ′ = M with all paths in P applied Claim: M ′ has only augmenting paths of length > t Algorithm: M := empty matching for i = 1 to k : find maximal set of disjoint augmenting paths of length 2 i − 1 apply all paths to M return M

Transformation Standard Algorithm: Eliminate Eliminate Eliminate Eliminate augmenting augmenting augmenting augmenting ∅ ⇒ ⇒ M 1 ⇒ ⇒ M 2 ⇒ ⇒ M 3 ⇒ ⇒ M 4 paths paths paths paths of length 1 of length 3 of length 5 of length 7 Constant−Time Algorithm: Oracle O 2 : Oracle O 3 : Oracle O 1 : Oracle O 4 : no no no no augmenting augmenting approximation augmenting augmenting sampling paths paths paths paths of length ≤ 3 of length ≤ 5 of length ≤ 1 of length ≤ 7 Oracle O i : provides query access to M i simulates applying to M i − 1 a maximal set of disjoint augmenting paths of length 2 i − 1

Transformation Sample graph considered by O 2 : O i ’s graph has degree d O ( i )

Query Complexity Can’t apply the previous approach! every query may disclose some information about the random numbers algorithm could use it to form a malicious query

Query Complexity Can’t apply the previous approach! every query may disclose some information about the random numbers algorithm could use it to form a malicious query Locality Lemma: for q queries, needs to visit at most q 2 · 2 O ( d 4 ) /δ vertices with probability 1 − δ

Query Complexity Can’t apply the previous approach! every query may disclose some information about the random numbers algorithm could use it to form a malicious query Locality Lemma: for q queries, needs to visit at most q 2 · 2 O ( d 4 ) /δ vertices with probability 1 − δ Query complexity: 2 d O (1 /ǫ ) queries for (1 , ǫn ) -approximation

Query Complexity Can’t apply the previous approach! every query may disclose some information about the random numbers algorithm could use it to form a malicious query Locality Lemma: for q queries, needs to visit at most q 2 · 2 O ( d 4 ) /δ vertices with probability 1 − δ Query complexity: 2 d O (1 /ǫ ) queries for (1 , ǫn ) -approximation Yoshida, Yamamoto, Ito (2009) Query complexity: d O (1 /ǫ 2 ) uniform on higher level ⇒ close to uniform on lower

Roadmap 1. Simulation of greedy algorithms 2. Partitioning oracles 3. Random walks

Hyperfinite Graphs (All graphs of degree O (1) ) ( ǫ, δ ) -partition ( ǫ, δ ) -hyperfinite graphs: can remove ǫ | V | edges and get components of size at most δ

Random Local Exploration Techniques for Sublinear-Time Algorithms - PowerPoint PPT Presentation

Random Local Exploration Techniques for Sublinear-Time Algorithms Krzysztof Onak IBM Research Sublinear-Time Algorithms BIG DATA Sublinear-Time Algorithms Sublinear-time algorithms: Fast answer based on inspecting a tiny fraction of the

Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 3: Sublinear in Time 2-1 Sublinear in

Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 2: Sublinear in Communication 2-1

L ECTURE 2 Last time Introduction Basic models for sublinear-time computation Simple

B669 Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 1: Sublinear in Space 2-1 The model

Sublinear Algorithms for ( + 1) Vertex Coloring Sepehr Assadi University of Pennsylvania

Sublinear Geometric Algorithms Sublinear Geometric Algorithms B. Chazelle, D. Liu, A. Magen B.

B669 Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 1: Sublinear in Space 2-1 The model

L ECTURE 6 Last time Limitations of sublinear-time algorithms Yaos Minimax Principle

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Sublinear Algorithms Lectures 1 and 2 Sofya Raskhodnikova Penn State University 1 Tentative

Distributed Directed SSSP in Sublinear Time Jason Li Carnegie Mellon University Joint work with

Sublinear Algorithms for Big Data Part 4: Random Topics Qin Zhang 1-1 Topic 3: Random sampling

A Sublinear Bipartiteness Tester for Bounded Degree Graphs Oded Goldreich Dana Ron

Sublinear Algorithms Lecture 5 Sofya Raskhodnikova Penn State University Thanks to Madhav Jha

Sublinear bounds for a quantitative Doignon-Bell-Scarf Theorem Stephen R. Chestnut Robert

Sublinear Algorithms Lecture 1 Sofya Raskhodnikova Boston University 1 Organizational Course

r t r r

Correlations between Parallel Patterns and Multi-core Benchmarks Vivek Kale IWMSE workshop May

Scalable performance analysis with Projections Sanjay Kale, http://charm.cs.illinois.edu Based

Michael Spillane President, Product & Categories Good morning, and thank you for joining

FaSST: Fast, Scalable, and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs

Storm: a fast transactional dataplane for remote data structures Stanko Novakovic Yizhou Shan

FaSST: Fast, Scalable, and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs

What is a Choreography? A choreography is a way to organize a multiparty web application in a

Random Local Exploration Techniques for Sublinear-Time Algorithms - PowerPoint PPT Presentation

Random Local Exploration Techniques for Sublinear-Time Algorithms Krzysztof Onak IBM Research Sublinear-Time Algorithms BIG DATA Sublinear-Time Algorithms Sublinear-time algorithms: Fast answer based on inspecting a tiny fraction of the

Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 3: Sublinear in Time 2-1 Sublinear in

Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 2: Sublinear in Communication 2-1

L ECTURE 2 Last time Introduction Basic models for sublinear-time computation Simple

B669 Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 1: Sublinear in Space 2-1 The model

Sublinear Algorithms for ( + 1) Vertex Coloring Sepehr Assadi University of Pennsylvania

Sublinear Geometric Algorithms Sublinear Geometric Algorithms B. Chazelle, D. Liu, A. Magen B.

B669 Sublinear Algorithms for Big Data Qin Zhang 1-1 Part 1: Sublinear in Space 2-1 The model

L ECTURE 6 Last time Limitations of sublinear-time algorithms Yaos Minimax Principle

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Sublinear Algorithms Lectures 1 and 2 Sofya Raskhodnikova Penn State University 1 Tentative

Distributed Directed SSSP in Sublinear Time Jason Li Carnegie Mellon University Joint work with

Sublinear Algorithms for Big Data Part 4: Random Topics Qin Zhang 1-1 Topic 3: Random sampling

A Sublinear Bipartiteness Tester for Bounded Degree Graphs Oded Goldreich Dana Ron

Sublinear Algorithms Lecture 5 Sofya Raskhodnikova Penn State University Thanks to Madhav Jha

Sublinear bounds for a quantitative Doignon-Bell-Scarf Theorem Stephen R. Chestnut Robert

Sublinear Algorithms Lecture 1 Sofya Raskhodnikova Boston University 1 Organizational Course

r t r r

Correlations between Parallel Patterns and Multi-core Benchmarks Vivek Kale IWMSE workshop May

Scalable performance analysis with Projections Sanjay Kale, http://charm.cs.illinois.edu Based

Michael Spillane President, Product &amp; Categories Good morning, and thank you for joining

FaSST: Fast, Scalable, and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs

Storm: a fast transactional dataplane for remote data structures Stanko Novakovic Yizhou Shan

FaSST: Fast, Scalable, and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs

What is a Choreography? A choreography is a way to organize a multiparty web application in a

Michael Spillane President, Product & Categories Good morning, and thank you for joining