Sofya

�� Sofya Raskhodnikova Penn State University ��

�� !��!� �� "� ��#��$��%��

��

�� Input: a list of n numbers x 1 , x 2 ,..., x n • Question: Is the list sorted? Requires reading entire list: � (n) time • Approximate version: Is the list sorted or � -far from sorted? (An � fraction of x i ’s have to be changed to make it sorted.) [Ergün Kannan Kumar Rubinfeld Viswanathan 98, Fischer 01]: O((log n)/ � ) time � (log n) queries • Attempts: 1. Test: Pick a random i and reject if x i > x i+1 . Fails on: 1 1 1 1 1 1 1 0 0 0 0 0 0 0 � 1/2-far from sorted 2. Test: Pick random i < j and reject if x i > x j . Fails on: 1 0 2 1 3 2 4 3 5 4 6 5 7 6 � 1/2-far from sorted �

�� Idea: Associate positions in the list with vertices of the directed line. � � �� Construct a graph (2-spanner) ≤ n log n edges • by adding a few “shortcut” edges ( i, j ) for i < j • where each pair of vertices is connected by a path of length at most 2 �

�� Test [Dodis Goldreich Lehman Raskhodnikova Ron Samorodnitsky 99] Pick a random edge ( x i ,x j ) from the 2-spanner and reject if x i > x j . 1 2 5 4 3 6 7 5 4 3 � � � �� Analysis: • Call an edge ( x i ,x j ) violated if x i > x j , and good otherwise. • If x i is an endpoint of a bad edge, call it bad . Otherwise, call it good . Claim 1. All good numbers x i are sorted. Proof: Consider any two good numbers, x i and x j . They are connected by a path of (at most) two good edges ( x i ,x k ), ( x k ,x j ). � x i ≤ x k and x k ≤ x j � x i ≤ x j �

�� Test [Dodis Goldreich Lehman Raskhodnikova Ron Samorodnitsky 99] Pick a random edge ( x i ,x j ) from the 2-spanner and reject if x i > x j . 1 2 5 4 3 6 7 5 4 3 � � � �� Analysis: • Call an edge ( x i ,x j ) violated if x i > x j , and good otherwise. • If x i is an endpoint of a bad edge, call it bad . Otherwise, call it good . Claim 1. All good numbers x i are sorted. Claim 2. An � -far list violates �� /(2 log n) fraction of edges in 2-spanner. Proof: If a list is � -far from sorted, it has � � n bad numbers. (Claim 1) � 2-TC-spanner has � � n/2 violated edges out of � n log n �

�� Test [Dodis Goldreich Lehman Raskhodnikova Ron Samorodnitsky 99] Pick a random edge ( x i ,x j ) from the 2-spanner and reject if x i > x j . 1 2 5 4 3 6 7 5 4 3 � � � �� Analysis: • Call an edge ( x i ,x j ) violated if x i > x j , and good otherwise. Claim 2. An � -far list violates �� /(2 log n) fraction of edges in 2-spanner. By Witness Lemma, it suffices to sample (4 log n )/ � edges from 2-spanner. Algorithm Sample (4 log n)/ � edges ( x i ,x j ) from the 2-spanner and reject if x i > x j . Guarantee: All sorted lists are accepted. All lists that are � -far from sorted are rejected with probability � 2/3. Time: O((log n)/ � ) �

�� !�� • Binary-Search-Based Test worked only for testing if a sequence is strictly increasing. – There is a simple reduction from testing strict sortedness to testing non-strict sortedness. • Spanner-based test is nonadaptive: queries can be determined in advance, before seeing answers to previous queries. – Binary-Search-Based Test can be made nonadaptive. �

��" #��! • A list of n numbers x 1 , x 2 ,..., x n is Lipschitz if the numbers do not change too quickly: �� for all � . 1 2 2 1 2 3 2 • The spanner-based test for sortedness can test the Lipschitz property in �� time. • It applies to a more general class of properties. ��

��

$��%��!�&'�� Input: a string � ∈ 0,� � 0 0 0 1 … 0 1 0 0 Goal: Estimate the fraction of 1’s in � (like in polls) It suffices to sample � = � ⁄ � � positions and output the average to get the fraction of 1’s ±� (i.e., additive error � ) with probability �� 2/3 Hoeffding Bound Let Y � , … , Y : be independently distributed random variables in [0,1] and ! ≥ δ � 2e ��4 5 �! . let Y = ∑ Y � (sample sum). Then Pr Y � E Y �"� ! Y � = value of sample � . Then E[Y] = ∑ E[Y � ] = � ⋅ (fraction of 1’s in � ) �"� Pr (sample average) � � fracti�n��f��′s�in�� ≥ � = Pr Y � E Y ≥ �� 2e ��4 5 �! = 26 �� < ��3 substitute � = � ⁄ � � � = � ⁄ � � Apply Hoeffding Bound with < = �� < = ��

��'��(�� [Chazelle Rubinfeld Trevisan] Input: a graph = = �>, ?� on n vertices • in adjacency lists representation (a list of neighbors for each vertex) • maximum degree d Exact Answer: � (dn) time Additive approximation: # of CC ±εn with probability �� 2/3 Time: @ � @ • Known: � A 5 �� A , � A 5 @ � �� • Today: � A B ⋅ �� A � �� !��! �"�� "#!��$��$%%��&��#%�%��#��%'%��()%'�*+'%��#��%��%��,%��#��%��((%��((��

Sofya - PowerPoint PPT Presentation

Sofya Raskhodnikova Penn State University

Graph Analysis with Node Differential Privacy Node Differential Privacy Sofya Sofya

Sublinear Algorithms Lecture 1 Sofya Raskhodnikova Boston University 1 Organizational Course

Sublinear Algorithms Lecture 5 Sofya Raskhodnikova Penn State University Thanks to Madhav Jha

Monotonicity Testing Yevgeniy Dodis, Oded Goldreich, Eric Lehman, Sofya Raskhodnikova, Dana Ron

Smooth Sensitivity and Sampling Sofya Raskhodnikova Penn State University Joint work with Kobbi

Techniques for Private Data Analysis Sofya Raskhodnikova Penn State University Based on joint

Sublinear Algorithms Lecture 26 Sofya Raskhodnikova Penn State University Thanks to Madhav Jha

Sublinear Algorithms Lecture 6 Sofya Raskhodnikova Penn State University Thanks to Madhav Jha

Sublinear Algorithms Lecture 3 Sofya Raskhodnikova Penn State University 1 Graph Properties

in Property Testing and Local List Decoding Sofya Raskhodnikova Boston University Joint work

-Testing Sofya Raskhodnikova Penn State University, visiting Boston University and

Sublinear Algorithms Lectures 1 and 2 Sofya Raskhodnikova Penn State University 1 Tentative

Randomness in Computing L ECTURE 1 Randomness in Computing Course information Verifying

Randomness in Computing L ECTURE 15 Last time Poisson approximation Application: max load

Matching with the simplest semiorder preference relations: stability and Pareto-efficiency Sofya

Approximat e k-MSTs and k-St einer Trees via t he Primal-Dual Met hod and Lagrangean Relaxat

A Prins's algorithm repeatedly adds the minimum one endpoint in T weight edge w PRIM G YE E

Minimum Spanning Tree spanning tree of minimum total weight e.g., connect all the

Triangula)ons and MST

Impromptu Updating of MST and ST in a Distributed Dynamic Graph Valerie King, University of

L ECTURE 3 Last time Properties of lists and functions. Testing if a list is

An Efficient Characterization of Submodular Spanning Tree Games Zhuan Khye Koh Laura Sanit` a

Code Documentation http://cs.mst.edu Important! Its for you Its for others