Large Deviaons and Exponenal Random Graphs Yufei Zhao MIT May - PowerPoint PPT Presentation

Large Devia*ons and Exponen*al Random Graphs Yufei Zhao MIT May 2018

Universality Problem-dependent Central limit theorem: Large deviations ! − # $ % < ' < ! + # ) % ' − ! ≫ % Key questions: • What is the probability of seeing large deviation? (often exponentially small) • What does a typical conditioned instance look like? • How to model/estimate/sample?

Warm up: sum of independent random variables Let ! = # $ + # & + ⋯ + # ( # ) ’s are i.i.d. random variables with finite variance *+,* • Central Limit Theorem: -./01 * → Normal as 9 → ∞ • Large deviation theory (Cramér’s theorem): ℙ ! ≥ 9= ≈ ? +(@ A where B(=) is the rate function , which depends on the distribution of the # ) ’s A $+A e.g., if # ) ~Bernoulli(K) , then B = = = log M + 1 − = log $+M

Sums of dependent random variables E.g., ! = # $ % , $ ' , … , $ ) $ % , $ ' … i.i.d. Bernoulli random variables f – a low degree polynomial • Moments calculation: *[! , ] often easy to compute • Central limit theorem: follows with enough control on moments • Large deviations: ???

The upper tail problem Let X be the number of triangles in the Erdős–Rényi random graph G ( n , p ) ( n vertices, every pair is an edge with probability p independently) !" = $ 3 & ' Central Limit Theorem (Ruciński ’88): X is asymptotially normal, i.e., " − !" → Normal, as $ → ∞, provided $& → ∞, $ 1 − & → ∞ Var " Problem: Estimate ℙ " ≥ 1 + = !" (fixed = > 0)

X = # triangles in G ( n , p ). ℙ % ≥ 1 + ) *% = ? Random Structures & Algorithms 2002 Order of log ℙ % ≥ 1 + ) *% Janson, Oleszkiewicz, Rucinski ’04 independently determined by Bollobás ’81, ’85 Janson, Luczak, Rucinski ’02, ’04 DeMarco & Kahn ’11 Vu ’01 and Kim & Vu ’04 Chatterjee ’11 ChaIerjee & Dey ’10

What can “cause” a random graph to have too many triangles? replica symmetry • Overall increase in edge density • Some extra edges forming a clique • Some some number of vertices forming a hub connecting to symmetry everything else breaking • … ! ", $

Summary of what we now know/believe X = # triangles in !(#, %) Large deviation: ' ≥ 1 + + ,' (constant + ) • Sparse setting: % → 0 (not too quickly) as # → ∞ • If + > 27/8 , plant a clique • If + < 27/8 , plant a hub • Dense setting: constant p • Some range of + : replica symmetry (uniform density boost) • Outside of this range: symmetry breaking (precise structure unknown)

How to compute large devia2ons 1. Prove a large deviation principle (LDP) that reduces the problem to a variational problem (maximization/minimization problem modeling the “most likely cause”) 2. Solve this variational problem

Review of large deviations Fixed 0 < p < q < 1. X ~ Binomial( n , p ). P ( X ≥ nq ) = ?? log P ( X ≥ nq ) = − ( I p ( q ) + o (1)) n as n → ∞ “cost of tilting” Relative entropy (KL divergence): p + (1 − x ) log 1 − x I p ( x ) := x log x 1 − p p 1

Triangles in G ( n , p ) For each pair (", $) of vertices • Tilt its probability to some & "$ ≥ ( • Pay ) * (& +, ) cost in log probability. Objective: minimize relative entropy cost min ∑ 12+3,24 ) * & +, 4 6 & 6 Constraint: enough triangles ∑ 12+3,3524 & +, & +5 & ,5 ≥ 4 6 & 6 ) This actually works! The minimum is asymptotically − log ℙ(< ≥ Chatterjee—Varadhan ’11 dense setting: p constant Chatterjee—Dembo ’16 sparse setting: p ≥ n −1/42 log n p ≥ n −1/18 log n Eldan ’17+ improved:

Another interpreta,on By Gibbs variational principle, a conditional probability distribution is given by the entropy-maximizing probability distribution subject to the conditions. Large deviation principle (whenever it holds): For random graphs, we can approximate this distribution by an entropy-maximizing product measure (independent edges)

Graphon variational problem !(), *) = !(*, )) • A graphon is a symmetric measurable function !: 0,1 & → 0,1 . Discrete variational problem Graphon varia=onal problem [Cha@erjee—Varadhan] Minimize ∑ ./012/3 4 5 6 02 Minimize ∫ >,. ? 4 5 ! ), * @)@* Subject to Subject to 6 02 6 08 6 28 ≥ : 3 6 < 7 >,. B ! ), * ! ), C ! *, C @)@*@C ≥ 6 < A ./01218/3 • Due to compactness of the space of graphons under cut metric (Lovasz— Szegedy), the above minimum is always attained • In general we do NOT know how to solve the variational problem

What do the minimizing graphons represent? The set of relative entropy minimizing graphons represents the most likely graphs conditioned on the rare event. Replica symmetry: If minimized (uniquely) by the constant graphon, then the conditioned random graph is close to Erdős–Rényi (in cut distance).

Sparse setting G ( n , p ) ! = ! # → 0 as & → ∞ , perhaps slowly

Order of the rate Theorem (DeMarco—Kahn ’11, Cha@erjee ’11) . Let X denote the number of triangles in G ( n , p ). Fix . > 0. For ' ≳ (log &)/& , ℙ ; ≥ 1 + . <; = ' 2 3 (/ 1 4 1 ) Proof of lower bound: clique Force a clique on ! = Θ $ (&') vertices = $ &' * ' * triangles Obtain ) / ≥ 1 + . * 0 1 = ' 2 3 (/ 1 4 1 ) Occurs with probability ' G ( n , p )

Theorem (Chatterjee—Dembo/Eldan + Lubetzky—Z.) . Let X denote the number of triangles in G ( n , p ). Improve this! Fix ! > 0. With " → 0 and and " ≥ & '(/(* log & , 678 ( 9: ;/< , ( >: ? ; @ ; ℙ / ≥ 1 + ! 2/ = " (45 ( Proof of lower bound: complete to rest ✓ n ◆ ✓ n ◆ clique ∼ δ p 3 ∼ δ p 3 of the graph 3 3 3 δ p 2 n 1 K δ 1 / 3 pn extra triangles δ 1 / 3 pn extra triangles With probability: With probability: G ( n , p ) G ( n , p ) p (1+ o (1)) 1 p (1+ o (1)) 1 3 δ p 2 n 2 2 δ 2 / 3 p 2 n 2 Preferred for δ < 27/8 Preferred for δ > 27/8

Theorem (Cha6erjee—Dembo/Eldan + Lubetzky—Z.) . Let X denote the number of triangles in G ( n , p ). Similar results for Fix ! > 0. With " → 0 and and " ≥ & '(/(* log & , the number of K t 678 ( 9: ;/< , ( >: ? ; @ ; ℙ / ≥ 1 + ! 2/ = " (45 ( [Bhattacharya, Ganguly, Lubetzky, Z. ’17] Proof of lower bound: Solution for every H ✓ n ◆ ✓ n ◆ 1 1 3 δ p 2 (1 + o (1)) δ p 3 (1 + o (1)) δ p 3 3 1 δ 1 / 3 p 3 extra triangles extra triangles p p With probability: With probability: p (1+ o (1)) 1 p (1+ o (1)) 1 3 δ p 2 n 2 2 δ 2 / 3 p 2 n 2

Theorem (Bhattacharya, Ganguly, Lubetzky, Z. ’17) . Fix ' > 0 and a graph H . Let X H = # copies of H in G ( n , p ). With 8 → 0 and and 8 ≥ < =2/>?(&) log < , M N O , ℙ F & ≥ 1 + ' GF & = 8 H I J KL 2 where Δ = max deg H , and c H ( δ ) > 0 is an explicit constant … For example , ' -/$ , + % & ' = min + For ! = # $ 0 ' 2/- , ' 2/- , −1 + 1 + + + For ! = # 1 % & ' = min , ' , ' 2/- , + % & ' = min + For ! = 6 1 7 '

Theorem (Bhattacharya, Ganguly, Lubetzky, Z. ’17) . Fix ! > 0 and a graph H . Let X H = # copies of H in G ( n , p ). With " → 0 and and " ≥ & '(/*+(-) log & , = > ? @ ℙ 3 - ≥ 1 + ! 63 - = " 8 9 : ;< ( where Δ = max deg H , and c H ( δ ) > 0 is an explicit constant … For example F - ! = 1 + ! (/C − 1 For A = B C,E F - ! = − H @ + I For A = @ 5 + 4 1 + !

Theorem (Bha@acharya, Ganguly, Lubetzky, Z. ’17) . Fix 6 > 0 and a graph H . Let X H = # copies of H in G ( n , p ). With D → 0 and and D ≥ G H=/IJ(") log G , V W X Y ℙ O " ≥ 1 + 6 PO " = D Q R S TU = where Δ = max deg H , and c H ( δ ) > 0 is an explicit constant … Independence polynomial: ! " # ≔ ∑ &'()* +), - # |-| Let H * denote the subgraph of H induced by its maximum degree vertices. Let / > 0 satisfy ! " ∗ / = 1 + 6 . Then, for a connected graph H , 7 " 6 = 8min /, = > 6 >/@(") if C is regular / if H is irregular

Large deviations in random hypergraphs Ongoing joint work with Yang Liu • ! (#) (%, ') : random k -uniform hypergraph, where every triple appears with probability p independently • Given some fixed 3-uniform hypergraph H , what can you say about upper tails of H -densities in ! (() (%, ') ? • Possible ways to embed extra edges • Plant clique: all triples contained in some chosen subset S of vertices • Plant 2-hub: all triples with at least two vertices in S • Plant 1-hub: all triples with at least one vertex in S • A simultaneous overlay of these constructions • Currently we understand what happens when H is a clique …

Arithmetic progressions Theorem (Bhattacharya, Ganguly, Shao, Z.) . The order in the exponent was Fix k and " > 0. Let X k denote the number of determined by k -term arithmetic progressions in a random Warnke, and holds subset of {1, 2, …, N } where every element is for all +/($*+) # ≳ log % included with probability p . With # → 0 and # ≥ % *+/(.$ ($*&)/& ) log % , % Recent improvement by Briët--Gopi ;< = > ? ℙ 4 $ ≥ 1 + " 74 $ = # +9: + "# $ % & • Proof of lower bound: plant an interval of length ∼

Dense se&ng G ( n , p ) ! constant " → ∞

Large Deviaons and Exponenal Random Graphs Yufei Zhao MIT May - PowerPoint PPT Presentation

Large Deviaons and Exponenal Random Graphs Yufei Zhao MIT May 2018 Universality Problem-dependent Central limit theorem: Large deviations ! # $ % < ' < ! + # ) % ' ! % Key questions: What is the probability of

Func+on applica+ons (calls, invoca+ons) lambda denotes a anonymous func+on To use a func+on, you

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Random Walks on Graphs Larry Fenn DATE Larry Fenn Random Walks on Graphs Introduction

Sta n d a r d s a n d Devia tion s: Th e Role of Rou tin e in Testin g Micha el Bolton, Dev elop

Back to Random Walks on Graphs Random walk on a graph: Stationary distribution: Back to Random

Random Graphs Omid Etesami Large graphs World Wide Web Internet Social Networks

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Random Graphs Random Graphs 1 / 19 Generative Models Could hope to understand networks in the

On the limit approach to random regular graphs Bal azs Szegedy Joint work with Agnes

Pregel Large-Scale Graph Processing William Jones Analysing large graphs is hard. We are

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Approximate Matchings in Dynamic Graph Streams Sanjeev Khanna University of Pennsylvania Joint

Random Dieudonn e Modules and the Cohen-Lenstra Heuristics David Zureick-Brown Bryden Cais

Bijective approach to percolation on triangulations and Liouville quantum gravity Olivier

Lecture 2. Random Matrix Theory and Phase Transitions of PCA Yuan Yao Hong Kong University of

Spline approximation of a random process with singularity Konrad Abramowicz Department of

Advanced Algorithms (II) Shanghai Jiao Tong University Chihao Zhang March 9th, 2020 Random

Random Regular Graphs and Differential Equations Nick Wormald University of Waterloo Minicourse

Translating the Cantor set by a random Randall Dougherty Jack Lutz R. Daniel Mauldin Jason

Large Devia*ons and Exponen*al Random Graphs Yufei Zhao MIT May - PowerPoint PPT Presentation

Large Devia*ons and Exponen*al Random Graphs Yufei Zhao MIT May 2018 Universality Problem-dependent Central limit theorem: Large deviations ! # $ % < ' < ! + # ) % ' ! % Key questions: What is the probability of

Func+on applica+ons (calls, invoca+ons) lambda denotes a anonymous func+on To use a func+on, you

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Random Walks on Graphs Larry Fenn DATE Larry Fenn Random Walks on Graphs Introduction

Sta n d a r d s a n d Devia tion s: Th e Role of Rou tin e in Testin g Micha el Bolton, Dev elop

Back to Random Walks on Graphs Random walk on a graph: Stationary distribution: Back to Random

Random Graphs Omid Etesami Large graphs World Wide Web Internet Social Networks

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Random Graphs Random Graphs 1 / 19 Generative Models Could hope to understand networks in the

On the limit approach to random regular graphs Bal azs Szegedy Joint work with Agnes

Pregel Large-Scale Graph Processing William Jones Analysing large graphs is hard. We are

Searching on Graphs November 16, 2016 CMPE 250 Graphs- Searching on Graphs November 16, 2016 1

CS200: Graphs Prichard Ch. 14 Rosen Ch. 10 CS200 - Graphs 1 Graphs A collection of What can

Approximate Matchings in Dynamic Graph Streams Sanjeev Khanna University of Pennsylvania Joint

Random Dieudonn e Modules and the Cohen-Lenstra Heuristics David Zureick-Brown Bryden Cais

Bijective approach to percolation on triangulations and Liouville quantum gravity Olivier

Lecture 2. Random Matrix Theory and Phase Transitions of PCA Yuan Yao Hong Kong University of

Spline approximation of a random process with singularity Konrad Abramowicz Department of

Advanced Algorithms (II) Shanghai Jiao Tong University Chihao Zhang March 9th, 2020 Random

Random Regular Graphs and Differential Equations Nick Wormald University of Waterloo Minicourse

Translating the Cantor set by a random Randall Dougherty Jack Lutz R. Daniel Mauldin Jason

Large Deviaons and Exponenal Random Graphs Yufei Zhao MIT May - PowerPoint PPT Presentation

Large Deviaons and Exponenal Random Graphs Yufei Zhao MIT May 2018 Universality Problem-dependent Central limit theorem: Large deviations ! # $ % < ' < ! + # ) % ' ! % Key questions: What is the probability of