Chapter 1: Getting Started The Probabilistic Method Summer 2020 - - PowerPoint PPT Presentation

β–Ά
chapter 1 getting started
SMART_READER_LITE
LIVE PREVIEW

Chapter 1: Getting Started The Probabilistic Method Summer 2020 - - PowerPoint PPT Presentation

Chapter 1: Getting Started The Probabilistic Method Summer 2020 Freie Universitt Berlin Chapter Overview Survey quick applications of the basic method to different areas 1 Unsatisfiable Formulae Chapter 1: Getting Started The


slide-1
SLIDE 1

Chapter 1: Getting Started

The Probabilistic Method Summer 2020 Freie UniversitΓ€t Berlin

slide-2
SLIDE 2

Chapter Overview

  • Survey quick applications of the basic method to different areas
slide-3
SLIDE 3

Β§1 Unsatisfiable Formulae

Chapter 1: Getting Started The Probabilistic Method

slide-4
SLIDE 4

Boolean Logic

Binary values

  • Computers can only talk in 0s and 1s
  • In logical applications, we map those to πΊπ‘π‘šπ‘‘π‘“ and π‘ˆπ‘ π‘£π‘“

Logical operators

  • Can obtain new truth values from old ones

Not: ¬ Or: ∨ And: ∧

Boolean formulae

  • Can build any π‘ˆπ‘ π‘£π‘“/πΊπ‘π‘šπ‘‘π‘“ expression using these operations
  • Such a formula is a function 𝑔: 0,1 π‘œ β†’ 0,1
slide-5
SLIDE 5

Anatomy of a Formula

Every Boolean formula can be written in Conjunctive Normal Form: Variables

  • 𝑦𝑗 ∈ 0,1

Literals

  • Variable 𝑦𝑗 or its negation ¬𝑦𝑗

Clauses

  • β€˜OR’ of literals
  • e.g.: 𝑦1 ∨ ¬𝑦2 ∨ 𝑦3

CNF Formula

  • β€˜AND’ of several clauses
  • e.g.: 𝑦1 ∨ ¬𝑦2 ∨ 𝑦3 ∧ ¬𝑦1 ∨ 𝑦2 ∧ (𝑦2 ∨ 𝑦3 ∨ 𝑦4 ∨ ¬𝑦5)
slide-6
SLIDE 6

A Little Complexity

Satisfiability Problem (SAT)

  • Given a Boolean formula 𝑔, can 𝑔 ever evaluate to π‘ˆπ‘ π‘£π‘“?
  • If not, say 𝑔 is unsatisfiable

Theorem 1.1.1 (Cook, 1971; Levin, 1973) SAT is 𝑂𝑄-Complete, i.e. is probably very difficult. A universal model

  • Most interesting problems can be reduced to SAT instances
  • e.g.: Travelling Salesman Problem, Subgraph Isomorphism, Largest Clique
slide-7
SLIDE 7

Restricted Formulae

Simplifying the problem

  • Perhaps the problem is easier for β€˜nice’ formulae
  • 𝑙-SAT: each clause must have exactly 𝑙 literals from distinct variables

Theorem 1.1.2 (Karp, 1972) For all 𝑙 β‰₯ 3, 𝑙-SAT is still 𝑂𝑄-Complete. Does size matter?

  • Karp β‡’ unsatisfiability does not require long clauses
  • Does it at least require many clauses? Are short formulae always satisfiable?
slide-8
SLIDE 8

Minimum Unsatisfiability

Extremal problem

  • How small can an unsatisfiable instance of 𝑙-SAT be?

Definition 1.1.3 Given 𝑙 ∈ β„•, let 𝑛0 𝑙 be the minimum 𝑛 ∈ β„• for which there is an unsatisfiable instance of 𝑙-SAT with 𝑛 clauses. Small 𝑙-SAT is easy

  • Can solve instances of 𝑙-SAT with 𝑛 < 𝑛0 𝑙 clauses in constant time!
  • (Existential) answer is always: yes (satisfiable)
slide-9
SLIDE 9

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 𝑦1 ∨ ¬𝑦2 ∨ 𝑦3 ∧ ¬𝑦1 ∨ 𝑦2 ∨ ¬𝑦4 ∧ ¬𝑦2 ∨ ¬𝑦3 ∨ ¬𝑦5

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” ? ? ? ? ?

slide-10
SLIDE 10

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 𝑦1 ∨ ¬𝑦2 ∨ 𝑦3 ∧ ¬𝑦1 ∨ 𝑦2 ∨ ¬𝑦4 ∧ ¬𝑦2 ∨ ¬𝑦3 ∨ ¬𝑦5 Step 1

  • Select 𝑦1 as the designated variable for the first clause

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” ? ? ? ? ?

slide-11
SLIDE 11

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 1 ∨ ¬𝑦2 ∨ 𝑦3 ∧ 0 ∨ 𝑦2 ∨ ¬𝑦4 ∧ ¬𝑦2 ∨ ¬𝑦3 ∨ ¬𝑦5 Step 1

  • Select 𝑦1 as the designated variable for the first clause
  • Set 𝑦1 = 1 to satisfy the clause

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 ? ? ? ?

slide-12
SLIDE 12

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 1 ∨ ¬𝑦2 ∨ 𝑦3 ∧ 0 ∨ 𝑦2 ∨ ¬𝑦4 ∧ ¬𝑦2 ∨ ¬𝑦3 ∨ ¬𝑦5 Step 2

  • The second clause is still unsatisfied

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 ? ? ? ?

slide-13
SLIDE 13

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 1 ∨ 0 ∨ 𝑦3 ∧ 0 ∨ 1 ∨ ¬𝑦4 ∧ 0 ∨ ¬𝑦3 ∨ ¬𝑦5 Step 2

  • The second clause is still unsatisfied
  • Select 𝑦2 as its designated variable, and set 𝑦2 = 1

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 1 ? ? ?

slide-14
SLIDE 14

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 1 ∨ 0 ∨ 0 ∧ 0 ∨ 1 ∨ ¬𝑦4 ∧ 0 ∨ 1 ∨ ¬𝑦5 Step 3

  • The third clause is still unsatisfied, so we set 𝑦3 = 0

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 1 ? ?

slide-15
SLIDE 15

A First Lower Bound

Lower bounds

  • Given any instance of 𝑙-SAT with few clauses, need to show it is satisfiable
  • First idea: build a satisfying argument greedily

A worked example (𝑙 = 3) 1 ∨ 0 ∨ 0 ∧ 0 ∨ 1 ∨ ¬𝑦4 ∧ 0 ∨ 1 ∨ ¬𝑦5 Step 3

  • The third clause is still unsatisfied, so we set 𝑦3 = 0
  • This satisfies the formula, so we are done

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 1 ? ?

slide-16
SLIDE 16

A First Lower Bound

Proof

  • Let 𝑔 be an arbitrary 𝑙-SAT formula with 𝑛 ≀ 𝑙 clauses
  • We use the greedy algorithm, satisfying each clause one at a time
  • When dealing with the 𝑗th clause, for 1 ≀ 𝑗 ≀ 𝑛, either:
  • it is already satisfied by our previous assignments, or
  • we have set at most 𝑗 βˆ’ 1 ≀ 𝑛 βˆ’ 1 < 𝑙 variables, so there is a free variable to choose
  • Hence we can satisfy all the clauses
  • Thus 𝑔 is satisfiable

∎

Proposition 1.1.4 For all 𝑙 ∈ β„•, 𝑛0 𝑙 > 𝑙.

slide-17
SLIDE 17

Being Greedy Doesn’t Always Pay

What if we have more clauses?

  • This greedy algorithm can get stuck

Extending our example

𝑦1 ∨ ¬𝑦2 ∨ 𝑦3 ∧ ¬𝑦1 ∨ 𝑦2 ∨ ¬𝑦4 ∧ ¬𝑦2 ∨ ¬𝑦3 ∨ ¬𝑦5 ∧ ¬𝑦1 ∨ ¬𝑦2 ∨ 𝑦3

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” ? ? ? ? ?

slide-18
SLIDE 18

Being Greedy Doesn’t Always Pay

What if we have more clauses?

  • This greedy algorithm can get stuck

Extending our example

1 ∨ 0 ∨ 0 ∧ 0 ∨ 1 ∨ ¬𝑦4 ∧ 0 ∨ 1 ∨ ¬𝑦5 ∧ 0 ∨ 0 ∨ 0

Steps 1-3

  • Proceed as before, with the same assignments
  • Now the final clause is unsatisfiable

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 1 ? ?

slide-19
SLIDE 19

Being Greedy Doesn’t Always Pay

What if we have more clauses?

  • This greedy algorithm can get stuck

Extending our example

1 ∨ 0 ∨ 1 ∧ 0 ∨ 1 ∨ ¬𝑦4 ∧ 0 ∨ 0 ∨ 1 ∧ 0 ∨ 0 ∨ 1

Formula is still satisfiable

  • Could have satisfied earlier clauses with different variables

π’šπŸ π’šπŸ‘ π’šπŸ’ π’šπŸ“ π’šπŸ” 1 1 1 ?

slide-20
SLIDE 20

Proposition 1.1.5 For all 𝑙 ∈ β„•, 𝑛0 𝑙 ≀ 2𝑙.

An Unsatisfiable Formula

Intuition

  • Clauses with unique variables are can always be satisfied
  • Maybe hardest when all clauses share the same variables

Building an unsatisfiable formula

  • With 𝑙 variables, there are 2𝑙 possible inputs and 2𝑙 possible clauses
  • Each clause is unsatisfied by a unique input
  • e.g.: 𝑦1 ∨ ¬𝑦2 ∨ ¬𝑦3 is not satisfied by 𝑦1, 𝑦2, 𝑦3 = 0,1,1
  • β‡’ The formula with all possible clauses is unsatisfiable
slide-21
SLIDE 21

A Tight Result

Upper bound

  • Previous construction

Lower bound

  • Need to show every 𝑙-SAT instance with 𝑛 < 2𝑙 clauses is satisfiable

Existential reformulation

  • Given: 𝑙-SAT formula 𝑔 with π‘œ variables and 𝑛 < 2𝑙 clauses
  • Goal: show there is some Τ¦

𝑦 ∈ Ξ© = {0,1}π‘œ with the property 𝑔 Τ¦ 𝑦 = 1

Theorem 1.1.6 For all 𝑙 ∈ β„•, 𝑛0 𝑙 = 2𝑙.

slide-22
SLIDE 22

Randomness to the Rescue

Existential reformulation

  • Given: 𝑙-SAT formula 𝑔 with π‘œ variables and 𝑛 < 2𝑙 clauses
  • Goal: show there is some Τ¦

𝑦 ∈ Ξ© = {0,1}π‘œ with the property 𝑔 Τ¦ 𝑦 = 1

The probabilistic method

  • Choose Τ¦

𝑦 ∈ {0,1}π‘œ uniformly at random

  • Show β„™ 𝑔 Τ¦

𝑦 = 1 > 0

Theorem 1.1.6 For all 𝑙 ∈ β„•, 𝑛0 𝑙 = 2𝑙.

slide-23
SLIDE 23

Union Bound Given arbitrary events 𝐹𝑗, we have ℙڂ𝑗 𝐹𝑗 ≀ σ𝑗 β„™ 𝐹𝑗 .

Bounding Probabilities

Setting

  • Given: 𝑔, a 𝑙-SAT formula with π‘œ variables and 𝑛 < 2𝑙 clauses
  • Given: uniformly random Τ¦

𝑦 ∈ {0,1}π‘œ

  • Goal: show β„™ 𝑔 Τ¦

𝑦 = 1 > 0

Bad events

  • Equivalently, want to show β„™ 𝑔 Τ¦

𝑦 = 0 < 1

  • Let 𝐹𝑗 be the event that the 𝑗th clause is not satisfied by Τ¦

𝑦

  • 𝑔 Τ¦

𝑦 = 0 =ڂ𝑗=1

𝑛 𝐹𝑗

  • β‡’ β„™ 𝑔 Τ¦

𝑦 = 0 = ℙڂ𝑗=1

𝑛 𝐹𝑗

slide-24
SLIDE 24

Completing the Proof

Individual clauses

  • Recall: 𝐹𝑗 is the event that the 𝑗th clause is not satisfied by Τ¦

𝑦

  • 𝐹𝑗 only depends on the values of the 𝑙 variables it contains
  • Exactly one of the 2𝑙 possible values does not satisfy the clause
  • β‡’ β„™ 𝐹𝑗 = 2βˆ’π‘™

β„™ 𝑔 Τ¦ 𝑦 = 0 = β„™ ራ

𝑗=1 𝑛

𝐹𝑗 ≀ ෍

𝑗=1 𝑛

β„™ 𝐹𝑗 = 𝑛2βˆ’π‘™ < 1 Conclusion

  • Therefore β„™ 𝑔 Τ¦

𝑦 = 1 = 1 βˆ’ β„™ Τ¦ 𝑦 = 0 > 0

  • Hence there is some Τ¦

𝑦 ∈ {0,1}π‘œ for which 𝑔 Τ¦ 𝑦 = 1 ∎

slide-25
SLIDE 25

Trivial unsatisfiability

  • In our construction to show 𝑛0 𝑙 ≀ 2𝑙, each clause had the same variables
  • Clauses are then forced to be in conflict with one another

Non-repetitive formulae

  • A 𝑙-SAT formula is non-repetitive if each clause has a distinct set of variables
  • e.g.: cannot have both (𝑦1 ∨ ¬𝑦2 ∨ 𝑦4) and (¬𝑦1 ∨ ¬𝑦2 ∨ ¬𝑦4) as clauses

Extremal problem

  • How many variables must an unsatisfiable non-repetitive 𝑙-SAT formula have?

Definition 1.1.7 Given 𝑙 ∈ β„•, let π‘œ0 𝑙 be the minimum π‘œ ∈ β„• for which there is an unsatisfiable non-repetitive 𝑙-SAT formula with π‘œ variables.

Is Repetition Necessary?

slide-26
SLIDE 26

Theorem 1.1.6 For all 𝑙 ∈ β„•, 𝑛0 𝑙 = 2𝑙.

A Lower Bound

Observation

  • A non-repetitive 𝑙-SAT formula with π‘œ variables can have at most π‘œ

𝑙 clauses

Corollary 1.1.8 For all 𝑙, π‘œ ∈ β„•, if π‘œ

𝑙 < 2𝑙, then π‘œ0 𝑙 > π‘œ.

slide-27
SLIDE 27

An Upper Bound

Existential formulation

  • Set of objects Ξ©: non-repetitive 𝑙-SAT formulae with π‘œ variables
  • Desired property 𝒬: βˆ€ Τ¦

𝑦 ∈ {0,1}π‘œ, 𝑔 Τ¦ 𝑦 = 0

Probabilistic approach

  • There are π‘œ

𝑙 sets of 𝑙 variables:

  • For each variable 𝑦𝑗, there are two possible literals: 𝑦𝑗 and ¬𝑦𝑗
  • Total of 2𝑙 possible clauses for this set of variables
  • Choose one uniformly at random
  • Make these choices independently
  • This gives us a random 𝑔 ∈ Ξ©
  • Want to show β„™ βˆ€ Τ¦

𝑦 ∈ {0,1}π‘œ, 𝑔 Τ¦ 𝑦 = 0 > 0

slide-28
SLIDE 28

Analysing the Bad Events

Satisfying assignments

  • For each Τ¦

𝑦 ∈ {0,1}π‘œ, let 𝐹 Τ¦

𝑦 be the event that 𝑔 Τ¦

𝑦 = 1

  • We want to show β„™Ϊ‚ Τ¦

𝑦 𝐹 Τ¦ 𝑦 < 1

  • Union bound:
  • There are 2π‘œ possible Τ¦

𝑦

  • β„™Ϊ‚ Τ¦

𝑦 𝐹 Τ¦ 𝑦 ≀ Οƒ Τ¦ 𝑦 β„™ 𝐹 Τ¦ 𝑦

  • Suffices to have β„™ 𝐹 Τ¦

𝑦 < 2βˆ’n for all Τ¦

𝑦

Fix an assignment Τ¦ 𝑦

  • For 𝑔 Τ¦

𝑦 = 1, Τ¦ 𝑦 must satisfy each of the π‘œ

𝑙 clauses

  • Let 𝐺

𝑗 be the event that Τ¦

𝑦 satisfies the 𝑗th clause

  • Then 𝐹 Τ¦

𝑦 =βˆ©π‘— 𝐺 𝑗

slide-29
SLIDE 29

Computing Probabilities

Recall

  • 𝑔 formed by choosing a random clause for each set of variables
  • 𝐹 Τ¦

𝑦: event that 𝑔 Τ¦

𝑦 = 1; 𝐺

𝑗: event that Τ¦

𝑦 satisfies the 𝑗th clause of 𝑔

  • Suffices to show β„™ 𝐹 Τ¦

𝑦 = β„™ βˆ©π‘— 𝐺 𝑗 < 2βˆ’π‘œ

Independence

  • Clauses are chosen independently β‡’ events 𝐺

𝑗 are independent

  • β‡’ β„™ βˆ©π‘— 𝐺

𝑗 = ς𝑗 β„™ 𝐺 𝑗

Satisfying a single clause

  • Given 𝑗 and our fixed Τ¦

𝑦, unique choice of literals such that 𝐺

𝑗 doesn’t hold

  • β‡’ β„™ 𝐺

𝑗 = 1 βˆ’ 2βˆ’π‘™

slide-30
SLIDE 30

Theorem 1.1.9 For all 𝑙 ∈ β„•, if π‘œ

𝑙 < 2𝑙, then π‘œ0 𝑙 > π‘œ, and if π‘œ 𝑙 > 2π‘™π‘œ ln 2, then

π‘œ0 𝑙 ≀ π‘œ.

Putting it all together

A final calculation

  • We therefore have β„™ 𝐹 Τ¦

𝑦 = ς𝑗 β„™ 𝐺 𝑗 = 1 βˆ’ 2βˆ’π‘™

π‘œ 𝑙

Exponential bound For all 𝑦 ∈ ℝ, 1 + 𝑦 ≀ 𝑓𝑦

  • β‡’ β„™ 𝐹 Τ¦

𝑦 = 1 βˆ’ 2βˆ’π‘™

π‘œ 𝑙 ≀ π‘“βˆ’2βˆ’π‘™ π‘œ 𝑙

  • This is less than 2βˆ’π‘œ if π‘œ

𝑙 > 2π‘™π‘œ ln 2

slide-31
SLIDE 31

Just Kidding, There’s One More Calculation

Binomial estimates

  • For all 1 ≀ 𝑙 ≀ π‘œ, we have

π‘œ 𝑙 𝑙

≀

π‘œ 𝑙 ≀ π‘œπ‘“ 𝑙 𝑙

  • If 𝑙 = π›½π‘œ, then π‘œ

𝑙 = 2 1+𝑝 1 𝐼 𝛽 π‘œ as π‘œ β†’ ∞

  • Binary entropy: 𝐼 𝛽 = βˆ’π›½ log 𝛽 βˆ’ 1 βˆ’ 𝛽 log 1 βˆ’ 𝛽
  • For more estimates:

http://page.mi.fu-berlin.de/shagnik/notes/binomials.pdf

Theorem 1.1.9 For all 𝑙 ∈ β„•, if π‘œ

𝑙 < 2𝑙, then π‘œ0 𝑙 > π‘œ, and if π‘œ 𝑙 > 2π‘™π‘œ ln 2, then

π‘œ0 𝑙 ≀ π‘œ.

slide-32
SLIDE 32

Just Kidding, There’s One More Calculation

Lower bound

  • We use

π‘œ π›½π‘œ = 2 1+𝑝 1 𝐼 𝛽 π‘œ

  • Therefore π‘œ

𝑙 ∼ 2𝑙 if 𝑙 = π›½π‘œ and 𝐼 𝛽 = 𝛽

  • This happens for 𝛽 = 0.7729 …

Upper bound

  • Binomial coefficient grows very fast
  • π‘œ+1

𝑙

=

π‘œ+1 π‘œβˆ’π‘™+1 π‘œ 𝑙 ∼ 1 1βˆ’π›½ π‘œ 𝑙

  • β‡’ for some constant 𝑑, if π‘œβ€² = π›½βˆ’1𝑙 + 𝑑 log 𝑙, then π‘œβ€²

𝑙

∼ 2π‘™π‘œ ln 2

Corollary 1.1.10 As 𝑙 β†’ ∞, π‘œ0 𝑙 = 1.2938 … + 𝑝 1 𝑙.

slide-33
SLIDE 33

Any questions?

slide-34
SLIDE 34

Β§2 Prefix-free Codes

Chapter 1: Getting Started The Probabilistic Method

slide-35
SLIDE 35

😒 😣 πŸ˜‘ πŸ™ƒ 😡

A Motivating Example

Evolutionary pitfalls

  • Imagine in some parallel universe a species evolves so that:
  • they develop binary computers, and
  • they communicate their emotional states through a set of five emojis

Technical problem

  • How should a binary computer transmit these states?
slide-36
SLIDE 36

A First Attempt

Binary encoding

  • We can index the emojis with integers 0 βˆ’ 4
  • The integers 0 βˆ’ 7 can be written as binary strings of length 3
  • Computers can send these strings to represent the emojis

Examples

😣😒😡 β†’ 001 | 000 | 100 β†’ 001000100 011000 β†’ 011 | 000 β†’ πŸ™ƒπŸ˜’

😒 😣 πŸ˜‘ πŸ™ƒ 😡

000 001 010 011 100

slide-37
SLIDE 37

A Problem and a Fix

Wasteful encoding

  • This is a bit costly – it takes three bits per emoji
  • Can we reduce the bandwidth by using a shorter encoding?

Idea

  • We need to encode five emojis
  • There are six non-empty binary strings of length at most two
  • Five is less than six

😒 😣 πŸ˜‘ πŸ™ƒ 😡

1 00 01 10

slide-38
SLIDE 38

The Problem in the β€œFix”

Encoding is simple

πŸ™ƒπŸ™ƒπŸ™ƒ β†’ 01 | 01 | 01 β†’ 010101

But how do we decode?

010101 β†’ 01 | 01 | 01 β†’ πŸ™ƒπŸ™ƒπŸ™ƒ ? 010101 β†’ 0 | 1 | 0 | 1 | 0 | 1 β†’ 😒😣😒😣😒😣? 010101 β†’ 01 | 0 | 10 | 1 β†’ πŸ™ƒπŸ˜’πŸ˜΅πŸ˜£?

😒 😣 πŸ˜‘ πŸ™ƒ 😡

1 00 01 10

slide-39
SLIDE 39

Coding: General Framework

Set-up

  • Have an alphabet 𝐡 = 𝑏1, 𝑏2, … , π‘π‘œ of size π‘œ
  • Want to encode the letters of the alphabet as binary strings

Encoding

  • Represent each 𝑏𝑗 with a word π‘₯𝑗 ∈ 0,1 βˆ—, a non-empty finite binary string
  • Let ℓ𝑗 = π‘₯𝑗 be the length of the word π‘₯𝑗

Objectives

  • Decipherability: given a concatenation of words, should be able to recover the
  • riginal words uniquely
  • Efficiency: would like to make the lengths ℓ𝑗 as small as possible
slide-40
SLIDE 40

Prefix-free Codes

Prefixes

  • Given a word π‘₯ ∈ 0,1 βˆ—, the β„“-prefix of π‘₯ is the subword of the first β„“ bits
  • e.g. the non-empty prefixes of π‘₯ = 00101 are 0, 00, 001, 0010 and 00101
  • but the substring 010 is not a prefix

Prefix-free codes

  • We say a code from an alphabet 𝐡 to 0,1 βˆ— is prefix-free if no codeword π‘₯𝑗 is

a prefix of any other codeword π‘₯

π‘˜, π‘˜ β‰  𝑗

  • Equivalently, if we place the codewords in the (infinite) binary tree of 0,1 βˆ—,

no codeword is an ancestor of another

slide-41
SLIDE 41

Decipherability

Proof

  • Want to show that the concatenation π‘₯ = π‘₯𝑗1π‘₯𝑗2 … π‘₯𝑗𝑑 can be decoded
  • Base case: 𝑑 = 0
  • In this case, π‘₯ is the empty string β‡’ no codewords
  • Induction step: 𝑑 β‰₯ 1
  • Start from the beginning of π‘₯, and read until the prefix is some codeword π‘₯

π‘˜

  • Must terminate, as π‘₯𝑗1 is a prefix of π‘₯
  • Cannot terminate on another codeword, as otherwise π‘₯

π‘˜ would be a prefix of π‘₯𝑗1

  • Thus we know 𝑏𝑗1 is the first letter
  • Remove π‘₯𝑗1, and decode π‘₯β€² = π‘₯𝑗2π‘₯𝑗3π‘₯𝑗4 … π‘₯𝑗𝑑 (induction hypothesis)

∎

Proposition 1.2.1 All prefix-free codes are decipherable.

slide-42
SLIDE 42

Examples

Uniform codes Given β„“, any injection 𝐡 β†’ {0,1}β„“ is prefix- free

βˆ… 1 00 01 10 11 000 001 010 011 100 101 110 111 😒 😣 πŸ˜‘ πŸ™ƒ 😡 ___

Length We must have 𝐡 ≀ 0,1 β„“ = 2β„“ β‡’ β„“ β‰₯ log 𝐡 β‡’ β„“ β‰₯ log 𝐡

slide-43
SLIDE 43

Examples - II

Improvements Can sometimes find prefix-free codes with a shorter average codeword length

βˆ… 1 00 01 10 11 😒 😣 πŸ˜‘____________ 000 001 010 011 100 101 110 111 πŸ™ƒ 😡

slide-44
SLIDE 44

Theorem 1.2.2 (Kraft, 1949) Given an alphabet 𝐡 of size π‘œ, any prefix-free code with codeword lengths β„“1, β„“2, … , β„“π‘œ must satisfy σ𝑗=1

π‘œ

2βˆ’β„“π‘— ≀ 1.

Short Prefix-free Codes

Extremal problem

  • How small can the average length of a codeword of a prefix-free code be?

Corollary 1.2.3 (Convexity) Given an alphabet 𝐡 of size π‘œ, the average length of the codewords in any prefix-free code is at least log π‘œ.

slide-45
SLIDE 45

Proof Idea

Existential reformulation

  • Want to show that an encoding with shorter codewords is not prefix-free
  • Given:
  • an encoding π‘₯1, π‘₯2, … , π‘₯π‘œ of 𝐡 with lengths β„“1, β„“2, … , β„“π‘œ such that σ𝑗=1

π‘œ

2βˆ’β„“π‘— > 1

  • Seek:
  • Codewords π‘₯𝑗, π‘₯

π‘˜, 𝑗 β‰  π‘˜, such that π‘₯𝑗 is a prefix of π‘₯ π‘˜

Key observation

  • Suppose we have a string π‘₯ ∈ 0,1 βˆ— such that both π‘₯𝑗, π‘₯

π‘˜ are prefixes of π‘₯

  • Then π‘₯𝑗 is a prefix of π‘₯

π‘˜ or wj is a prefix of π‘₯𝑗

New objective

  • Find a string π‘₯ ∈ 0,1 βˆ— that contains at least two codewords as prefixes
slide-46
SLIDE 46

Basic Fact For any random variable π‘Œ, the events π‘Œ β‰₯ 𝔽 π‘Œ and π‘Œ ≀ 𝔽 π‘Œ must occur with positive probability.

Probabilistic Framework

Probability space

  • Let 𝑀 = max π‘šπ‘—: 𝑗 ∈ π‘œ

be the length of the longest codeword

  • Let π‘₯ ∈ 0,1 𝑀 be a uniformly random string of length 𝑀

Random variables

  • Let π‘Œ =

𝑗: π‘₯𝑗 is a prefix of π‘₯ count the number of codeword prefixes

Simpler objective

  • Since π‘Œ is integer-valued, it suffices to show 𝔽 π‘Œ > 1
slide-47
SLIDE 47

Linearity of Expectation For any sequence π‘Œ1, π‘Œ2, … , π‘Œπ‘œ of random variables, and any sequence 𝑑1, 𝑑2, … , π‘‘π‘œ of constants, if π‘Œ = 𝑑1π‘Œ1 + 𝑑2π‘Œ2 + β‹― + π‘‘π‘œπ‘Œπ‘œ, then 𝔽 π‘Œ = 𝑑1𝔽 π‘Œ1 + 𝑑2𝔽 π‘Œ2 + β‹― + π‘‘π‘œπ”½ π‘Œπ‘œ .

Computing the Expectation

Indicator random variables

  • For each 𝑗 ∈ π‘œ , let 𝐹𝑗 be the event that π‘₯𝑗 is a prefix of the random string π‘₯
  • Let π‘Œπ‘— = 1𝐹𝑗 be the indicator function of this event
  • Then π‘Œ = σ𝑗=1

π‘œ

π‘Œπ‘—

Reduction to probabilities

  • We therefore have 𝔽 π‘Œ = σ𝑗=1

π‘œ

𝔽[π‘Œπ‘—] = σ𝑗=1

π‘œ

β„™(𝐹𝑗)

slide-48
SLIDE 48

Finishing the Proof

Recall

  • π‘₯ is a uniformly random string
  • π‘Œ is the number of codewords that are prefixes of π‘₯
  • 𝐹𝑗 is the event that the codeword π‘₯𝑗 is a prefix of π‘₯
  • 𝔽 π‘Œ = σ𝑗=1

π‘œ

β„™(𝐹𝑗)

Computing probabilities

  • The event 𝐹𝑗 only depends on the first ℓ𝑗 bits of π‘₯
  • This is a uniformly random string in 0,1 ℓ𝑗
  • β‡’ β„™ 𝐹𝑗 = 2βˆ’β„“π‘—

A grand finale

  • β‡’ if 𝔽 π‘Œ = σ𝑗=1

π‘œ

2βˆ’β„“π‘— > 1, there is some π‘₯ ∈ 0,1 βˆ— with two codewords as prefixes

  • Hence, in any prefix-free code, σ𝑗=1

π‘œ

2βˆ’β„“π‘— ≀ 1 ∎

slide-49
SLIDE 49

Linearity of Expectation

Union bound revisited

  • In the previous calculation, we saw the expression σ𝑗 β„™ 𝐹𝑗
  • Union bound: β„™ βˆͺ𝑗 𝐹𝑗 < σ𝑗 β„™ 𝐹𝑗
  • ∴ σ𝑗 β„™ 𝐹𝑗 < 1 β‡’ with positive probability, none of the events 𝐹𝑗 occur

Using linearity instead

  • σ𝑗 β„™ 𝐹𝑗 is the expectation of the number π‘Œ of events 𝐹𝑗 that occur
  • ∴ σ𝑗 β„™ 𝐹𝑗 < 1 β‡’ with positive probability, π‘Œ = 0
  • With linearity, we get information when σ𝑗 β„™ 𝐹𝑗 β‰₯ 1 as well
slide-50
SLIDE 50

Any questions?

slide-51
SLIDE 51

Β§3 Sum-free Subsets

Chapter 1: Getting Started The Probabilistic Method

slide-52
SLIDE 52

Sum Theorems

Definition 1.3.1 A set 𝐡 is sum-free if there are no 𝑦, 𝑧, 𝑨 ∈ 𝐡 with 𝑦 + 𝑧 = 𝑨. Theorem 1.3.3 (Schur, 1912) β„• cannot be partitioned into finitely many sum-free sets. Theorem 1.3.2 (Fermat, 1637; Wiles, 1995) For all π‘œ β‰₯ 3, the set π‘¦π‘œ: 𝑦 ∈ β„• is sum-free.

slide-53
SLIDE 53

Sum-free Subsets of [π‘œ]

Answer

  • If 𝐡 is sum-free, then A ≀

π‘œ 2

  • Odd integers: 𝑃 = 𝑦 ∈ π‘œ : 𝑦 ≑ 1 mod 2
  • Large integers: 𝑀 = 𝑦 ∈ π‘œ : 𝑦 >

π‘œ 2

  • These are the only two maximum sum-free subsets of [π‘œ]

Question How large can a sum-free subset of [π‘œ] be?

slide-54
SLIDE 54

Sum-free Subsets of Sets

Extremal function

  • Given a set S βŠ† β„•, let 𝑔 𝑇 = max

𝐡 : 𝐡 βŠ† 𝑇, 𝐡 sumβˆ’free

  • Let 𝑔 π‘œ = min 𝑔 𝑇 : 𝑇 βŠ‚ β„•, 𝑇 = π‘œ
  • Question: how quickly does 𝑔(π‘œ) grow?

Question (ErdΕ‘s, 1965) Does every set of π‘œ natural numbers have a large sum-free subset? Theorem 1.3.4 (Deshouillers, Freiman, SΓ³s) If 𝐡 βŠ† [π‘œ] is sum-free, then either 𝐡 βŠ† 𝑃, 𝐡 βŠ† 𝑀, or 𝐡 <

2 5 π‘œ + 1.

slide-55
SLIDE 55

Upper Bounds

Beating trivial

  • Recall: biggest sum-free subsets have odd or large integers
  • Let π‘ˆ βŠ† π‘œ be a set of

π‘œ 10 large odd integers, take 𝑇 = π‘œ βˆ– π‘ˆ

  • If 𝐡 βŠ† 𝑇 is sum-free, then either 𝐡 βŠ† 𝑃 βˆ– π‘ˆ, 𝐡 βŠ† 𝑀 βˆ– π‘ˆ or 𝐡 <

2 5 π‘œ + 1

  • Thus 𝑔

9 10 π‘œ ≀ 𝑔 𝑇 < 2 5 π‘œ + 1

  • β‡’ 𝑔 π‘œ ≀

4 9 π‘œ + 10 9

A trivial bound

  • 𝑔 π‘œ ≀ 𝑔

π‘œ =

π‘œ 2

  • Any good set should have lots of (well-distributed) sums
  • [π‘œ] has lots of sums – could this be best possible?
slide-56
SLIDE 56

Lower Bounds

Goal

  • Given a set 𝑇 of π‘œ natural numbers, find a large sum-free 𝐡 βŠ† 𝑇

Greedy approach

  • Start with 𝐡 = βˆ…, and add elements one-by-one, keeping 𝐡 sum-free
  • If 𝐡 = 𝑏, 𝐡 defines at most 𝑏+1

2

sums

  • If 𝑏+1

2

< π‘œ βˆ’ 𝑏, there is an element of 𝑇 βˆ– 𝐡 that can be added to 𝐡

  • β‡’ 𝑔 π‘œ >

2π‘œ βˆ’ 2

Theorem 1.3.5 (ErdΕ‘s, 1965) For all π‘œ ∈ β„•, 𝑔 π‘œ β‰₯

1 3 (π‘œ + 1).

slide-57
SLIDE 57

A Cyclic Digression

The problem with π‘œ

  • π‘œ does have large sum-free sets, 𝑃 and 𝑀
  • But 𝑇 might be far away from these

The cyclic group has more symmetry

  • Largest sum-free set in β„€π‘ž, π‘ž prime?
  • 𝑁 = 𝑦:

1 3 π‘ž < 𝑦 < 2 3 π‘ž is sum-free

  • Cauchy-Davenport: if 𝐡 βŠ† β„€π‘ž, then 𝐡 + 𝐡 β‰₯ min 2 𝐡 βˆ’ 1, π‘ž
  • Since 𝐡 ∩ 𝐡 + 𝐡 = βˆ…, 𝐡 ≀

π‘ž 3

  • β„€π‘ž has many large sum-free sets
  • For any 𝛽 ∈ β„€π‘ž βˆ– 0 , 𝛽𝑁 = {𝛽𝑦: 𝑦 ∈ 𝑁} is also sum-free
slide-58
SLIDE 58

Finding Large Sum-free Subsets

Proof idea

  • Given a set 𝑇 βŠ‚ β„• of size π‘œ, embed 𝑇 in β„€π‘ž for some suitable π‘ž
  • β„€π‘ž has many large sum-free subsets
  • Find one that intersects 𝑇 significantly

Randomness to the rescue

  • A random sum-free subset works!

Theorem 1.3.5 (ErdΕ‘s, 1965) For all π‘œ ∈ β„•, 𝑔 π‘œ β‰₯

1 3 (π‘œ + 1).

slide-59
SLIDE 59

Setting Up

Choosing a prime

  • Let π‘ž = 3𝑙 + 2 be prime with π‘ž > max 𝑇
  • Then 𝑁 = {𝑙 + 1, 𝑙 + 2, … , 2𝑙 + 1} is sum-free with size 𝑙 + 1
  • Embed 𝑇 βŠ† β„€π‘ž

Choosing a sum-free subset

  • Let 𝛽 ∈ β„€π‘ž βˆ– 0 be chosen uniformly at random
  • Let 𝑇𝛽 = 𝑇 ∩ 𝛽𝑁
  • 𝑇𝛽 βŠ† 𝑇 is sum-free:
  • If 𝑦 + 𝑧 = 𝑨 in 𝑇𝛽, then 𝑦 + 𝑧 ≑ 𝑨 mod π‘ž , so this would be a sum in 𝛽𝑁
slide-60
SLIDE 60

No Devil in the Details

Using linearity

  • 𝑇𝛽 = Οƒπ‘‘βˆˆπ‘‡ 1 π‘‘βˆˆπ›½π‘
  • β‡’ 𝔽 𝑇𝛽

= 𝔽 Οƒπ‘‘βˆˆπ‘‡ 1 π‘‘βˆˆπ›½π‘ = Οƒπ‘‘βˆˆπ‘‡ 𝔽 1 π‘‘βˆˆπ›½π‘ = Οƒπ‘‘βˆˆπ‘‡ β„™ 𝑑 ∈ 𝛽𝑁

Computing probabilities

  • 𝑑 ∈ 𝛽𝑁 ⇔ π›½βˆ’1𝑑 ∈ 𝑁
  • 𝛽 uniform over β„€π‘ž βˆ– 0 β‡’ π›½βˆ’1 uniform β‡’ π›½βˆ’1𝑑 uniform
  • β‡’ β„™ π›½βˆ’1𝑑 ∈ 𝑁 =

𝑁 π‘žβˆ’1 = 𝑙+1 3𝑙+1 > 1 3

Finishing the proof

  • β‡’ 𝔽 𝑇𝛽

>

1 3 𝑇 β‡’ for some 𝛽, 𝑇𝛽 β‰₯ 1 3 (π‘œ + 1)

  • This gives a sum-free subset of 𝑇 of the desired size

∎

slide-61
SLIDE 61

Finishing the Story

Improving the lower bound

  • Using Fourier analysis, Bourgain (1997) proved 𝑔 π‘œ β‰₯

1 3 (π‘œ + 2) for π‘œ β‰₯ 3

  • Best-known bound to date

Upper bounds

  • Blow-ups of small constructions: several improvements over the years
  • Until Eberhard, Green and Manners (2014) proved 𝑔 π‘œ ≀

1 3 + 𝑝 1

π‘œ

  • Construction randomised, but intricate
slide-62
SLIDE 62

Any questions?

slide-63
SLIDE 63

Β§4 SchΓΌtte Tournaments

Chapter 1: Getting Started The Probabilistic Method

slide-64
SLIDE 64

War by Proxy

Rival superpowers

  • Two powerful nations go to war
  • Hire private military companies to do the actual fighting

Objectives

  • Have enough power
  • Need to ensure that hired companies can defeat any of the companies the enemy hires
  • Be economical
  • Hire as few companies as possible

Problem

  • How many companies must be hired?
slide-65
SLIDE 65

A Graph Theoretic Representation

Tournaments

  • Build a directed graph
  • Vertices: private military companies
  • Arcs: edge 𝑦 β†’ 𝑧 if 𝑦 would defeat 𝑧 in battle
  • For every pair {𝑦, 𝑧}, exactly one of the arcs 𝑦 β†’ 𝑧 or 𝑧 β†’ 𝑦 is in the graph
  • Such a graph is called a tournament

Objectives

  • Dominating set
  • A subset of vertices 𝑇 such that, for every 𝑦 ∈ π‘Š βˆ– 𝑇, there is some 𝑑 ∈ 𝑇 with 𝑑 β†’ 𝑦
  • Then we can always defeat the enemy’s army, regardless of their choice
  • Economical
  • Want to choose a small dominating set
slide-66
SLIDE 66

Small Examples

Easy case

1 2 3

slide-67
SLIDE 67

Small Examples

Easy case

  • One vertex beats all others

1 2 3

slide-68
SLIDE 68

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes

1 2 3

slide-69
SLIDE 69

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes

1 2 3

slide-70
SLIDE 70

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

slide-71
SLIDE 71

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

Harder case

  • No such vertex

1 2 3

slide-72
SLIDE 72

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

Harder case

  • No such vertex
  • Any vertex we choose loses to another

1 2 3

slide-73
SLIDE 73

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

Harder case

  • No such vertex
  • Any vertex we choose loses to another

1 2 3

slide-74
SLIDE 74

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

Harder case

  • No such vertex
  • Any vertex we choose loses to another

1 2 3

slide-75
SLIDE 75

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

Harder case

  • No such vertex
  • Any vertex we choose loses to another

1 2 3

slide-76
SLIDE 76

Small Examples

Easy case

  • One vertex beats all others
  • Defeats any choice the enemy makes
  • Dominating set of size one

1 2 3

Harder case

  • No such vertex
  • Any vertex we choose loses to another
  • Dominating set of size two exists

1 2 3

slide-77
SLIDE 77

An Extremal Reformulation

Worst-case scenario

  • How large can the smallest dominating set in an π‘œ-vertex tournament be?
  • Inverse formulation
  • Say π‘ˆ has the SchΓΌtte property 𝑇𝑙 if it has no dominating set of size at most 𝑙
  • Let 𝜏(𝑙) be the minimum number of vertices in a tournament with the property 𝑇𝑙
  • β‡’ if π‘œ < 𝜏(𝑙), then π‘ˆ has a dominating set of size ≀ 𝑙

Proving bounds on 𝜏(𝑙)

  • Lower bound: 𝜏 𝑙 > π‘œ
  • Prove that any tournament on π‘œ vertices has a dominating set of size ≀ 𝑙
  • Upper bound: 𝜏 𝑙 ≀ π‘œ
  • Prove there is a tournament on π‘œ vertices without a dominating set of size ≀ 𝑙
slide-78
SLIDE 78

The Greedy Lower Bound

A recursive algorithm

  • Given an optimal tournament π‘ˆ, let 𝑀 ∈ π‘Š(π‘ˆ)
  • Let 𝐡 be the vertices dominating 𝑀, and 𝐢 the vertices 𝑀 dominates
  • Thus π‘Š π‘ˆ = 𝐡 βˆͺ 𝐢 βˆͺ 𝑀 , with 𝐡 β†’ 𝑀 β†’ 𝐢
  • Let 𝑇′ be a dominating set in π‘ˆ[𝐡], and set 𝑇 = 𝑇′ βˆͺ {𝑀}
  • If 𝑦 ∈ π‘Š π‘ˆ βˆ– 𝑇:
  • If 𝑦 ∈ 𝐡, then 𝑦 is dominated by 𝑇′, so there is an 𝑑 ∈ 𝑇′ βŠ† 𝑇 with 𝑑 β†’ 𝑦
  • If 𝑦 βˆ‰ 𝐡, then 𝑦 ∈ 𝐢, so 𝑀 β†’ 𝑦
  • Thus 𝑇 is a dominating set for π‘ˆ

Proposition 1.4.1 For all 𝑙 ∈ β„•, 𝜏 𝑙 β‰₯ 2𝑙+1 βˆ’ 1.

slide-79
SLIDE 79

Choosing the Right Vertex

Large out-degree

  • If 𝐡 is small, then it has a small dominating set
  • Thus we should choose 𝑀 to make 𝐡 as small as possible
  • β‡’ choose a vertex of maximum out-degree
  • Average out-degree is

1 π‘œ π‘œ 2 = 1 2 (π‘œ βˆ’ 1)

  • β‡’ by choosing 𝑀 of maximum out-degree, we ensure 𝐡 ≀

1 2 (π‘œ βˆ’ 1)

Induction

  • Since π‘ˆ has the property 𝑇𝑙, π‘ˆ[𝐡] must have the property π‘‡π‘™βˆ’1
  • β‡’

1 2 π‘œ βˆ’ 1 β‰₯ 𝐡 β‰₯ 𝜏 𝑙 βˆ’ 1 β‰₯ 2𝑙 βˆ’ 1 (induction hypothesis)

  • Solving gives 𝜏 𝑙 = π‘œ β‰₯ 2𝑙+1 βˆ’ 1

∎

slide-80
SLIDE 80

An Indomitable Tournament

Goal

  • Need to construct a tournament with no dominating set of size 𝑙
  • Greedy argument: tournament should be close to regular
  • Idea: try a random tournament π‘ˆ

Random tournament

  • Vertex set: π‘Š = π‘œ
  • For every pair 𝑦, 𝑧 ∈ [π‘œ], choose 𝑦 β†’ 𝑧 or 𝑧 β†’ 𝑦 uniformly at random

Theorem 1.4.2 (ErdΕ‘s, 1963) If π‘œ

𝑙

1 βˆ’ 2βˆ’π‘™ π‘œβˆ’π‘™ < 1, then there is an π‘œ-vertex tournament with the property 𝑇𝑙.

slide-81
SLIDE 81

Disproving Domination

Bad events

  • Given a set 𝑇 ∈

π‘œ 𝑙 , let 𝐹𝑇 be the event that 𝑇 is a dominating set

  • Then β„™ π‘ˆ has property 𝑇𝑙 = 1 βˆ’ β„™ βˆͺ𝑇 𝐹𝑇 β‰₯ 1 βˆ’ σ𝑇 β„™ 𝐹𝑇
  • Suffices to show σ𝑇 β„™ 𝐹𝑇 < 1

Computing probabilities

  • Fix 𝑇 ∈

π‘œ 𝑙

  • For 𝑇 to dominate a fixed vertex 𝑀, cannot have all edges 𝑀 β†’ 𝑇
  • 𝑙 edges, chosen independently β‡’ probability is 1 βˆ’ 2βˆ’π‘™
  • This must be true for all vertices in π‘Š βˆ– 𝑇
  • Edges again independent β‡’ β„™ 𝐹𝑇 = 1 βˆ’ 2βˆ’π‘™ π‘œβˆ’π‘™
  • β‡’ σ𝑇 β„™ 𝐹𝑇 =

π‘œ 𝑙

1 βˆ’ 2βˆ’π‘™ π‘œβˆ’π‘™ < 1. ∎

slide-82
SLIDE 82

Corollary 1.4.3 As 𝑙 β†’ ∞, 𝜏 𝑙 ≀ 𝑙22𝑙 ln 2 + 𝑝 1 .

Computing the Bound

Find the smallest π‘œ for which π‘œ

𝑙

1 βˆ’ 2βˆ’π‘™ π‘œβˆ’π‘™ < 1

  • Estimates:
  • π‘œ

𝑙 ≀ π‘œπ‘™ and 1 βˆ’ 2βˆ’π‘™ < π‘“βˆ’2βˆ’π‘™

  • β‡’ suffices to have π‘œπ‘™π‘“βˆ’2βˆ’π‘™ π‘œβˆ’π‘™ < 1
  • ⇔ 𝑙 ln π‘œ < π‘œ βˆ’ 𝑙 2βˆ’π‘™ βˆ—
  • βˆ— β‡’ π‘œ > 2𝑙
  • β‡’ ln π‘œ > 𝑙 ln 2
  • βˆ— β‡’ π‘œ > 𝑙22𝑙 ln 2
  • β‡’ ln π‘œ > 𝑙 ln 2 + 𝑝 1

, so this suffices

slide-83
SLIDE 83

Any questions?

slide-84
SLIDE 84

Β§5 Ramsey Numbers

Chapter 1: Getting Started The Probabilistic Method

slide-85
SLIDE 85

Theorem 1.5.2 (ErdΕ‘s, 1947) As 𝑙 β†’ ∞, we have 𝑆 𝑙 β‰₯ 1 𝑓 2 + 𝑝 1 𝑙 2

𝑙.

Reviewing the Classics

Definition 1.5.1 (Ramsey number) Given 𝑙 ∈ β„•, 𝑆 𝑙 is the minimum π‘œ for which any π‘œ-vertex graph has either a clique or independent set on 𝑙 vertices. Proof idea

  • Show that a uniformly random graph on this many vertices works
slide-86
SLIDE 86

Ramsey Upper Bounds

Proof by induction

  • Introduce the asymmetric Ramsey numbers

Theorem 1.5.3 (ErdΕ‘s-Szekeres, 1935) For all 𝑙 ∈ β„•, we have 𝑆 𝑙 ≀

2π‘™βˆ’2 π‘™βˆ’1 . In particular, as 𝑙 β†’ ∞,

𝑆 𝑙 ≀

1+𝑝 1 4 πœŒπ‘™ 4𝑙.

Definition 1.5.4 (Asymmetric Ramsey numbers) Given β„“, 𝑙 ∈ β„•, 𝑆(β„“, 𝑙) is the minimum π‘œ for which any π‘œ-vertex graph contains either a clique on β„“ vertices or an independent set on 𝑙 vertices.

slide-87
SLIDE 87

Theorem 1.5.5 (ErdΕ‘s-Szekeres, 1935) For all β„“, 𝑙 ∈ β„•, 𝑆 β„“, 𝑙 ≀ β„“ + 𝑙 βˆ’ 2 β„“ βˆ’ 1 = 𝑃 π‘™β„“βˆ’1 .

Asymmetric Ramsey Bounds

Problem

  • For fixed β„“ ∈ β„•, how does 𝑆 β„“, 𝑙 grow as 𝑙 β†’ ∞?
  • Asymmetric Ramsey numbers grow at most polynomially
  • Can we find matching lower bounds?
slide-88
SLIDE 88

Theorem 1.5.6 (TurΓ‘n, 1941) An π‘œ-vertex 𝐿ℓ-free graph can have at most 1 βˆ’

1 β„“βˆ’1 π‘œ 2 edges.

TurΓ‘n’s Lower Bound

Goal

  • Find a 𝐿ℓ-free graph with no large independent sets

Intuition

  • More edges β‡’ fewer independent sets
  • How dense can a 𝐿ℓ-free graph be?

Construction

  • Complete (β„“ βˆ’ 1)-partite graph π‘ˆπ‘œ,β„“βˆ’1
  • 𝛽 π‘ˆπ‘œ,β„“βˆ’1 =

π‘œ β„“βˆ’1 β‡’ 𝑆 β„“, 𝑙 > β„“ βˆ’ 1

𝑙 βˆ’ 1

slide-89
SLIDE 89

What About Randomness?

𝑆 𝑙 lower bound

  • Symmetric situation – can switch edges and non-edges
  • Used a uniformly random graph
  • Equivalently: each edge appears independently with probability

1 2

𝑆(β„“, 𝑙) for fixed β„“ ∈ β„•, 𝑙 β†’ ∞

  • Situation far from symmetric
  • β€œeasier” to make clique on β„“ vertices than an independent set on 𝑙 vertices
  • Should focus on graphs with fewer edges

ErdΕ‘s-RΓ©nyi model

  • 𝐻 π‘œ, π‘ž : π‘œ vertices, each edge appears independently with probability π‘ž
  • Allows us to β€œsee” sparser graphs
slide-90
SLIDE 90

A Random Lower Bound

Proof idea

  • Sample the random graph 𝐻 ∼ 𝐻(π‘œ, π‘ž)
  • What could go wrong?
  • Could find a clique on β„“ vertices
  • Could find an independent set on 𝑙 vertices

Theorem 1.5.7 Given β„“, 𝑙, π‘œ ∈ β„• and π‘ž ∈ 0,1 , if π‘œ β„“ π‘ž

β„“ 2 + π‘œ

𝑙 1 βˆ’ π‘ž

𝑙 2 < 1,

then 𝑆 β„“, 𝑙 > π‘œ.

slide-91
SLIDE 91

Analysing Bad Events

Bad cliques

  • Given a set 𝑇 ∈

[π‘œ] β„“ , let 𝐹𝑇 be the event that 𝐻[𝑇] is a clique

  • β„“

2 pairs, each an edge independently with probability π‘ž

  • β‡’ β„™ 𝐹𝑇 = π‘ž

β„“ 2

Bad independent sets

  • Given a set π‘ˆ ∈

π‘œ 𝑙 , let πΊπ‘ˆ be the event that 𝐻[π‘ˆ] is an independent set

  • 𝑙

2 pairs, each a non-edge independently with probability 1 βˆ’ π‘ž

  • β‡’ β„™ πΊπ‘ˆ = 1 βˆ’ π‘ž

𝑙 2

slide-92
SLIDE 92

Completing the Proof

Recall

  • 𝐹𝑇 = 𝐻 𝑇 is an β„“βˆ’clique , β„™ 𝐹𝑇 = π‘ž

β„“ 2

  • πΊπ‘ˆ = 𝐻 π‘ˆ is an independent π‘™βˆ’set , β„™ πΊπ‘ˆ = 1 βˆ’ π‘ž

𝑙 2

Union bound does the job

  • 𝐻 not Ramsey = βˆͺ𝑇 𝐹𝑇 βˆͺ βˆͺπ‘ˆ πΊπ‘ˆ
  • ∴ β„™ 𝐻 not Ramsey = β„™ (βˆͺ𝑇 𝐹𝑇) βˆͺ βˆͺπ‘ˆ πΊπ‘ˆ

≀ σ𝑇 β„™ 𝐹𝑇 + Οƒπ‘ˆ β„™(πΊπ‘ˆ)

  • σ𝑇 β„™(𝐹𝑇) + Οƒπ‘ˆ β„™(πΊπ‘ˆ) =

π‘œ β„“ π‘ž

β„“ 2 +

π‘œ 𝑙

1 βˆ’ π‘ž

𝑙 2 < 1

β‡’ β„™ 𝐻 Ramsey = 1 βˆ’ β„™(𝐻 not Ramsey) > 0 ∎

slide-93
SLIDE 93

An Actual Bound

What does this tell us about 𝑆(β„“, 𝑙)? Goal

  • Maximise π‘œ
  • Subject to π‘œ

β„“ π‘ž

β„“ 2 +

π‘œ 𝑙

1 βˆ’ π‘ž

𝑙 2 < 1 for some π‘ž ∈ [0,1]

Theorem 1.5.7 Given β„“, 𝑙, π‘œ ∈ β„• and π‘ž ∈ 0,1 , if π‘œ β„“ π‘ž

β„“ 2 + π‘œ

𝑙 1 βˆ’ π‘ž

𝑙 2 < 1,

then 𝑆 β„“, 𝑙 > π‘œ.

slide-94
SLIDE 94

Computing a Lower Bound

Goal

  • Maximise π‘œ
  • Subject to π‘œ

β„“ π‘ž

β„“ 2 +

π‘œ 𝑙

1 βˆ’ π‘ž

𝑙 2 < 1 for some π‘ž ∈ [0,1]

Varying π‘ž

  • As π‘ž increases, π‘œ

β„“ π‘ž

β„“ 2 increases and π‘œ

𝑙

1 βˆ’ π‘ž

𝑙 2 decreases

  • β‡’ at optimum, expect both quantities to be comparable

Simplification

  • Instead solve π‘œ

β„“ π‘ž

β„“ 2 <

1 2 and π‘œ 𝑙

1 βˆ’ π‘ž

𝑙 2 <

1 2

slide-95
SLIDE 95

Computing Some More

π‘œ β„“ π‘ž

β„“ 2 <

1 2:

  • Bound π‘œ

β„“ ≀ π‘œβ„“, so π‘œ β„“ π‘ž

β„“ 2 ≀ π‘œπ‘ž β„“βˆ’1 2

β„“

  • Sufficient to have π‘ž ≀ 1 βˆ’ 𝑝 1

π‘œβˆ’2/(β„“βˆ’1)

π‘œ 𝑙

1 βˆ’ π‘ž

𝑙 2 <

1 2:

  • Bound π‘œ

𝑙 ≀ π‘œπ‘™ and 1 βˆ’ π‘ž ≀ π‘“βˆ’π‘ž, so π‘œ 𝑙

1 βˆ’ π‘ž

𝑙 2 ≀ π‘œπ‘“βˆ’π‘ž π‘™βˆ’1 /2 𝑙

  • Suffices to have π‘œπ‘“βˆ’π‘ž π‘™βˆ’1 /2 < 1 β‡’ π‘ž 𝑙 βˆ’ 1 > 2 ln π‘œ
  • Substitute π‘ž β‰ˆ π‘œβˆ’2/(β„“βˆ’1)
  • β‡’ 𝑙 > 2π‘œ2/(β„“βˆ’1) ln π‘œ
slide-96
SLIDE 96

Corollary 1.5.8 For fixed β„“ ∈ β„• and 𝑙 β†’ ∞, we have Ξ© 𝑙 2 ln 𝑙

β„“βˆ’1 2

= 𝑆 β„“, 𝑙 = 𝑃 π‘™β„“βˆ’1 .

Concluding the Computations

Recall

  • 𝑙 β‰ˆ 2π‘œ2/(β„“βˆ’1) ln π‘œ

Solve for π‘œ

  • π‘œ β‰ˆ

𝑙 2 ln π‘œ

β„“βˆ’1 2 β‰ˆ

𝑙 2 ln 𝑙

β„“βˆ’1 2

slide-97
SLIDE 97

Any questions?