Fast sampling and counting -SAT solutions in the local lemma regime - - PowerPoint PPT Presentation

β–Ά
fast sampling and counting sat solutions in the local
SMART_READER_LITE
LIVE PREVIEW

Fast sampling and counting -SAT solutions in the local lemma regime - - PowerPoint PPT Presentation

Fast sampling and counting -SAT solutions in the local lemma regime Weiming Feng Nanjing University Joint work with: Heng Guo (University of Edinburgh) Yitong Yin (Nanjing University) Chihao Zhang (Shanghai Jiao Tong University)


slide-1
SLIDE 1

Fast sampling and counting 𝑙-SAT solutions in the local lemma regime

Weiming Feng Nanjing University

Joint work with: Heng Guo (University of Edinburgh) Yitong Yin (Nanjing University) Chihao Zhang (Shanghai Jiao Tong University)

Online Seminar Institute of Computing Technology, Chinese Academy of Sciences

slide-2
SLIDE 2

Conjunctive normal form (CNF)

  • Instance: a formula Ξ¦ = (π‘Š, 𝐷), for example

Ξ¦ = 𝑦! ∨ ¬𝑦" ∨ 𝑦# ∧ 𝑦! ∨ 𝑦" ∨ 𝑦$ ∧ 𝑦# ∨ ¬𝑦$ ∨ ¬𝑦%

π‘Š = {𝑦!, 𝑦", 𝑦#, 𝑦$, 𝑦%}: set of Boolean variables; 𝐷: set of clauses.

  • SAT solutions: an assignment of variables in π‘Š s.t. 𝚾 = true.
  • Fundamental computational tasks for CNF formula:
  • Decision: Does SAT solution exist?

NP-Complete problem [Cook 1971, Levin 1973].

  • Counting: How many SAT solutions?

#P-Complete problem [Valiant 1979].

clause

slide-3
SLIDE 3

𝑙, 𝑒 -CNF formula Ξ¦ = (π‘Š, 𝐷)

  • Each clause contains 𝑙 Boolean variables.
  • Each variable belongs to at most 𝑒 clauses, e.g. max degree ≀ 𝒆.

Example: (3,2)-CNF formula 𝑦! ∨ ¬𝑦" ∨ 𝑦# ∧ 𝑦! ∨ 𝑦" ∨ 𝑦$ ∧ 𝑦# ∨ ¬𝑦$ ∨ ¬𝑦%

Suppose a (𝑙, 𝑒)-CNF formula satisfies 𝒍 ≳ 𝐦𝐩𝐑 𝒆 (𝑙 β‰₯ log 𝑒 + log 𝑙 + 𝐷).

  • Existence [ErdΕ‘s, LovΓ‘sz, 1975]

If each variable takes a value in {true,false} uniformly and independently Pr all clauses are satis?ied β‰₯ 1 βˆ’ 1 2𝑒𝑙

!"

> 0, which implies the 𝑙-SAT solution must exist;

  • Construction [Moser, Tardos, 2010]

a 𝑙-SAT solution can be constructed in expected time 𝑃 π‘œπ‘’π‘™ .

LovΓ‘sz Local Lemma (LLL)

slide-4
SLIDE 4

Sampling & counting 𝑙-SAT solutions

  • Input: a 𝑙, 𝑒 -CNF formula Ξ¦ = (π‘Š, 𝐷) with π‘Š = π‘œ, and error bound πœ— > 0.
  • Almost uniform sampling: generate a random SAT solution π‘Œ ∈ true, false %

s.t. the total variation distance is at most πœ—,

𝑒&% π‘Œ, 𝜈 = 1 2 )

'∈ true,false

#

Pr π‘Œ = 𝜏 βˆ’ 𝜈(𝜏) ≀ πœ—

𝜈: the uniform distribution of all 𝑙-SAT solutions.

slide-5
SLIDE 5

Sampling & counting 𝑙-SAT solutions

  • Input: a 𝑙, 𝑒 -CNF formula Ξ¦ = (π‘Š, 𝐷) with π‘Š = π‘œ, and error bound πœ— > 0.
  • Almost uniform sampling: generate a 𝑙-SAT solution π‘Œ ∈ true, false % s.t.

the total variation distance 𝑒&% π‘Œ, 𝜈 ≀ πœ—,

𝜈: the uniform distribution of all 𝑙-SAT solutions.

  • Approximate counting: estimate the number of 𝑙-SAT solutions, e.g. output

1 βˆ’ πœ— π‘Ž ≀ 9 𝒂 ≀ 1 + πœ— π‘Ž,

π‘Ž = the number of 𝑙-SAT solutions.

Almost Uniform Sampling Approximate Counting

Self-reduction [Jerrum, Valiant, Vazirani 1986] Simulated annealing [Štefankovič et al. 2009]

slide-6
SLIDE 6

Work Regime Running time/lower bound Technique Hermon et al.’19 Monotone CNF[1] 𝑙 ≳ 2 log 𝑒

poly 𝑒𝑙 π‘œ log π‘œ

Markov chain Monte Carlo (MCMC) Guo et al.’17 𝑑 β‰₯ min log 𝑒𝑙 , 𝑙/2

[2]

𝑙 ≳ 2 log 𝑒

poly 𝑒𝑙 π‘œ

Partial rejection sampling Moitra’17 𝑙 ≳ 60 log 𝑒

π‘œ$%&'(!))

Linear programming BezΓ‘kovΓ‘ et al.’15 𝑙 ≀ 2 log 𝑒 βˆ’ 𝐷

NP-hard

  • [1] Monotone CNF: all variables appear positively, e.g. 𝛸 = 𝑦! ∨ 𝑦" ∨ 𝑦# ∧ 𝑦" ∨ 𝑦$ ∨ 𝑦% ∧ 𝑦# ∨ 𝑦$ ∨ 𝑦& .

[2] s: two dependent clauses share at least 𝑑 variables.

Open Problem: Can we sample general

𝑙, 𝑒 -CNF solutions such that

  • the threshold down to 𝑙 ≳ 2 log 𝑒;
  • the running time poly(𝑒𝑙) 1

𝑃(π‘œ).

Table: previous results for sampling SAT solutions of 𝑙, 𝑒 -CNF formulas

slide-7
SLIDE 7

Work Regime Running time/lower bound Technique Hermon et al.’19 Monotone CNF 𝑙 ≳ 2 log 𝑒

poly 𝑒𝑙 π‘œ log π‘œ

MCMC Guo et al.’17 𝑑 β‰₯ min log 𝑒𝑙 , 𝑙/2 𝑙 ≳ 2 log 𝑒

poly 𝑒𝑙 π‘œ

Partial rejection sampling Moitra’17 𝑙 ≳ 60 log 𝑒

π‘œ$%&'(!))

Linear programming BezΓ‘kovΓ‘ et al.’15 𝑙 ≀ 2 log 𝑒 βˆ’ 𝐷

NP-hard

  • This work

𝒍 ≳ πŸ‘πŸ 𝐦𝐩𝐑 𝒆 ; 𝑷(π’†πŸ‘π’πŸ’π’πŸ.𝟏𝟏𝟏𝟏𝟏𝟐) MCMC

Table: results for sampling SAT solutions of 𝑙, 𝑒 -CNF formulas

Our result

slide-8
SLIDE 8

Main theorem (this work)

For any sufficiently small πœ‚ < 2EFG, any (𝑙, 𝑒)-CNF formula satisfying 𝑙 β‰₯ 20 log 𝑒 + 20 log 𝑙 + 3 log 1 πœ‚ ,

  • sampling algorithm (main algorithm)

draw almost uniform random 𝑙-SAT solution in time D 𝑃 𝑒F𝑙Hπ‘œIJK ;

  • counting algorithm (by simulated annealing reduction)

count #𝑙-SAT solutions approximately in time D 𝑃 𝑒H𝑙Hπ‘œFJK ;

slide-9
SLIDE 9

Classic Glauber dynamics (Gibbs sampling)

𝑦! 𝑦" 𝑦# 𝑦' 𝑦$ 𝑦% 𝑦& (𝑦!∨ ¬𝑦" ∨ 𝑦#) ∧ (𝑦" ∨ 𝑦' ∨ 𝑦%) ∧ (𝑦$ ∨ ¬𝑦% ∨ 𝑦&) true false

Start from an arbitrary solution 𝑍 ∈ π‘ˆ, 𝐺 %; For each 𝑒 from 1 to π‘ˆ do

  • Pick 𝑀 ∈ π‘Š uniformly at random;
  • Resample 𝑍

N ∼ (β‹…βˆ£ 𝑍 %\N);

slide-10
SLIDE 10

𝑦! 𝑦" 𝑦# 𝑦' 𝑦$ 𝑦% 𝑦& (𝑦!∨ ¬𝑦" ∨ 𝑦#) ∧ (𝑦" ∨ 𝑦' ∨ 𝑦%) ∧ (𝑦$ ∨ ¬𝑦% ∨ 𝑦&) true false

Classic Glauber dynamics (Gibbs sampling)

Start from an arbitrary solution 𝑍 ∈ π‘ˆ, 𝐺 %; For each 𝑒 from 1 to π‘ˆ do

  • Pick 𝑀 ∈ π‘Š uniformly at random;
  • Resample 𝑍

N ∼ (β‹…βˆ£ 𝑍 %\N);

slide-11
SLIDE 11

𝑦! 𝑦" 𝑦# 𝑦' 𝑦$ 𝑦% 𝑦& (𝑦!∨ ¬𝑦" ∨ 𝑦#) ∧ (𝑦" ∨ 𝑦' ∨ 𝑦%) ∧ (𝑦$ ∨ ¬𝑦% ∨ 𝑦&) true false

Classic Glauber dynamics (Gibbs sampling)

Start from an arbitrary solution 𝑍 ∈ π‘ˆ, 𝐺 %; For each 𝑒 from 1 to π‘ˆ do

  • Pick 𝑀 ∈ π‘Š uniformly at random;
  • Resample 𝑍

N ∼ 𝜈N(β‹…βˆ£ 𝑍 %\N); T/F?

slide-12
SLIDE 12

𝑦! 𝑦" 𝑦# 𝑦' 𝑦$ 𝑦% 𝑦& (𝑦!∨ ¬𝑦" ∨ 𝑦#) ∧ (𝑦" ∨ 𝑦' ∨ 𝑦%) ∧ (𝑦$ ∨ ¬𝑦% ∨ 𝑦&) true false

Classic Glauber dynamics (Gibbs sampling)

Start from an arbitrary solution 𝑍 ∈ π‘ˆ, 𝐺 %; For each 𝑒 from 1 to π‘ˆ do

  • Pick 𝑀 ∈ π‘Š uniformly at random;
  • Resample 𝑍

N ∼ 𝜈N(β‹…βˆ£ 𝑍 %\N); F!

slide-13
SLIDE 13

Connectivity barrier (toy example)

  • (𝑙, 𝑒)-CNF formula Ξ¦ = (π‘Š, 𝐷) with π‘Š = 𝑦I, 𝑦F, … 𝑦P :

Ξ¦ = 𝐷I ∧ 𝐷F ∧ β‹― ∧ 𝐷P. 𝐷I = (¬𝑦I ∨ 𝑦F ∨ 𝑦H ∨ β‹― ∨ 𝑦P) forbids 100 … 0 𝐷F = (𝑦I ∨ ¬𝑦F ∨ 𝑦H ∨ β‹― ∨ 𝑦P) forbids 010 … 0 𝐷P = (𝑦I ∨ 𝑦F ∨ 𝑦H ∨ β‹― ∨ ¬𝑦P) forbids 000 … 1

  • Any assignment π‘Œ ∈ 0,1 % with π‘Œ I = 1 is infeasible.
  • All false solution 𝟏 is disconnected with others.

00…0 00…1 01…0 10…0

Other Solutions

slide-14
SLIDE 14
  • Glauber dynamics: random walk over solution space via local update.
  • Local Markov chain: one of the most fundamental approach for sampling:

rapid mixing slow mixing not mixing For sampling CNF solutions, the MCMC approach meets the connectivity barrier.

β€œthe solution space (and hence the natural Markov chain) is not connected” Mathematics and Computation [Wigderson’19]

uniform graph coloring weighted matching/independent set Ising/spin system bases of a matroid

We are here!

slide-15
SLIDE 15

Work Regime Running time Technique Hermon et al.’19 Monotone CNF 𝑙 ≳ 2 log 𝑒

poly 𝑒𝑙 π‘œ log π‘œ

MCMC Guo, Jerrum, Liu’17 𝑑 β‰₯ min log 𝑒𝑙 , 𝑙/2 𝑙 ≳ 2 log 𝑒

poly 𝑒𝑙 π‘œ

Partial rejection sampling Moitra’17 𝑙 ≳ 60 log 𝑒

π‘œ!"#$(&')

Linear programming

monotone CNF

Technique Motivation:

Can MCMC approach bypass the connectivity barrier?

heavy intersection constant 𝒆 and 𝒍

Bypass the connectivity barrier

Non-MCMC approach

slide-16
SLIDE 16

Our technique: projection

Projecting from a high dimension to a lower dimension to improve connectivity

Source: https://www.shadowmatic.com/presskit/images/IMG_0650.png

slide-17
SLIDE 17

Construct a good subset of variables 𝑁 βŠ† π‘Š Run Glauber dynamics on projected distribution 𝜈+ to draw sample π‘Œ ∼ 𝜈+ Draw sample 𝑍 ∼ 𝜈,\+(β‹… |π‘Œ) from the conditional distribution

Start from a uniform random π‘Œ ∈ true,false ); For each 𝑒 from 1 to π‘ˆ

  • Pick a variable 𝑀 ∈ 𝑁 uniformly at random;
  • Resample π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*);

Return π‘Œ ∈ true,false ).

T/F?

There exists an efficiently constructible subset 𝑁 βŠ† π‘Š such that:

  • the Glauber dynamics on 𝜈+ is rapidly mixing,
  • the Glauber dynamics on 𝜈+ can be implemented efficiently (draw π‘Œ. ∼ 𝜈.(β‹… |π‘Œ+\.)),
  • sampling assignment for π‘Š\𝑁 can be implemented efficiently (draw 𝑍 ∼ 𝜈,\+(β‹… |π‘Œ)).

computing exact distr. can be #P-hard

π’˜

slide-18
SLIDE 18

Construct a good subset of variables 𝑁 βŠ† π‘Š Run Glauber dynamics on projected distribution 𝜈+ to draw sample π‘Œ ∼ 𝜈+ Draw sample 𝑍 ∼ 𝜈,\+(β‹… |π‘Œ) from the conditional distribution Our Tasks:

  • Construct such a good subset 𝑁 βŠ† π‘Š.
  • Show that the Glauber dynamics on 𝜈+ is rapidly mixing.
  • Given assignment on 𝑁, draw samples efficiently from the conditional distribution.

Start from a uniform random π‘Œ ∈ true,false ); For each 𝑒 from 1 to π‘ˆ

  • Pick a variable 𝑀 ∈ 𝑁 uniformly at random;
  • Resample π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*);

Return π‘Œ;

T/F?

π’˜

slide-19
SLIDE 19

Mark a set of variables 𝑁 βŠ† π‘Š such that

  • each clause contains at least 𝛽𝑙 β‰ˆ 0.11𝑙 marked variables;
  • each clause contains at least 𝛾𝑙 β‰ˆ 0.51𝑙 unmarked variables;

Lemma: marking (prove via LLL)

If 𝑙 β‰₯ 20 log 𝑒 + 20 log 𝑙 + 3 log ,

  • , then

Pr Moserβˆ’Tardos alg co construct cts 𝑁 in time 𝑃 π‘œπ‘’π‘™ log 1 πœ— β‰₯ 1 βˆ’ πœ— 3 .

Mark variables [Moitra’ 17]

Mark each 𝑀 ∈ π‘Š independently w.p. 𝑄 =

!/012 "

to construct a random set β„³ βŠ† π‘Š by LLL, Pr β„³ satisWies above property > 0

slide-20
SLIDE 20

The rapid mixing of Glauber dynamics on 𝜈-

Start from a uniform random π‘Œ ∈ true,false ); For each 𝑒 from 1 to π‘ˆ

  • Pick a marked variable 𝑀 ∈ 𝑁 u.a.r.;
  • Resample π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*);

Return π‘Œ;

T/F?

Property: local uniformity (proved via LLL [Haeupler, Saha, Srinivasan’ 11]) For any assignment π‘Œ)\*, the distribution 𝜈*(β‹… |π‘Œ)\*) is cl close to uniform: βˆ€π‘‘ ∈ true,false , 𝜈*(c |π‘Œ)\*) = 1 2 Β± 1 poly(𝑒𝑙) . Each clause has β‰₯ 𝛾𝑙 unmarked variables, by LLL [Haeupler, Saha, Srinivasan’ 11]:

  • After each transition, Pr π‘Œ* = true β‰ˆ ,

. > 0 and Pr π‘Œ* = false β‰ˆ , . > 0.

  • Local uniformity

Glauber dynamics on 𝝂𝑡 is connected!

π’˜

slide-21
SLIDE 21

The rapid mixing of Glauber dynamics on 𝜈-

Start from a uniform random π‘Œ ∈ true,false ); For each 𝑒 from 1 to π‘ˆ = 2π‘œ log 01

2

  • Pick a marked variable 𝑀 ∈ 𝑁 u.a.r.;
  • Resample π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*);

Return π‘Œ;

T/F?

Lemma: rapid mixing If T = 2π‘œ log 01

2 , then the returned random assignment π‘Œ satisfies

𝑒34 π‘Œ, 𝜈) ≀ πœ— 3 .

  • Use path coupling [Bubley, Dyer’97] to bound the mixing time.
  • Use β€œdisagreement coupling” [Moitra’17, Guo et al.’ 18] to bound the discrepancy of path coupling.
  • Use local uniformity property (LLL) to show the small discrepancy of β€œdisagreement coupling”.

π’˜

slide-22
SLIDE 22

Implementation of the algorithm

Challenge: computing the exact conditional distributions can be #P-hard.

Transition of Glauber dynamics resample π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*)

T/F?

π’˜

Sample unmarked variable in last step sample 𝑍 ∼ 𝜈4\)(β‹… |π‘Œ)

T/F? T/F? T/F? T/F? T/F? T/F? T/F? T/F? T/F? T/F? T/F? T/F? T/F?

π‘Ž( = #{𝑍 ∈ π‘ˆ, 𝐺 )is a SAT solution ∣ 𝑍

* = π‘ˆ, 𝑍 +\* = π‘Œ+\*}

π‘Ž- = #{𝑍 ∈ π‘ˆ, 𝐺 )is a SAT solution ∣ 𝑍

* = 𝐺, 𝑍 +\* = π‘Œ+\*}

𝜈* π‘ˆ π‘Œ+\* = π‘Ž( π‘Ž( + π‘Ž- 𝜈* 𝐺 π‘Œ+\* = π‘Ž- π‘Ž( + π‘Ž-

slide-23
SLIDE 23

Key Property: w.h.p., the graph is deconstructed into small components of size 𝑃 𝑒𝑙 log 1

2

remove satisfied clauses

𝑦! ∨ 𝑦" ∨ ¬𝑦# ∨ ¬𝑦$ 𝑦! = true or 𝑦$ = false

resample π‘Œ* from 𝜈*(β‹… |π‘Œ)\*) 𝐷: connected component containing 𝑀 at any time, any mark variable takes an almost uniform value. each clause contains β‰₯ 𝛽𝑙 marked variables; Pr[each clause is removed] ≳ 1 βˆ’ 1 2

5'

π’˜ π’˜

Start from a uniform random π‘Œ ∈ true,false +;

βˆ€π‘£ ∈ 𝑁, Pr π‘Œ6 = π‘ˆ =

, . , Pr π‘Œ6 = 𝐺 = , .

For each 𝑒 from 1 to π‘ˆ

  • Pick a marked variable 𝑀 ∈ 𝑁 u.a.r.;
  • Resample π‘Œ* ∼ 𝜈*(β‹… |π‘Œ+\*); by local uniformity Pr π‘Œ* = π‘ˆ β‰ˆ

, . , Pr π‘Œ* = 𝐺 β‰ˆ , .

Return π‘Œ;

slide-24
SLIDE 24

Key Property: w.h.p., the graph is deconstructed into small components of size 𝑃 𝑒𝑙 log 1

2

Our solution: try rejection sampling on 𝑀 and other unmarked variables in component 𝐷 by 𝑴𝑴𝑴, if 𝑙 β‰₯ 20 log 𝑒 + 20 log 𝑙 + 3 log ,

  • , then

Pr all clauses in 𝐷 are satis?ied | #𝐷 = 𝑃 𝑒𝑙 log π‘œ πœ— β‰₯ πœ— π‘œ

  • ;

try rejection sampling for 𝑆 = q 𝑃 π‘œ/πœ— - times, then we can draw π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*) w.h.p. remove satisfied clauses

𝑦! ∨ 𝑦" ∨ ¬𝑦# ∨ ¬𝑦$ 𝑦! = true, 𝑦$ = false

resample π‘Œ% from 𝜈%(β‹… |π‘Œ&\%) 𝐷: connected component containing 𝑀

π’˜ π’˜

Pr rejection sampling draw π‘Œ* ∼ 𝜈* β‹… π‘Œ)\* , namely t |𝐷| = 𝑃 𝑒𝑙 log π‘œ πœ—

  • ne of 𝑆 tires succeeds

β‰₯ 1 βˆ’ 2 πœ— π‘œ

7

slide-25
SLIDE 25

Key Property: w.h.p., the graph is deconstructed into small components of size 𝑃 𝑒𝑙 log 1

2

Our solution: try rejection sampling on 𝑀 and other unmarked variables in component 𝐷 by 𝑴𝑴𝑴, if 𝑙 β‰₯ 20 log 𝑒 + 20 log 𝑙 + 3 log ,

  • , then

Pr all clauses in 𝐷 are satis?ied | #𝐷 = 𝑃 𝑒𝑙 log π‘œ πœ— β‰₯ πœ— π‘œ

  • ;

try rejection sampling for 𝑆 = q 𝑃 π‘œ/πœ— - times, then we can draw π‘Œ* ∼ 𝜈*(β‹… |π‘Œ)\*) w.h.p. Lemma: Each transition step of the Glauber dynamics and the last step (i.e. sampling unmarked variables) can be implemented using rejection sampling Pr all π‘ˆ + 1 = 𝑃 π‘œ log π‘œ πœ— rejection samplings succeed β‰₯ 1 βˆ’ πœ— 3 . remove satisfied clauses

𝑦! ∨ 𝑦" ∨ ¬𝑦# ∨ ¬𝑦$ 𝑦! = true, 𝑦$ = false

resample π‘Œ% from 𝜈%(β‹… |π‘Œ&\%) 𝐷: connected component containing 𝑀

π’˜ π’˜

slide-26
SLIDE 26
  • 1. Run Morse-Tardos algorithm to construct a set of marked variables 𝑁 βŠ† π‘Š;
  • 2. Run Glauber dynamics on projected distribution 𝜈j for 𝑃 π‘œ log k

l steps to

draw approximate sample π‘Œ ∼ 𝜈j; (implemented using rejection sampling)

  • 3. Run rejection sampling to draw 𝑍 ∼ 𝜈%\j(β‹…βˆ£ π‘Œ);
  • 4. Return π‘Œ βˆͺ 𝑍.

Input: a 𝑙-CNF formula Ξ¦ = (π‘Š, 𝐹) with maximum degree 𝑒, an error bound πœ— > 0. Output: a random sample Οƒβˆˆ true, false ! s.t. 𝑒"! 𝜏, 𝜈 ≀ πœ—.

slide-27
SLIDE 27
  • Marking lemma: Pr MTβˆ’alg fails to ?ind 𝑁 in time 𝑃 π‘œπ‘’π‘™ log ,

2

≀ 2

7 .

  • Rapid mixing lemma: The π‘Œ returned by Glauber dynamics satisfies 𝑒34 π‘Œ, 𝜈) ≀ 2

7.

  • Rej. Sampling lemma: Pr[one of the (T+1) rejection samplings fails] ≀ 2

7

Correctness of the algorithm: 𝑒&% output, 𝜈 ≀ πœ—.

  • The running time is dominated by simulating Glauber dynamics for π‘ˆ = 𝑃 π‘œ log 1

2 steps;

  • Each step is implemented using rejection sampling for 𝑆 = q

𝑃

1 2

  • times.

Efficiency of the algorithm: running time = R 𝑃 𝑒F𝑙Hπœ—EKπ‘œIJK .

slide-28
SLIDE 28

Simulated annealing counting [Štefankovič et al. 2009]

Weighted CNF a CNF-formula Ξ¦ = (π‘Š, 𝐷) and parameter ΞΈ > 0.

  • for any π‘Œ ∈ π‘ˆ, 𝐺 ,, define the weight

π‘₯; π‘Œ = exp βˆ’πœ„πΊ(π‘Œ) , where 𝐺(π‘Œ) is the number of clauses NOT satisfied by π‘Œ.

  • induced Gibbs distribution

βˆ€π‘Œ ∈ π‘ˆ, 𝐺 ,: 𝜈; π‘Œ = π‘₯; π‘Œ π‘Ž(πœ„) , π‘Ž πœ„ = f

<∈ >,@ .

π‘₯; π‘Œ . Randomized approximate counting

  • Input: a 𝑙, 𝑒 βˆ’CNF instance Ξ¦ = (π‘Š, 𝐹), an error bound πœ— > 0.
  • Output: a random number i

π‘Ž, such that Pr 1 βˆ’ πœ— π‘Ž ≀ i π‘Ž ≀ 1 + πœ— π‘Ž β‰₯ 3 4 π‘Ž = the number of 𝑙-SAT solutions.

slide-29
SLIDE 29

Lemma: counting (proved by LLL[Haeupler, Saha, Srinivasan’ 11])

If 𝑙 β‰₯ log 𝑒 + 𝐷, it holds that π‘Ž πœ„ ∈ 1 Β± πœ— 2 π‘Ž, π‘₯β„Žπ‘“π‘ π‘“ πœ„ = 𝑃 log π‘œπ‘’ πœ— . π‘Ž πœ„ = f

<∈ >,@ .

π‘₯; π‘Œ = f

<∈ >,@ .

exp(βˆ’πœ„πΊ(π‘Œ)) Properties:

  • πœ„ = 0: π‘Ž 0 = 2A (easy to compute);
  • πœ„ β†’ ∞: lim

;β†’C π‘Ž πœ„ = π‘Ž = #𝑙-SAT solutions. (target of counting)

slide-30
SLIDE 30
  • Non-adaptive cooling schedule: define β„“ = 𝑃 π‘œπ‘’ log

!" #

parameters 0 = πœ„$ < πœ„% < β‹― < πœ„β„“ = 𝑃 log π‘œπ‘’ πœ— , where the adjacent parameters satisfies πœ„' βˆ’ πœ„'(% = %

"!.

  • Telescoping product: approximate π‘Ž = #𝑙-SAT solutions using

π‘Ž β‰ˆ π‘Ž πœ„β„“ = π‘Ž πœ„β„“ π‘Ž πœ„β„“(% Γ— π‘Ž πœ„β„“(% π‘Ž πœ„β„“() Γ— β‹―Γ— π‘Ž πœ„% π‘Ž πœ„$ Γ—2!

  • Estimate ratios: let π‘Œ ∼ 𝜈*DEF, define the random variable 𝑋

' as

𝑋

' =

π‘₯*D π‘Œ π‘₯*DEF(π‘Œ) , then 𝐹 𝑋

' = π‘Ž πœ„'

π‘Ž πœ„'(% . draw samples from 𝜈*G, 𝜈*F, … , 𝜈*β„“EF to estimate each ratio.

slide-31
SLIDE 31

Proof of the rapid mixing

Start from a uniform random 𝑍 ∈ true,false ); For each 𝑒 from 1 to π‘ˆ = 2π‘œ log 01

2

  • Pick a marked variable 𝑀 ∈ 𝑁 u.a.r.;
  • Resample 𝑍

* ∼ 𝜈*(β‹… |𝑍 )\*);

Return 𝑍;

Lemma: mixing

The 𝑍 = 𝑍

> returned by Glauber dynamics satisfies

𝑒>, 𝑍, 𝜈+ ≀ πœ— 3 .

T/F?

slide-32
SLIDE 32

Path coupling [Bubley and Dyer’ 97]

  • Let π‘Œ, 𝑍 ∈ true, false j be two

assignments disagree only at 𝑀G.

  • For each 𝑣 ∈ 𝑁, we bound the

influence on 𝑣 from 𝑀G 𝐽m = 𝑒&% 𝜈m β‹… π‘Œj\m , 𝜈m β‹… 𝑍

j\m

.

  • Path Coupling: if

)

m∈j\N8

𝐽m ≀ 1 2 , then Glauber dynamics is rapid mixing.

𝑀u 𝑣

Influence may percolate very far away through unmarked variables

slide-33
SLIDE 33

Disagreement percolation coupling [Moitra’17, Guo, et al.’18]

𝑀!

Couple unmarked variables and 𝒗 to generate π‘Œ, 𝑍 ∈ π‘ˆ, 𝐺 4 s.t. π‘Œ ∼ 𝜈 β‹… π‘Œ)\6 , 𝑍 ∼ 𝜈 β‹… 𝑍

)\6

𝐽6 = 𝑒34 𝜈6 β‹… π‘Œ)\6 , 𝜈6 β‹… 𝑍

)\6

≀ Pr Coupling π‘Œ6 β‰  𝑍

6 .

The coupling sketch

  • Let 𝐸 be the set of disagreements, initially, 𝐸 = {𝑀9}.
  • Coupling variables in a BFS order.
  • For each π‘₯, couple π‘Œ π‘₯ and 𝑍 π‘₯ optimally.
  • If π‘Œ π‘₯ = 𝑍(π‘₯), then remove all clauses satisfied by π‘₯;
  • If π‘Œ π‘₯ β‰  𝑍(π‘₯), then add 𝑣 into 𝐸.
  • Repeat until 𝐸 and ~

𝐸 are disconnected.

𝑣

slide-34
SLIDE 34

𝑀! 𝑣

Disagreement percolation coupling [Moitra’17, Guo, et al.’18]

Couple unmarked variables and 𝒗 to generate π‘Œ, 𝑍 ∈ π‘ˆ, 𝐺 4 s.t. π‘Œ ∼ 𝜈 β‹… π‘Œ)\6 , 𝑍 ∼ 𝜈 β‹… 𝑍

)\6

𝐽6 = 𝑒34 𝜈6 β‹… π‘Œ)\6 , 𝜈6 β‹… 𝑍

)\6

≀ Pr Coupling π‘Œ6 β‰  𝑍

6 .

The coupling sketch

  • Let 𝐸 be the set of disagreements, initially, 𝐸 = {𝑀9}.
  • Coupling variables in a BFS order.
  • For each π‘₯, couple π‘Œ π‘₯ and 𝑍 π‘₯ optimally.
  • If π‘Œ π‘₯ = 𝑍(π‘₯), then remove all clauses satisfied by π‘₯;
  • If π‘Œ π‘₯ β‰  𝑍(π‘₯), then add 𝑣 into 𝐸.
  • Repeat until 𝐸 and ~

𝐸 are disconnected.

slide-35
SLIDE 35

𝑀! 𝑣

Disagreement percolation coupling [Moitra’17, Guo, et al.’18]

The coupling sketch

  • Let 𝐸 be the set of disagreements, initially, 𝐸 = {𝑀9}.
  • Coupling variables in a BFS order.
  • For each π‘₯, couple π‘Œ π‘₯ and 𝑍 π‘₯ optimally.
  • If π‘Œ π‘₯ = 𝑍(π‘₯), then remove all clauses satisfied by π‘₯;
  • If π‘Œ π‘₯ β‰  𝑍(π‘₯), then add 𝑣 into 𝐸.
  • Repeat until 𝐸 and ~

𝐸 are disconnected. Couple unmarked variables and 𝒗 to generate π‘Œ, 𝑍 ∈ π‘ˆ, 𝐺 4 s.t. π‘Œ ∼ 𝜈 β‹… π‘Œ)\6 , 𝑍 ∼ 𝜈 β‹… 𝑍

)\6

𝐽6 = 𝑒34 𝜈6 β‹… π‘Œ)\6 , 𝜈6 β‹… 𝑍

)\6

≀ Pr Coupling π‘Œ6 β‰  𝑍

6 .

slide-36
SLIDE 36

𝑀! 𝑣

Pr π‘Œ π‘₯ = true = 1 2 Β± 1 poly(𝑒𝑙) Pr 𝑍 π‘₯ = true = 1 2 Β± 1 poly(𝑒𝑙)

π‘Œ π‘₯ = 𝑍(π‘₯) w.p. 1 βˆ’

,

poly(&')

LLL local uniformity coupling succeeds w.h.p.

each clause contains sufficiently many free variables

adaptive disagreement percolation coupling

Disagreement percolation coupling [Moitra’17, Guo, et al.’18]

The coupling sketch

  • Let 𝐸 be the set of disagreements, initially, 𝐸 = {𝑀9}.
  • Coupling variables in a BFS order.
  • For each π‘₯, couple π‘Œ π‘₯ and 𝑍 π‘₯ optimally.
  • If π‘Œ π‘₯ = 𝑍(π‘₯), then remove all clauses satisfied by π‘₯;
  • If π‘Œ π‘₯ β‰  𝑍(π‘₯), then add 𝑣 into 𝐸.
  • Repeat until 𝐸 and ~

𝐸 are disconnected. Couple unmarked variables and 𝒗 to generate π‘Œ, 𝑍 ∈ π‘ˆ, 𝐺 4 s.t. π‘Œ ∼ 𝜈 β‹… π‘Œ)\6 , 𝑍 ∼ 𝜈 β‹… 𝑍

)\6

𝐽6 = 𝑒34 𝜈6 β‹… π‘Œ)\6 , 𝜈6 β‹… 𝑍

)\6

≀ Pr Coupling π‘Œ6 β‰  𝑍

6 .

slide-37
SLIDE 37

with high probability, size of the disagreement set 𝐸 is small 𝐽6 ≀ Pr Couling π‘Œ6 β‰  𝑍

6 ≀

Pr Coupling [𝑣 βˆ‰ 𝐸] ≲ 1 poly 𝑒𝑙

&()*(*+,6)

f

I∈+\./

𝐽I ≀ 1 2

𝑀9 𝑣 𝐸 A path in power graph

Disagreement percolation coupling [Moitra’17, Guo, et al.’18]

The coupling sketch

  • Let 𝐸 be the set of disagreements, initially, 𝐸 = {𝑀9}.
  • Coupling variables in a BFS order.
  • For each π‘₯, couple π‘Œ π‘₯ and 𝑍 π‘₯ optimally.
  • If π‘Œ π‘₯ = 𝑍(π‘₯), then remove all clauses satisfied by π‘₯;
  • If π‘Œ π‘₯ β‰  𝑍(π‘₯), then add 𝑣 into 𝐸.
  • Repeat until 𝐸 and ~

𝐸 are disconnected. Couple unmarked variables and 𝒗 to generate π‘Œ, 𝑍 ∈ π‘ˆ, 𝐺 4 s.t. π‘Œ ∼ 𝜈 β‹… π‘Œ)\6 , 𝑍 ∼ 𝜈 β‹… 𝑍

)\6

𝐽6 = 𝑒34 𝜈6 β‹… π‘Œ)\6 , 𝜈6 β‹… 𝑍

)\6

≀ Pr Coupling π‘Œ6 β‰  𝑍

6 .

slide-38
SLIDE 38

marking variables at least 𝛽𝑙 marked variables at least 𝛾𝑙 unmarked variables local uniformity small component rapid mixing 𝑃(π‘œ log π‘œ) steps

  • rej. sampling

q 𝑃(π‘œ-) times q 𝑃 π‘œ-;, running time

LLL LLL

path coupling

LLL

sampling algorithm counting algorithm

non-adaptive simulated annealing

LLL

Projection

slide-39
SLIDE 39

Open problems

  • Sampling & counting 𝑙-SAT solutions when 𝑙 ≳ 2 log 𝑒.
  • Extend the technique to more general distributions, e.g. hyper-graph coloring.

Summary

  • A close to linear time algorithm for sampling 𝑙-SAT solutions in LLL regime.
  • A close to quadratic time algorithm for counting 𝑙-SAT solutions in LLL regime.
  • Projection + LLL technique to bypass the connectivity barrier of MCMC method.

Thank you!