Relating Proof Complexity Measures and Practical Hardness of SAT


slide-1
SLIDE 1

Relating Proof Complexity Measures and Practical Hardness of SAT

Jakob Nordström

KTH Royal Institute of Technology, Stockholm, Sweden

18th International Conference on Principles and Practice of Constraint Programming
Québec City, Canada
October 8–12, 2012

Joint work with Matti Järvisalo, Arie Matsliah, and Stanislav Živný

Jakob Nordström (KTH) Proof Complexity and Practical Hardness of SAT CP ’12 1 / 19

slide-2
SLIDE 2

Proof Complexity and SAT Solving

Proof complexity
• Satisfiability: fundamental problem in theoretical computer science
• SAT proven NP-complete by Stephen Cook in 1971
• Hence totally intractable in worst case (probably)
• One of the million dollar “Millennium Problems” (P vs. NP)

SAT solving
• Enormous progress in performance last 10-15 years
• State-of-the-art solvers can deal with real-world instances with millions of variables
• But best solvers still based on methods from early 1960s
• Tiny formulas known that are totally beyond reach

What makes formulas hard or easy in practice for SAT solvers?
What (if anything) can proof complexity say about this?


slide-6
SLIDE 6

SAT solving and Proof Complexity Resolution

Resolution Proof System

Refute unsatisfiable formulas in conjunctive normal form (CNF):
(x ∨ z) ∧ (y ∨ ¬z) ∧ (x ∨ ¬y ∨ u) ∧ (¬y ∨ ¬u) ∧ (u ∨ v) ∧ (¬x ∨ ¬v) ∧ (¬u ∨ w) ∧ (¬x ∨ ¬u ∨ ¬w)

Resolution rule: from B ∨ x and C ∨ ¬x, derive B ∨ C

Observation

If F is a satisfiable CNF formula and D is derived from clauses C1, C2 ∈ F by the resolution rule, then F ∧ D is satisfiable.

So prove a CNF formula unsatisfiable by deriving a contradiction (the empty clause ⊥) by resolution
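The resolution rule and the refutation idea can be sketched in a few lines of Python. This is only an illustration, not a solver from the talk: clauses are sets of signed integers, the mapping x=1, y=2, z=3, u=4, v=5, w=6 is an assumption, and the literal polarities of the example formula are assumed to be as written in the code comment. The function names are hypothetical.

```python
from itertools import combinations

def resolve(c1, c2, lit):
    """Resolution rule: from c1 containing lit and c2 containing -lit, derive the resolvent."""
    assert lit in c1 and -lit in c2
    return frozenset((c1 - {lit}) | (c2 - {-lit}))

def is_tautology(clause):
    """A clause containing both a literal and its negation is trivially true."""
    return any(-lit in clause for lit in clause)

def refute_by_saturation(clauses):
    """Naive saturation: return True iff the empty clause is derivable by resolution."""
    known = {frozenset(c) for c in clauses}
    if frozenset() in known:
        return True
    while True:
        new = set()
        for a, b in combinations(known, 2):
            for lit in a:
                if -lit in b:
                    r = resolve(a, b, lit)
                    if not r:
                        return True          # derived the empty clause
                    if not is_tautology(r):
                        new.add(r)
        if new <= known:
            return False                     # saturated without contradiction
        known |= new

# Assumed encoding: x=1, y=2, z=3, u=4, v=5, w=6.
# (x v z)(y v -z)(x v -y v u)(-y v -u)(u v v)(-x v -v)(-u v w)(-x v -u v -w)
EXAMPLE = [[1, 3], [2, -3], [1, -2, 4], [-2, -4],
           [4, 5], [-1, -5], [-4, 6], [-1, -4, -6]]
```

Saturation is exponential in general and is used here only because the example is tiny; real solvers search for a refutation far more selectively.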



slide-13
SLIDE 13

SAT solving and Proof Complexity Resolution

CDCL Solvers Generate Resolution Proofs

Simple example for DPLL:

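[Figure: DPLL decision tree for the example formula. The leaves are the falsified clauses x ∨ z, y ∨ ¬z, x ∨ ¬y ∨ u, ¬y ∨ ¬u, u ∨ v, ¬x ∨ ¬v, ¬u ∨ w, ¬x ∨ ¬u ∨ ¬w; the internal nodes are labelled with the resolvents x ∨ y, x ∨ ¬y, ¬x ∨ u, ¬x ∨ ¬u, then x and ¬x, and the root with ⊥]

• Conflict-driven clause learning adds “shortcut edges” in tree
• But still yields resolution proof
• True also for (most) preprocessing techniques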
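How a DPLL search tree turns into a resolution refutation can be sketched as follows. This is a simplified illustration under stated assumptions: plain DPLL with a fixed variable order, no unit propagation and no clause learning; the signed-integer encoding (x=1, y=2, z=3, u=4, v=5, w=6), the assumed literal polarities, and all function names are hypothetical. Each call returns a clause falsified by the current partial assignment, and a decision node resolves the clauses returned by its two branches.

```python
def falsified(clause, assignment):
    """True iff every literal of the clause is set to false by the assignment."""
    return all(assignment.get(abs(l)) == (l < 0) for l in clause)

def dpll_refute(clauses, variables, assignment=None, proof=None):
    """Return (clause, proof): clause is None if satisfiable below this node,
    otherwise a clause falsified by `assignment`; proof lists the resolvents."""
    assignment = {} if assignment is None else assignment
    proof = [] if proof is None else proof
    for c in clauses:
        if falsified(c, assignment):
            return frozenset(c), proof           # leaf: an input clause
    free = [v for v in variables if v not in assignment]
    if not free:
        return None, proof                       # total assignment satisfies all
    x = free[0]
    c_false, _ = dpll_refute(clauses, variables, {**assignment, x: False}, proof)
    if c_false is None:
        return None, proof
    if x not in c_false:
        return c_false, proof                    # branch on x was not needed
    c_true, _ = dpll_refute(clauses, variables, {**assignment, x: True}, proof)
    if c_true is None:
        return None, proof
    if -x not in c_true:
        return c_true, proof
    resolvent = (c_false - {x}) | (c_true - {-x})  # resolution rule on x
    proof.append(resolvent)
    return resolvent, proof

# Assumed encoding: x=1, y=2, z=3, u=4, v=5, w=6.
EXAMPLE = [[1, 3], [2, -3], [1, -2, 4], [-2, -4],
           [4, 5], [-1, -5], [-4, 6], [-1, -4, -6]]
root_clause, resolvents = dpll_refute(EXAMPLE, range(1, 7))
```

For an unsatisfiable input the clause returned at the root is falsified by the empty assignment, so it must be the empty clause; the recorded resolvents form a tree-like refutation.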


slide-15
SLIDE 15

SAT solving and Proof Complexity Proof Complexity Measures

Complexity Measures for Resolution

Let n = size of formula

Length

# clauses in refutation — at most exp(n)

Width

Size of largest clause in refutation — at most n

Space

Max # clauses one needs to remember when “verifying correctness of refutation on blackboard” — at most n (!)
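Length and width are easy to read off from a refutation written down as a list of clauses. The sketch below does this for one possible refutation of the small example formula; the signed-integer encoding (x=1, y=2, z=3, u=4, v=5, w=6), the assumed literal polarities, and the function names are illustrative assumptions. Space is not computed here, since it depends on the order in which clauses are kept in and erased from memory, not only on the set of clauses.

```python
def proof_length(refutation):
    """Length: number of clauses in the refutation (axioms and derived clauses)."""
    return len(refutation)

def proof_width(refutation):
    """Width: number of literals in the largest clause of the refutation."""
    return max(len(clause) for clause in refutation)

# Assumed encoding: x=1, y=2, z=3, u=4, v=5, w=6.
AXIOMS = [[1, 3], [2, -3], [1, -2, 4], [-2, -4],
          [4, 5], [-1, -5], [-4, 6], [-1, -4, -6]]
# Resolvents in derivation order, ending in the empty clause.
RESOLVENTS = [[1, 2], [1, -2], [1], [-1, 4], [-1, -4], [-1], []]
REFUTATION = AXIOMS + RESOLVENTS
```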


slide-16
SLIDE 16

SAT solving and Proof Complexity Proof Complexity Measures

Length

• Clearly lower bound on running time for any CDCL algorithm
• But if there is a short refutation, not clear how to find it
• In fact, probably intractable [Alekhnovich & Razborov ’01]
• So small length upper bound might be much too optimistic
• Not the right measure of “hardness in practice”


slide-21
SLIDE 21

SAT solving and Proof Complexity Proof Complexity Measures

Length vs. Width

• Searching for small width refutations known heuristic in AI community
• Small width ⇒ small length (by counting)
• But small length does not necessarily imply small width: can have √n width and linear length [Bonet & Galesi ’99]
• So width stricter hardness measure than length
• Small width ⇒ CDCL solver will provably be fast [Atserias, Fichte & Thurley ’09] (but slightly idealized theoretical model)
• Right hardness measure?
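The “by counting” step can be made concrete. A refutation of width w that never repeats a clause cannot be longer than the number of distinct clauses with at most w literals over n variables, which is at most (2n+1)^w. The helper below is a hypothetical illustration of that count, not code from the talk.

```python
from math import comb

def max_clauses_of_width(n, w):
    """Number of distinct non-tautological clauses with at most w literals
    over n variables: choose k variables, then a sign for each of them."""
    return sum(comb(n, k) * 2**k for k in range(w + 1))

def counting_bound(n, w):
    """Cruder closed-form upper bound (2n + 1)^w on the same quantity."""
    return (2 * n + 1) ** w
```

For fixed w both quantities are polynomial in n, which is why small width implies small length.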


slide-27
SLIDE 27

SAT solving and Proof Complexity Proof Complexity Measures

Width vs. Space

• In practice, memory consumption is a very important bottleneck for SAT solvers
• So maybe space complexity can be relevant hardness measure?
• Space ≥ width [Atserias & Dalmau ’03]
• But small width does not say anything about space [N. ’06], [N. & Håstad ’08], [Ben-Sasson & N. ’08]
• So space stricter hardness measure than width (but space model even more idealized)


slide-32
SLIDE 32

SAT solving and Proof Complexity Proof Complexity Measures

Space vs. Tree-like Space

Tree-like resolution:
• Only use each clause once
• Have to rederive from scratch if needed again

Tree-like space: usual space measure but restricted to such proofs

• Proposed as practical measure of hardness of SAT instances in [Ansótegui, Bonet, Levy & Manyà ’08]
• Clearly tree-like space ≥ space, but not known to be different
• This work can be viewed as implementing program outlined in [ABLM08]


slide-37
SLIDE 37

SAT solving and Proof Complexity Our Results

Result 1: Separation of Space and Tree-like Space

• We don’t believe in tree-like space as hardness measure
  ◮ Tree-like space tightly connected with tree-like length
  ◮ Corresponds to DPLL without clause learning
  ◮ Would suggest CDCL doesn’t buy you anything
• We prove first asymptotic separation of space and tree-like space

Theorem

There are formulas refutable in space O(1) for which tree-like space grows like Ω(log n)

Only constant-factor separation known before [Esteban & Torán ’03]


slide-39
SLIDE 39

SAT solving and Proof Complexity Our Results

Result 2: Small Backdoor Sets Imply Small Space

• Backdoor sets: practically motivated hardness measure
• First studied in [Williams, Gomes & Selman ’03]
• Real-world SAT instances often have small backdoors
• We show connections between backdoors and space complexity (elaborating on [ABLM08])

Theorem (Informal)

If a formula has a small backdoor set, then it can be refuted in small space


slide-41
SLIDE 41

SAT solving and Proof Complexity Our Results

Result 3: Hardness in Practice Correlates with Space

• Recall log length ≤ width ≤ space ≤ tree-like space
• Width and space seem like most promising hardness candidates
• Run experiments on formulas with fixed complexity w.r.t. width (and length) but varying space (*)
• Is running time essentially the same? Or does it increase with increasing space?

Experimental results

Running times seem to correlate with space complexity (**)

(*) But such formulas are nontrivial to find
(**) With some caveats to be discussed later


slide-46
SLIDE 46

Experiments Benchmark Formulas

Use Pebbling Formulas. . .

CNF formulas encoding so-called pebble games on DAGs

1. u
2. v
3. w
4. ¬u ∨ ¬v ∨ x
5. ¬v ∨ ¬w ∨ y
6. ¬x ∨ ¬y ∨ z
7. ¬z

[Figure: pyramid DAG with sources u, v, w; x has predecessors u and v, y has predecessors v and w, and the sink z has predecessors x and y]

• sources are true (clauses 1–3)
• truth propagates upwards (clauses 4–6)
• but sink is false (clause 7)

• Extensive literature on pebbling time-space trade-offs from 1970s and 80s
• Pebbling formulas studied by [Bonet et al. ’98, Raz & McKenzie ’99, Ben-Sasson & Wigderson ’99] and others
• Hope that pebbling properties of DAG somehow carry over to resolution refutations of pebbling formulas.

Except. . .
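The encoding on this slide can be generated mechanically from a DAG. The sketch below is an illustration under assumptions: the graph is given as a dictionary from each vertex to its list of predecessors, literals are strings with `~` marking negation, and the function name is hypothetical.

```python
def pebbling_formula(preds, sink):
    """Build the pebbling formula of a DAG.

    preds maps every vertex to the list of its predecessors (empty for sources).
    Returns a list of clauses; a literal is "v" or "~v".
    """
    clauses = []
    for vertex, parents in preds.items():
        # Sources give unit clauses; otherwise: all predecessors true implies vertex true.
        clauses.append([f"~{p}" for p in parents] + [vertex])
    clauses.append([f"~{sink}"])  # the sink is false
    return clauses

# The pyramid of height 2 from the slide.
PYRAMID = {"u": [], "v": [], "w": [],
           "x": ["u", "v"], "y": ["v", "w"], "z": ["x", "y"]}
FORMULA = pebbling_formula(PYRAMID, "z")
```

With the vertices listed in this order the output reproduces clauses 1 to 7 above.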


slide-52
SLIDE 52

Experiments Benchmark Formulas

. . . with Functions Substituted for Variables

Won’t work: pebbling formulas are solved by unit propagation, so supereasy

Make formula harder by substituting x1 ⊕ x2 for every variable x (also works for other Boolean functions with “right” properties):

¬x ∨ y
⇓
¬(x1 ⊕ x2) ∨ (y1 ⊕ y2)
⇓
(x1 ∨ ¬x2 ∨ y1 ∨ y2) ∧ (x1 ∨ ¬x2 ∨ ¬y1 ∨ ¬y2) ∧ (¬x1 ∨ x2 ∨ y1 ∨ y2) ∧ (¬x1 ∨ x2 ∨ ¬y1 ∨ ¬y2)

Now CNF formula inherits pebbling graph properties!
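The XOR substitution can be sketched for a single clause as follows. The representation is an assumption made for illustration: literals are strings with `~` for negation, the two fresh variables for x are named x1 and x2 (so the original variable names must not already end in such digits), and the function names are hypothetical. Each literal is replaced by the two-clause CNF of x1 ⊕ x2 or of its negation, and the disjunction is expanded by picking one clause from each factor.

```python
from itertools import product

def xor_cnf(var, positive):
    """CNF of var1 XOR var2 (positive literal) or of its negation (negative literal)."""
    a, b = f"{var}1", f"{var}2"
    if positive:
        return [[a, b], [f"~{a}", f"~{b}"]]
    return [[a, f"~{b}"], [f"~{a}", b]]

def substitute_xor(clause):
    """Expand a clause after substituting x1 XOR x2 for every variable x."""
    parts = [xor_cnf(lit.lstrip("~"), not lit.startswith("~")) for lit in clause]
    # A disjunction of CNFs is the conjunction of all ways to pick one clause per CNF.
    return [sum(choice, []) for choice in product(*parts)]

EXPANDED = substitute_xor(["~x", "y"])
```

A clause with k literals becomes 2^k clauses of width 2k, so the blow-up is constant for pebbling formulas over graphs of bounded indegree.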


slide-53
SLIDE 53

Experiments Set-up

About the Experiments

• 12 graph families with varying space complexity
• 8 different substitution functions
• Total of 96 formula families with around 50 instances per family
• CDCL solvers Minisat 2.2.0 and Lingeling version 774
• Experiments
  ◮ with and without preprocessing
  ◮ with and without random shuffling of clauses and variables
• Intel Core i5-2500 3.3-GHz quad-core CPU with 8 GB of memory
• Time-out 1 hour per instance
• Massive amounts of data. . .


slide-54
SLIDE 54

Experiments Results

Example Results Without Preprocessing

[Figure: two plots of Time [s] against Number of variables, both axes on a log scale, titled “Minisat (no prepro.), or_3” and “Lingeling (no prepro.), eq_3”; each shows curves for the graph families gtb, pyramid, pyrseqsqrt, bintree, pyr1seq]

Looks nice. . .
Easy formulas solved fast and hard formulas take longer time


slide-55
SLIDE 55

Experiments Results

Example Results with Preprocessing

[Figure: two plots of Time [s] against Number of variables, both axes logarithmic. Left: Minisat, or_3. Right: Lingeling, eq_3. Graph families plotted: gtb, pyramid, pyrseqsqrt, bintree, pyr1seq.]

Less nice... which is not surprising


slide-56
SLIDE 56

Experiments Results

Caveats and Issues

Preprocessing dampens correlations
◮ To be expected: space of proof not captured during preprocessing
◮ By construction formulas amenable to preprocessing

Artificial benchmarks
◮ True, but the only formulas where we know how to control space
◮ In general, computing space complexity probably PSPACE-complete

Theory vs. practice
◮ In theory all substitution functions equal; not so in practice
◮ In theory graph pebbling space all that matters, but many source vertices make binary tree formulas “too easy”

Varying width and space independently would be more convincing
◮ Very true, but provably impossible since space ≥ width
◮ Want to see if space is “more fine-grained” hardness indicator



slide-60
SLIDE 60

Summing up

Modern CDCL SAT solvers amazingly successful in practice
But poorly understood which formulas are easy or hard
We propose space complexity as a measure of hardness in practice
Don’t claim conclusive evidence, but nontrivial correlations
Would like to get similar results also with preprocessing
Would like to study if theoretical time-space trade-offs show up in practice
Believe there are more connections between proof complexity and SAT solving worth exploring

Thank you for your attention!
