The Complexity and Generality of Learning Answer Set Programs (AIJ - - PowerPoint PPT Presentation

the complexity and generality of learning answer set
SMART_READER_LITE
LIVE PREVIEW

The Complexity and Generality of Learning Answer Set Programs (AIJ - - PowerPoint PPT Presentation

The Complexity and Generality of Learning Answer Set Programs (AIJ 2018) Mark Law, Alessandra Russo and Krysia Broda September 2, 2018 1/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set


slide-1
SLIDE 1

The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

Mark Law, Alessandra Russo and Krysia Broda

September 2, 2018

1/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-2
SLIDE 2

ILP under the Answer Set Semantics

◮ Several ILP frameworks have been proposed to learn ASP:

◮ In ILPb (resp ILPc) at least one (resp every) answer set of B ∪ H

must cover the (atom) examples.

◮ In ILPLAS examples are partial interpretations and a combination of

ILPb and ILPc can be expressed.

2/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-3
SLIDE 3

ILP under the Answer Set Semantics

◮ Several ILP frameworks have been proposed to learn ASP:

◮ In ILPb (resp ILPc) at least one (resp every) answer set of B ∪ H

must cover the (atom) examples.

◮ In ILPLAS examples are partial interpretations and a combination of

ILPb and ILPc can be expressed.

◮ This paper asks two fundamental questions:

◮ What class of ASP programs can each framework learn? ◮ Is there any (complexity) price paid by the more general frameworks? 2/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-4
SLIDE 4

ILP under the Answer Set Semantics

◮ Several ILP frameworks have been proposed to learn ASP:

◮ In ILPb (resp ILPc) at least one (resp every) answer set of B ∪ H

must cover the (atom) examples.

◮ In ILPLAS examples are partial interpretations and a combination of

ILPb and ILPc can be expressed.

◮ This paper asks two fundamental questions:

◮ What class of ASP programs can each framework learn? ◮ Is there any (complexity) price paid by the more general frameworks?

◮ In the paper we also consider ILPsm, ILPLOAS and ILPcontext

LOAS .

2/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-5
SLIDE 5

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-6
SLIDE 6

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-7
SLIDE 7

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

Let B = ∅, H1 = {p.} and H2 = 0{p}1.

  • .

◮ B, H1, H2 is not in D1

1(ILPb). 3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-8
SLIDE 8

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

Let B = ∅, H1 = {p.} and H2 = 0{p}1.

  • .

◮ B, H1, H2 is not in D1

1(ILPb).

E + = {p} E − = ∅ H2 ∈ ILPb(B, {p}, ∅).

3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-9
SLIDE 9

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

Let B = ∅, H1 = {p.} and H2 = 0{p}1.

  • .

◮ B, H1, H2 is not in D1

1(ILPb).

◮ B, H2, H1 is in D1

1(ILPb). 3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-10
SLIDE 10

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

Let B = ∅, H1 = {p.} and H2 = 0{p}1.

  • .

◮ B, H1, H2 is not in D1

1(ILPb).

◮ B, H2, H1 is in D1

1(ILPb).

E + = ∅ E − = {p} H2 ∈ ILPb(B, ∅, {p}) but H1 ∈ ILPb(B, ∅, {p}).

3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-11
SLIDE 11

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

Let B = ∅, H1 = {p.} and H2 = 0{p}1.

  • .

◮ B, H1, H2 is not in D1

1(ILPb).

◮ B, H2, H1 is in D1

1(ILPb).

◮ B, H1, H2 is in D1

1(ILPc). 3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-12
SLIDE 12

One-to-one Distinguishability

Definition 1

A learning framework F can distinguish H1 from H2 wrt B iff there is at least one task TF = B, EF such that H1 ∈ F(TF) and H2 ∈ F(TF).

◮ D1

1(F) is the set of tuples B, H1, H2 such that F can distinguish

H1 from H2 wrt B.

Let B = ∅, H1 = {p.} and H2 = 0{p}1.

  • .

◮ B, H1, H2 is not in D1

1(ILPb).

◮ B, H2, H1 is in D1

1(ILPb).

◮ B, H1, H2 is in D1

1(ILPc).

E + = {p} E − = ∅ H1 ∈ ILPc(B, ∅, {p}) but H2 ∈ ILPc(B, ∅, {p}).

3/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-13
SLIDE 13

One-to-one Distinguishability Conditions

Framework F Sufficient/necessary condition for B, H1, H2 to be in D1

1(F)

ILPb AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPsm AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPc AS(B ∪ H1) = ∅ ∧ (AS(B ∪ H2) = ∅ ∨ (Ec(B ∪ H1) ⊆ Ec(B ∪ H2))) ILPLAS AS(B ∪ H1) = AS(B ∪ H2) ILPLOAS (AS(B ∪ H1) = AS(B ∪ H2)) ∨ (ord(B ∪ H1) = ord(B ∪ H2)) ILPcontext

LOAS

(B ∪ H1 ≡s B ∪ H2)∨ (∃C ∈ ASPch st ord(B ∪ H1 ∪ C) = ord(B ∪ H2 ∪ C))

4/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-14
SLIDE 14

One-to-one Distinguishability Conditions

Framework F Sufficient/necessary condition for B, H1, H2 to be in D1

1(F)

ILPb AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPsm AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPc AS(B ∪ H1) = ∅ ∧ (AS(B ∪ H2) = ∅ ∨ (Ec(B ∪ H1) ⊆ Ec(B ∪ H2))) ILPLAS AS(B ∪ H1) = AS(B ∪ H2) ILPLOAS (AS(B ∪ H1) = AS(B ∪ H2)) ∨ (ord(B ∪ H1) = ord(B ∪ H2)) ILPcontext

LOAS

(B ∪ H1 ≡s B ∪ H2)∨ (∃C ∈ ASPch st ord(B ∪ H1 ∪ C) = ord(B ∪ H2 ∪ C))

◮ Neither ILPb of ILPsm can distinguish H ∪ C from H for any constraint C

and any H – in practice, neither ILPb nor ILPsm can learn constraints.

4/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-15
SLIDE 15

One-to-one Distinguishability Conditions

Framework F Sufficient/necessary condition for B, H1, H2 to be in D1

1(F)

ILPb AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPsm AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPc AS(B ∪ H1) = ∅ ∧ (AS(B ∪ H2) = ∅ ∨ (Ec(B ∪ H1) ⊆ Ec(B ∪ H2))) ILPLAS AS(B ∪ H1) = AS(B ∪ H2) ILPLOAS (AS(B ∪ H1) = AS(B ∪ H2)) ∨ (ord(B ∪ H1) = ord(B ∪ H2)) ILPcontext

LOAS

(B ∪ H1 ≡s B ∪ H2)∨ (∃C ∈ ASPch st ord(B ∪ H1 ∪ C) = ord(B ∪ H2 ∪ C))

◮ ILPLAS can distinguish any two hypotheses, so long as they have different

answer sets (when combined with B).

4/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-16
SLIDE 16

One-to-one Distinguishability Conditions

Framework F Sufficient/necessary condition for B, H1, H2 to be in D1

1(F)

ILPb AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPsm AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPc AS(B ∪ H1) = ∅ ∧ (AS(B ∪ H2) = ∅ ∨ (Ec(B ∪ H1) ⊆ Ec(B ∪ H2))) ILPLAS AS(B ∪ H1) = AS(B ∪ H2) ILPLOAS (AS(B ∪ H1) = AS(B ∪ H2)) ∨ (ord(B ∪ H1) = ord(B ∪ H2)) ILPcontext

LOAS

(B ∪ H1 ≡s B ∪ H2)∨ (∃C ∈ ASPch st ord(B ∪ H1 ∪ C) = ord(B ∪ H2 ∪ C))

◮ ILPcontext

LOAS

can distinguish any two hypotheses, so long as they are not strongly equivalent (when combined with B).

4/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-17
SLIDE 17

One-to-one Distinguishability Conditions

Framework F Sufficient/necessary condition for B, H1, H2 to be in D1

1(F)

ILPb AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPsm AS(B ∪ H1) ⊆ AS(B ∪ H2) ILPc AS(B ∪ H1) = ∅ ∧ (AS(B ∪ H2) = ∅ ∨ (Ec(B ∪ H1) ⊆ Ec(B ∪ H2))) ILPLAS AS(B ∪ H1) = AS(B ∪ H2) ILPLOAS (AS(B ∪ H1) = AS(B ∪ H2)) ∨ (ord(B ∪ H1) = ord(B ∪ H2)) ILPcontext

LOAS

(B ∪ H1 ≡s B ∪ H2)∨ (∃C ∈ ASPch st ord(B ∪ H1 ∪ C) = ord(B ∪ H2 ∪ C))

D1

1(ILPb) = D1 1(ILPsm) ⊂ D1 1(ILPLAS) ⊂ D1 1(ILPLOAS) ⊂ D1 1(ILPcontext LOAS )

D1

1(ILPc) ⊂ D1 1(ILPLAS) 4/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-18
SLIDE 18

One-to-many Distinguishability

Definition 2

For a framework F, D1

m(F) is the set of tuples B, H, {H1, . . . , Hn} st

there is a task TF which distinguishes H from each Hi with respect to B.

5/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-19
SLIDE 19

One-to-many Distinguishability

Definition 2

For a framework F, D1

m(F) is the set of tuples B, H, {H1, . . . , Hn} st

there is a task TF which distinguishes H from each Hi with respect to B. Let B = ∅, H = {1{heads, tails}1.}, H′

1 = {heads.}, H′ 2 = {tails.}

◮ B, H, H′

1 ∈ D1 1(ILPb) and B, H, H′ 2 ∈ D1 1(ILPb)

5/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-20
SLIDE 20

One-to-many Distinguishability

Definition 2

For a framework F, D1

m(F) is the set of tuples B, H, {H1, . . . , Hn} st

there is a task TF which distinguishes H from each Hi with respect to B. Let B = ∅, H = {1{heads, tails}1.}, H′

1 = {heads.}, H′ 2 = {tails.}

◮ B, H, H′

1 ∈ D1 1(ILPb) and B, H, H′ 2 ∈ D1 1(ILPb)

◮ B, H, {H′

1, H′ 2} ∈ D1 m(ILPb)

5/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-21
SLIDE 21

One-to-many Distinguishability

Definition 2

For a framework F, D1

m(F) is the set of tuples B, H, {H1, . . . , Hn} st

there is a task TF which distinguishes H from each Hi with respect to B. Let B = ∅, H = {1{heads, tails}1.}, H′

1 = {heads.}, H′ 2 = {tails.}

◮ B, H, H′

1 ∈ D1 1(ILPb) and B, H, H′ 2 ∈ D1 1(ILPb)

◮ B, H, {H′

1, H′ 2} ∈ D1 m(ILPb)

◮ B, H, {H′

1, H′ 2} ∈ D1 m(ILPsm)

5/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-22
SLIDE 22

One-to-many Distinguishability

Definition 2

For a framework F, D1

m(F) is the set of tuples B, H, {H1, . . . , Hn} st

there is a task TF which distinguishes H from each Hi with respect to B.

D1

m(ILPb) ⊂ D1 m(ILPsm) ⊂ D1 m(ILPLAS) ⊂ D1 m(ILPLOAS) ⊂ D1 m(ILPcontext LOAS )

D1

m(ILPc) ⊂ D1 m(ILPLAS) 5/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-23
SLIDE 23

Many-to-many Distinguishability

Definition 3

For a framework F, Dm

m(F) is the set of tuples B, S1, S2, st there is a

task TF with background B, st S1 ⊆ ILPF(TF) and S2 ∩ ILPF(TF) = ∅.

6/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-24
SLIDE 24

Many-to-many Distinguishability

Definition 3

For a framework F, Dm

m(F) is the set of tuples B, S1, S2, st there is a

task TF with background B, st S1 ⊆ ILPF(TF) and S2 ∩ ILPF(TF) = ∅.

Dm

m(ILPb) ⊂ Dm m(ILPsm) ⊂ Dm m(ILPLAS) ⊂ Dm m(ILPLOAS) ⊂ Dm m(ILPcontext LOAS )

Dm

m(ILPc) ⊂ Dm m(ILPLAS) 6/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-25
SLIDE 25

Complexity

Framework Verification Satisfiablity ILPb NP-complete NP-complete ILPsm NP-complete NP-complete ILPc DP-complete ΣP

2 -complete

ILPLAS DP-complete ΣP

2 -complete

ILPLOAS DP-complete ΣP

2 -complete

ILPcontext

LOAS

DP-complete ΣP

2 -complete 7/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-26
SLIDE 26

Complexity

Framework Verification Satisfiablity ILPb NP-complete NP-complete ILPsm NP-complete NP-complete ILPc DP-complete ΣP

2 -complete

ILPLAS DP-complete ΣP

2 -complete

ILPLOAS DP-complete ΣP

2 -complete

ILPcontext

LOAS

DP-complete ΣP

2 -complete

ILPnoise

LOAS

DP-complete ΣP

2 -complete 7/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-27
SLIDE 27

Conclusion

◮ We have introduced three new measures of the generality of a

learning framework.

◮ For each of the three measures:

D(ILPb) ⊆ D(ILPsm) ⊂ D(ILPLAS) ⊂ D(ILPLOAS) ⊂ D(ILPcontext

LOAS )

D(ILPc) ⊂ D(ILPLAS)

◮ There is no price to be paid (in terms of complexity) for the

gain in generality of ILPcontext

LOAS

  • ver ILPc.

◮ ILPb and ILPsm are of lower complexity, but are less general

than ILPLAS.

8/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-28
SLIDE 28

Backup Slides

9/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-29
SLIDE 29

One-to-many Distinguishability

◮ In the paper, we proved that if for any two F tasks T1, T2 there is a task

T3 such that ILPF(T3) = ILPF(T1) ∩ ILPF(T2) then: D1

m(F) =

  B, H, {H1, . . . , Hn}

  • B, H, H1 ∈ D1

1(F),

. . . , B, H, Hn ∈ D1

1(F)

  .

◮ In ILPLAS, T3 can be constructed as B, E +

1 ∪ E + 2 , E − 1 ∪ E − 2 .

◮ This property holds for every framework (in the paper) other than ILPb.

D1

m(ILPb) ⊂ D1 m(ILPsm) ⊂ D1 m(ILPLAS) ⊂ D1 m(ILPLOAS) ⊂ D1 m(ILPcontext LOAS )

D1

m(ILPc) ⊂ D1 m(ILPLAS) 10/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-30
SLIDE 30

Brave Induction cannot learn constraints

◮ Let H be a hypothesis and C be a constraint. ◮ For any T = B, E +, E − st H ∪ C ∈ ILPb(T), there is an

A ∈ AS(B ∪ H ∪ C) st E + ⊆ A and E − ∩ A = ∅. Any such A is also an answer set of B ∪ H.

◮ Hence ILPb cannot distinguish H ∪ C from H (wrt any

background knowledge).

◮ In practice this means that ILPb cannot learn constraints.

11/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-31
SLIDE 31

Other notion of generality

◮ (De Raedt 1997) defined generality in terms of reductions. F1 is said to

be more general than F2 iff F2 →r F1 and F1 →r F2.

◮ These reductions allowed the background knowledge B to be modified in

the reduction, whereas distinguishability does not.

◮ In the paper we define strong reductions which force the background

knowledge to be the same and show that F1 →sr F2 if and only if Dm

m(F1) ⊆ Dm m(F2).

◮ Other than the restriction on the background knowledge,

distinguishability also allows for fine grained comparisons of frameworks which are incomparable under reductions and strong reductions.

12/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)

slide-32
SLIDE 32

De Raedt, L. 1997. Logical settings for concept-learning. Artificial Intelligence 95, 1, 187–201. 8/8 Mark Law, Alessandra Russo and Krysia Broda The Complexity and Generality of Learning Answer Set Programs (AIJ 2018)