Session 3: Conditional Constraints for KG Embeddings Michael Weyns, - - PowerPoint PPT Presentation

session 3 conditional constraints for kg embeddings
SMART_READER_LITE
LIVE PREVIEW

Session 3: Conditional Constraints for KG Embeddings Michael Weyns, - - PowerPoint PPT Presentation

Session 3: Conditional Constraints for KG Embeddings Michael Weyns, Pieter Bonte, Bram Steenwinckel, Filip De Turck, and Femke Ongenae IDLAB, IMEC RESEARCH GROUP AT GHENT UNIVERSITY AND ANTWERP UNIVERSITY - CONFIDENTIAL Context KG completion


slide-1
SLIDE 1

IDLAB, IMEC RESEARCH GROUP AT GHENT UNIVERSITY AND ANTWERP UNIVERSITY - CONFIDENTIAL

Session 3: Conditional Constraints for KG Embeddings

Michael Weyns, Pieter Bonte, Bram Steenwinckel, Filip De Turck, and Femke Ongenae

slide-2
SLIDE 2

CONFIDENTIAL

KG completion → link prediction True and false facts required → negative sampling

Context

slide-3
SLIDE 3

CONFIDENTIAL

  • Exploiting schema to improve negative sampling
  • Context-free constraints (RDFS domain and range axioms)
  • Closed-world constraints

Objectives:

  • Conditional constraints (OWL restrictions)
  • Open-world constraints

SOTA & objectives

slide-4
SLIDE 4

CONFIDENTIAL

Incomplete knowledge

Open World Assumption (OWA)

Fred France USA Country Person is a is a is a born in? born in?

slide-5
SLIDE 5

CONFIDENTIAL

Monotonicity

Open World Assumption (OWA)

Fred France USA Country Person is a is a is a born in? born in

slide-6
SLIDE 6

CONFIDENTIAL

Inconsistency - restriction on Fred (Person is born in some Country)

Open World Assumption (OWA) - limits

Fred France USA Country Person is a is a is a

slide-7
SLIDE 7

CONFIDENTIAL

Inconsistency - restriction on Fred (Person is born in max 1 Country)

Open World Assumption (OWA) - limits

Fred France USA Country Person is a is a is a born in born in

slide-8
SLIDE 8

CONFIDENTIAL

Negative property assertions e.g. NegativeObjectPropertyAssertion(:born_in :Fred :USA)

Open World Assumption (OWA) - limits

Fred France USA Country Person is a is a is a not born in born in?

slide-9
SLIDE 9

CONFIDENTIAL

Negative sampling - SOTA

slide-10
SLIDE 10

CONFIDENTIAL

CWA <Fred, born_in, USA> <Fred, born_in, France> <Fred, born_in, Belgium> <Fred, born_in, England> <Lucy, born_in, Scotland> ...

Negative sampling - SOTA

slide-11
SLIDE 11

CONFIDENTIAL

Perturbation (+ filtering) <Fred, born_in, USA> <Lucy, born_in, USA> <Fred, born_in, France>

Negative sampling - SOTA

slide-12
SLIDE 12

CONFIDENTIAL

Bernoulli trick per relationship r: ℎ𝑞𝑢 = 𝑏𝑤𝑕 # ℎ𝑓𝑏𝑒 𝑓𝑜𝑢𝑗𝑢𝑗𝑓𝑡 𝑢𝑏𝑗𝑚 𝑓𝑜𝑢𝑗𝑢𝑧 𝑢𝑞ℎ = 𝑏𝑤𝑕 # 𝑢𝑏𝑗𝑚 𝑓𝑜𝑢𝑗𝑢𝑗𝑓𝑡 ℎ𝑓𝑏𝑒 𝑓𝑜𝑢𝑗𝑢𝑧 perturb head with 𝑞𝑠𝑝𝑐 =

𝑢𝑞ℎ (𝑢𝑞ℎ + ℎ𝑞𝑢)

perturb tail with 𝑞𝑠𝑝𝑐 =

ℎ𝑞𝑢 (ℎ𝑞𝑢 + 𝑢𝑞ℎ)

Negative sampling - SOTA

slide-13
SLIDE 13

CONFIDENTIAL

RDFS axioms

slide-14
SLIDE 14

CONFIDENTIAL

OWL axioms

slide-15
SLIDE 15

CONFIDENTIAL

Context-free constraints - SOTA

slide-16
SLIDE 16

CONFIDENTIAL

Conditional constraints

slide-17
SLIDE 17

CONFIDENTIAL

  • 1. Type inference based on axioms
  • 2. Impose restrictive interpretation
  • 3. Constraint-based negative sampling

≡ Axiomatic consistency checking during perturbation

Constraint-based negative sampling

slide-18
SLIDE 18

CONFIDENTIAL

(𝑓𝑗, 𝑠𝑙, 𝑓𝑘) 𝑗𝑡 𝒘𝒃𝒎𝒋𝒆

Constraints - CWA interpretation (SOTA)

slide-19
SLIDE 19

CONFIDENTIAL

Constraints - OWA interpretation

(𝑓𝑗, 𝑠𝑙, 𝑓𝑘) 𝑗𝑡 𝒋𝒐𝒘𝒃𝒎𝒋𝒆

slide-20
SLIDE 20

CONFIDENTIAL

TransE embedding technique AIFB: research staff, research groups, affiliations, publications MUTAG: potentially carcinogenic molecules

Evaluation - datasets

train 19916 entities valid 2213 entities test 2459 entities OWL constraints 152 train 41999 entities valid 4667 entities test 5185 entities RDFS constraints 5087

slide-21
SLIDE 21

CONFIDENTIAL

Evaluation - results

slide-22
SLIDE 22

CONFIDENTIAL

Evaluation - results

slide-23
SLIDE 23

CONFIDENTIAL

Conclusions

  • AIFB (conditional constraints)
  • OWA interpretation
  • No improvements
  • Decrease in false negatives
  • CWA interpretation
  • Few false negatives: clear improvements
  • Many false negatives: fewer improvements
  • Best setting: no constraints, with high neg ratio
  • Few conditional constraints:
  • Many false negatives (CWA)
  • High computational complexity (OWA)
slide-24
SLIDE 24

CONFIDENTIAL

Conclusions

  • MUTAG (context-free constraints)
  • OWA interpretation
  • Clear improvements
  • Decrease in false negatives
  • CWA interpretation
  • Clear improvements
  • No increase in false negatives
  • Best setting: CWA constraints, with high neg ratio
  • Sufficient conditional constraints:
  • Consistent number of false negatives (CWA)
  • Consistent computational complexity (OWA)
slide-25
SLIDE 25

CONFIDENTIAL

Future work

  • Context-free ↔ conditional constraints (same dataset comparison)
  • Rejection hyperparameter
  • Effects on other embedding strategies
slide-26
SLIDE 26

CONFIDENTIAL

Thank you very much for listening. Any questions?

Discussion

slide-27
SLIDE 27

CONFIDENTIAL