IDLAB, IMEC RESEARCH GROUP AT GHENT UNIVERSITY AND ANTWERP UNIVERSITY - CONFIDENTIAL
Session 3: Conditional Constraints for KG Embeddings Michael Weyns, - - PowerPoint PPT Presentation
Session 3: Conditional Constraints for KG Embeddings Michael Weyns, - - PowerPoint PPT Presentation
Session 3: Conditional Constraints for KG Embeddings Michael Weyns, Pieter Bonte, Bram Steenwinckel, Filip De Turck, and Femke Ongenae IDLAB, IMEC RESEARCH GROUP AT GHENT UNIVERSITY AND ANTWERP UNIVERSITY - CONFIDENTIAL Context KG completion
SLIDE 1
SLIDE 2
CONFIDENTIAL
KG completion → link prediction True and false facts required → negative sampling
Context
SLIDE 3
CONFIDENTIAL
- Exploiting schema to improve negative sampling
- Context-free constraints (RDFS domain and range axioms)
- Closed-world constraints
Objectives:
- Conditional constraints (OWL restrictions)
- Open-world constraints
SOTA & objectives
SLIDE 4
CONFIDENTIAL
Incomplete knowledge
Open World Assumption (OWA)
Fred France USA Country Person is a is a is a born in? born in?
SLIDE 5
CONFIDENTIAL
Monotonicity
Open World Assumption (OWA)
Fred France USA Country Person is a is a is a born in? born in
SLIDE 6
CONFIDENTIAL
Inconsistency - restriction on Fred (Person is born in some Country)
Open World Assumption (OWA) - limits
Fred France USA Country Person is a is a is a
SLIDE 7
CONFIDENTIAL
Inconsistency - restriction on Fred (Person is born in max 1 Country)
Open World Assumption (OWA) - limits
Fred France USA Country Person is a is a is a born in born in
SLIDE 8
CONFIDENTIAL
Negative property assertions e.g. NegativeObjectPropertyAssertion(:born_in :Fred :USA)
Open World Assumption (OWA) - limits
Fred France USA Country Person is a is a is a not born in born in?
SLIDE 9
CONFIDENTIAL
Negative sampling - SOTA
SLIDE 10
CONFIDENTIAL
CWA <Fred, born_in, USA> <Fred, born_in, France> <Fred, born_in, Belgium> <Fred, born_in, England> <Lucy, born_in, Scotland> ...
Negative sampling - SOTA
SLIDE 11
CONFIDENTIAL
Perturbation (+ filtering) <Fred, born_in, USA> <Lucy, born_in, USA> <Fred, born_in, France>
Negative sampling - SOTA
SLIDE 12
CONFIDENTIAL
Bernoulli trick per relationship r: ℎ𝑞𝑢 = 𝑏𝑤 # ℎ𝑓𝑏𝑒 𝑓𝑜𝑢𝑗𝑢𝑗𝑓𝑡 𝑢𝑏𝑗𝑚 𝑓𝑜𝑢𝑗𝑢𝑧 𝑢𝑞ℎ = 𝑏𝑤 # 𝑢𝑏𝑗𝑚 𝑓𝑜𝑢𝑗𝑢𝑗𝑓𝑡 ℎ𝑓𝑏𝑒 𝑓𝑜𝑢𝑗𝑢𝑧 perturb head with 𝑞𝑠𝑝𝑐 =
𝑢𝑞ℎ (𝑢𝑞ℎ + ℎ𝑞𝑢)
perturb tail with 𝑞𝑠𝑝𝑐 =
ℎ𝑞𝑢 (ℎ𝑞𝑢 + 𝑢𝑞ℎ)
Negative sampling - SOTA
SLIDE 13
CONFIDENTIAL
RDFS axioms
SLIDE 14
CONFIDENTIAL
OWL axioms
SLIDE 15
CONFIDENTIAL
Context-free constraints - SOTA
SLIDE 16
CONFIDENTIAL
Conditional constraints
SLIDE 17
CONFIDENTIAL
- 1. Type inference based on axioms
- 2. Impose restrictive interpretation
- 3. Constraint-based negative sampling
≡ Axiomatic consistency checking during perturbation
Constraint-based negative sampling
SLIDE 18
CONFIDENTIAL
(𝑓𝑗, 𝑠𝑙, 𝑓𝑘) 𝑗𝑡 𝒘𝒃𝒎𝒋𝒆
Constraints - CWA interpretation (SOTA)
SLIDE 19
CONFIDENTIAL
Constraints - OWA interpretation
(𝑓𝑗, 𝑠𝑙, 𝑓𝑘) 𝑗𝑡 𝒋𝒐𝒘𝒃𝒎𝒋𝒆
SLIDE 20
CONFIDENTIAL
TransE embedding technique AIFB: research staff, research groups, affiliations, publications MUTAG: potentially carcinogenic molecules
Evaluation - datasets
train 19916 entities valid 2213 entities test 2459 entities OWL constraints 152 train 41999 entities valid 4667 entities test 5185 entities RDFS constraints 5087
SLIDE 21
CONFIDENTIAL
Evaluation - results
SLIDE 22
CONFIDENTIAL
Evaluation - results
SLIDE 23
CONFIDENTIAL
Conclusions
- AIFB (conditional constraints)
- OWA interpretation
- No improvements
- Decrease in false negatives
- CWA interpretation
- Few false negatives: clear improvements
- Many false negatives: fewer improvements
- Best setting: no constraints, with high neg ratio
- Few conditional constraints:
- Many false negatives (CWA)
- High computational complexity (OWA)
SLIDE 24
CONFIDENTIAL
Conclusions
- MUTAG (context-free constraints)
- OWA interpretation
- Clear improvements
- Decrease in false negatives
- CWA interpretation
- Clear improvements
- No increase in false negatives
- Best setting: CWA constraints, with high neg ratio
- Sufficient conditional constraints:
- Consistent number of false negatives (CWA)
- Consistent computational complexity (OWA)
SLIDE 25
CONFIDENTIAL
Future work
- Context-free ↔ conditional constraints (same dataset comparison)
- Rejection hyperparameter
- Effects on other embedding strategies
SLIDE 26
CONFIDENTIAL
Thank you very much for listening. Any questions?
Discussion
SLIDE 27
CONFIDENTIAL