Applications of Rule Mining in Knowledge Bases
Luis Galárraga
November 3rd, 2014 PIKM, Shanghai
1
Applications of Rule Mining in Knowledge Bases Luis Galrraga - - PowerPoint PPT Presentation
Applications of Rule Mining in Knowledge Bases Luis Galrraga November 3 rd , 2014 PIKM, Shanghai 1 Knowledge Bases (KBs) Barack Obama hasChild born On hasChild Malia Aug 4, 1961 hasChild marriedTo hasChild Michelle Sasha 2 KBs in
Luis Galárraga
November 3rd, 2014 PIKM, Shanghai
1
2 hasChild marriedTo born On Aug 4, 1961
Sasha Barack Obama Michelle
hasChild
Malia
hasChild hasChild
3
4
5
6 hasChild marriedTo born On Aug 4, 1961
Sasha Barack Obama Michelle
hasChild
Malia
hasChild hasChild
7 hasChild born On Aug 4, 1961
Sasha
hasChild
Malia
hasChild
marriedTo hasChild
8 hasChild marriedTo born On Aug 4, 1961
Sasha Barack Obama Michelle
hasChild
Malia
hasChild hasChild
9 born On Aug 4, 1961
Malia
marriedTo
hasChild hasChild hasChild hasChild
10 born On Aug 4, 1961
Malia
marriedTo
hasChild hasChild hasChild hasChild
11
Elvis Presley Priscilla Lisa Marie
hasChild marriedTo
12
Elvis Presley Priscilla Lisa Marie
hasChild hasChild?
marriedTo
13
Elvis Presley Priscilla Lisa Marie
hasChild hasChild isMarriedTo
14
Elvis Presley Priscilla Lisa Marie
hasChild hasChild isMarriedTo
15
Sasha Michelle Malia
hasChild hasChild
16
Sasha Michelle Malia
hasChild hasChild hasChild
17
hasChild marriedTo
Sasha Michelle
hasChild
Malia
hasChild hasChild
18
hasChild marriedTo
Sasha Michelle
hasChild
Malia
hasChild hasChild
19
hasChild marriedTo
Prince Charles Camilla
hasChild hasChild
Prince William Tom Laura
20 hasChild hasChild marriedTo
Prince Charles Camilla
hasChild
hasChild
Tom Laura Prince William
21 hasChild hasChild marriedTo
Prince Charles Camilla
hasChild
Tom Laura Prince William
22
Elvis Presley Priscilla Lisa Marie
hasChild marriedTo
23
Elvis Presley Priscilla Lisa Marie
hasChild hasChild marriedTo
24
Elvis Presley Priscilla Lisa Marie
hasChild hasChild marriedTo
Standard confidence counts it as a miss
25
Elvis Presley Priscilla Lisa Marie
hasChild hasChild marriedTo
– Minimum support threshold – Mining operators – Monotonicity of support for pruning – Optimized in-memory database – Confidence gain is used to prune the output.
26
Luis Galárraga, Christina Teflioudi, Katja Hose, Fabian Suchanek. AMIE: Association Rule Mining Under Incomplete Evidence in Ontological Knowledge
z x hasChild
27
z x hasChild
28
z x hasChild
z x hasChild ?r marriedTo influences …. y
29
z x hasChild
z x hasChild ?r marriedTo influences …. y z x hasChild marriedTo y
30
z x hasChild
z x hasChild ?r marriedTo influences …. y z x hasChild marriedTo y
z x hasChild y marriedTo ?r hasChild supervises …
31
z x hasChild
z x hasChild ?r marriedTo influences …. y z x hasChild marriedTo y
z x hasChild y marriedTo ?r hasChild supervises … hasChild z x hasChild y marriedTo
32
z x hasChild
z x hasChild ?r marriedTo influences …. y z x hasChild marriedTo y
z x hasChild y marriedTo ?r hasChild supervises … hasChild z x hasChild y marriedTo
33
Minimum support threshold RDF KB
1 1 Concurrent mining implementation Tailored In-memory DB
34
Minimum support threshold RDF KB
1 1 Concurrent mining implementation Tailored In-memory DB
35
PCA Confidence used to rank rules
isMarriedTo(x, y) livesIn(x, z) => livesIn(y, z) ∧ isCitizenOf(x, y) => livesIn(x, y) hasAdvisor(x, y) graduatedFrom(x, z) => worksAt(y, z) ∧ hasWonPrize(x, Gottfried Wilhelm Leibniz Prize) => livesIn(x, Germany)
38
Sasha Barack Obama
hasChild
Malia
hasChild
Sasha President Obama
parent
Malia
parent
sibling 39
Sasha Barack Obama
hasChild
Malia
hasChild parent
Malia
parent sameAs sameAs sameAs
sibling 40
Sasha President Obama
Sasha Barack Obama
hasChild
Malia
hasChild parent parent sameAs sameAs sameAs
sibling 41
Sasha Malia President Obama
Sasha Barack Obama
hasChild
Malia
hasChild parent parent sameAs sameAs sameAs
sibling 42
Sasha Malia President Obama
Sasha Barack Obama
hasChild
Malia
hasChild parent parent
43
hasChild <=> parent-1 hasChild(y, x) hasChild(y, z) => sibling(x, z)
sibling
r(x, y) => r'(x, y) R-subsumption r(x, y) <=> r'(x, y) R-equivalence type(x, C) => type(x, C') C-subsumption r1(x, y), r2(y, z) => r'(x, z) 2-hops translation r(x, z) r(y, z) => r'(x, y) Triangle alignment r1(x, y), r2(x, V) => r'(x, y) Specific R-subsumption r(x, V) => r'(x, V') Attribute-Value translation r1(x, V1), r2(x, V2) => r'(x, V') 2-values translation Luis Galárraga, Nicoleta Preda, Fabian Suchanek. Mining Rules to Align Knowledge Bases. In Automated Knowledge Base Construction Workshop (AKBC), 2013.
44
Barack Obama
is a graduate of
Harvard Law School Columbia University
earned degree from earned degree from 45
Barack Obama
is a graduate of
Harvard Law School Columbia University
earned degree from 46 earned degree from
Barack Obama
is a graduate of
Harvard Law School Columbia University
earned degree from
47 earned degree from
is a graduate of <=> earned degree from
Barack Obama
is a graduate of
Harvard Law School Columbia University
earned degree from
Luis Galárraga, Geremy Heitz, Kevin Murphy, Fabian Suchanek. Canonicalizing Open Knowledge Bases. In CIKM, 2014
48 earned degree from
– Multiple rules can predict a fact – Integrate soft and hard constraints
49