The NoRDF Project Fabian Suchanek Amazing! This talk is free of - - PowerPoint PPT Presentation

the nordf project
SMART_READER_LITE
LIVE PREVIEW

The NoRDF Project Fabian Suchanek Amazing! This talk is free of - - PowerPoint PPT Presentation

The NoRDF Project Fabian Suchanek Amazing! This talk is free of the Corona virus! (about the speaker, we dont know...) Knowledge Bases P erson subclassOf Singer type born 1935 For us, a knowledge base (KB) is a graph, where the


slide-1
SLIDE 1

The NoRDF Project

Fabian Suchanek Amazing! This talk is free

  • f the Corona virus!

(about the speaker, we don’t know...)

slide-2
SLIDE 2

For us, a knowledge base (KB) is a graph, where the nodes are en ti ties and the edges are relations. 2

Knowledge Bases

type born 1935 Singer P erson subclassOf

(We do not distinguish T

  • Box and A-Box.)
slide-3
SLIDE 3

Cool knowledge‐based applications

Apple Siri 3

When was Elvis born? “1935”

IBM Watson Discovered 6 kineasis proteins that relate to cancer How long was the Thirty Y ears’ War? Amazon Echo These applications feed from knowledge bases.

slide-4
SLIDE 4

There are plen ty of knowledge bases

NELL TextRunner

Plus industrial projects at Sponsored message: New version of YAGO at h ttp://yago-knowledge.org .

slide-5
SLIDE 5

What’s in a knowledge base?

From YAGO

Essen tially binary facts (“triples”) in the knowledge format “RDF”: 5

slide-6
SLIDE 6

What’s in the real world?

In February 1998, Andrew Wake field published a paper in the medical journal The Lance t, which reported on twelve children wi th developmen tal disorders. The paren ts were said to have linked the start

  • f behavioral symptoms to vaccination. The resul

ting con troversy became the biggest science story of 2002. As a resul t, vaccination rates dropped sharply. In 2011, the BMJ de tailed how Wake field had faked some of the data behind the 1998 Lance t article. Belie fs Claims Even ts Reasons Stories Falsifications ...none of which is in a knowledge base! 6

slide-7
SLIDE 7

The NoRDF Project: Go Beyond Triples

If we wan t tomorrow’s in telligen t applications to be really in telligen t, we have to extend their knowledge bases by 7 1) We have to be able to extract complex knowledge from text (“IE”) 2) We have to be able to represen t such knowledge and to reason on i t Belie fs Claims Even ts Reasons Stories Falsifications

slide-8
SLIDE 8

8 Several cool approaches can extract non‐binary in formation:

  • FRED
  • K-P

arser

  • Documen

t spanners

  • ClausIE

Andrew Wake field published in The Lance t in 1998. Publication_even t author venue time

IE: What is possible already

  • StuffIE
  • OpenIE
  • HighLife
  • Classical slot fillers
slide-9
SLIDE 9

IE: What we need

9 “Wake field published a paper that reported on children. Their paren ts were said to have linked the start of behavioral symptoms to vaccination. The resul ting con troversy caused vaccination rates to fall. ...” Publication RateChange Wake field paper Claim symptoms children vaccination Link paren ts vaccinationRate

  • caused
  • f

direction author pub. con ten t about

  • f

by

  • f
  • f
  • f
slide-10
SLIDE 10

IE: What we need

10 Publication RateChange Wake field paper Claim symptoms children vaccination Link paren ts vaccinationRate

  • caused
  • f

direction author pub. con ten t about

  • f

by

  • f
  • f
  • f

Y

  • u know a system

that can do (part of) i t? Please le t me know! T ype here: _ _ _ _ _ _ _ _ _ _ _ _ Cross‐sen tence analysis, advanced co‐re ference resolution, standardized types of frames, relationships be tween even ts, negation, hypothe tical stances, storylines, ...

slide-11
SLIDE 11

Reasoning: What we have

11 RateChange vaccinationRate

  • f

direction

As knowledge represen tation:

  • Frames, JSON
  • complex objects
  • object-relational databases

Publication Wake field paper

caused author pub.

slide-12
SLIDE 12

Reasoning: What we have

12 As knowledge represen tation:

  • Frames, JSON
  • complex objects
  • object-relational databases
  • Fact iden

tifiers

  • RDF*
  • Reification

For reasoning:

  • RDFS, OWL DL, SHACL
  • Description Logic
  • Con

text logics

  • Modal logics
  • Epistemic logics
  • Formal argumen

tation

  • Belie

f revision

  • Provenance and annotated logics

Cannot represen t

  • “All clien

ts believe that the company delivers a good service”

  • “the loss of value on the stock marke

t happened because the public learned of a fraudulen t activi ty by the company”

  • “Mary believes everything P

aul says, P aul says Mary believes ” ... or if they can, they are undecidable

slide-13
SLIDE 13

Reasoning: What we need

13 1) a very simple logic inside a con text 2) a very simple logic about con texts => a moderately simple logic in combination First‐order logic wi thout ? OWL EL? Datalog? Horn Rules? Datalog? Y

  • u have a great idea? Le

t me know! (?) (?) V agueness, fuzziness, and probabili ty: orthogonal topics

slide-14
SLIDE 14

Applications

14

  • Analysis of fake news / fact checking:

understand an article about a con troversial topic, allow reasoning (who said what when and why, what is the evidence, ...)

  • Analysis of the e-reputation of a company:

extract con troversy or belie fs wi th reasons and supporters, for companies or their products

  • Modeling of con

troversies: de tect a con troversial topic on the Web (in blogs, forums, T wi tter), extract opinions, and model differen t views

>more

Understanding the argumen ts of the other side is a prerequisi te for re futing them.

slide-15
SLIDE 15

Applications

15

  • Flagging of poten

tially fraudulen t activi ty: De tect claims that con tradict knowledge, or violate rules.

  • Modeling of processes:

Model sequences of actions, causal relationships, and suggestions.

  • Smarter chatbots:

Allow dialogues that go beyond single-shot questions.

  • Legal text understanding:

Analyze a law, a regulation, or a con tract, and derive what is permi tted and what is obligatory for which party.

slide-16
SLIDE 16

Our project “NoRDF”

16 Our project “NoRDF” aims to extract and model complex in formation from natural language text. We are supported by the French National Research Agency, T élécom P aris, and 4 sponsors:

slide-17
SLIDE 17

Our project “NoRDF”: Who’s there?

17 Fabian Suchanek Professor at T élécom P aris Knowledge Bases, Reasoning, NLP Chloé Clavel Professor at T élécom P aris Affective Computing, Sen timen t Analysis We hired

  • Pierre-Henri P

aris (CNAM) as a postdoc

  • Chadi Helwe (American Univ. of Beirut) as PhD studen

t

  • Sanaz Hasanzadeh (AUT T

ehran) as PhD studen t

slide-18
SLIDE 18

And we are still hiring!

18 We are hiring PhD studen ts, postdocs, and engineers, for the project

  • r anything that has to do wi

th NLP , knowledge bases, and reasoning! Join our team! h ttps:// suchanek.name -> NoRDF

slide-19
SLIDE 19

Backup Slides

slide-20
SLIDE 20

Reasoning: What we have

20 RateChange vaccinationRate

  • f

direction

As knowledge represen tation:

  • Frames, JSON
  • complex objects
  • object-relational databases

Publication Wake field paper

caused author pub.

slide-21
SLIDE 21

Reasoning: What we have

21 RateChange vaccinationRate

  • f

direction

As knowledge represen tation:

  • Frames, JSON
  • complex objects
  • object-relational databases

Publication Wake field paper

caused author pub.

great, but do not allow for reasoning

  • “If X caused Y and Y caused Z, then X caused Z”
  • “If X did not publish a paper, X is not a scien

tist”

  • “If Mary believes what P

aul says & P aul says X, then Mary believes X”

slide-22
SLIDE 22

Reasoning: What we have

22 RateChange vaccinationRate

  • f

direction

For reasoning:

  • RDFS, OWL DL, SHACL
  • Description Logic

Publication Wake field paper

caused author pub.

slide-23
SLIDE 23

Reasoning: What we have

23 RateChange vaccinationRate

  • f

direction

For reasoning:

  • RDFS, OWL DL, SHACL
  • Description Logic

Publication Wake field paper

caused author pub.

great, but do not allow for statemen ts about statemen ts

  • “The paper says that vaccines cause autism”
  • “Fact A caused Fact B”
slide-24
SLIDE 24

Reasoning: What we have

24 RateChange vaccinationRate

  • direction

Annotated Knowledge Represen tations:

  • Fact iden

tifiers

  • RDF*
  • Reification
  • f

Publication Wake field paper

caused author pub.

slide-25
SLIDE 25

Reasoning: What we have

25 RateChange vaccinationRate

  • direction

Annotated Knowledge Represen tations:

  • Fact iden

tifiers

  • RDF*
  • Reification
  • f

Publication Wake field paper

caused author pub.

cannot deal wi th hypothe tical statemen ts cannot do reasoning

  • “Mary believes that vaccines cause autism”
slide-26
SLIDE 26

Reasoning: What we have

26 Big logic machinery:

  • Con

text logics

  • Modal logics
  • Epistemic logics
slide-27
SLIDE 27

Reasoning: What we have

27 Big logic machinery:

  • Con

text logics

  • Modal logics
  • Epistemic logics
  • “All clien

ts believe that the company delivers a good service”

  • “the loss of value on the stock marke

t happened because the public learned of a fraudulen t activi ty by the company”

(or if they can, they are proposi tional logics or undecidable)

cannot quan tify over con texts Formal argumen tation has monoli thic proposi tions. Belie f revision has monoli thic agen ts. Provenance and annotated logics cannot make claims about annotations. V agueness, fuzziness, and probabili ty are orthogonal topics.