LOD2 Stack and the NLP2RDF project http://slideshare.net/kurzum - - PowerPoint PPT Presentation

lod2 stack and the nlp2rdf project
SMART_READER_LITE
LIVE PREVIEW

LOD2 Stack and the NLP2RDF project http://slideshare.net/kurzum - - PowerPoint PPT Presentation

Creating Knowledge out of Interlinked Data Rome 2013/03/13 Page 1 http://lod2.eu LOD2 Stack and the NLP2RDF project http://slideshare.net/kurzum http://nlp2rdf.org http://lod2.eu Sebastian Hellmann AKSW, Universitt Leipzig


slide-1
SLIDE 1

Rome – 2013/03/13 – Page 1 http://lod2.eu

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page http://lod2.eu

AKSW, Universität Leipzig

Sebastian Hellmann

LOD2 Stack and the NLP2RDF project

http://nlp2rdf.org http://lod2.eu http://slideshare.net/kurzum

slide-2
SLIDE 2

Rome – 2013/03/13 – Page 2 http://lod2.eu

Topics:

  • Introduction to LOD
  • LOD2 Stack http://stack.lod2.eu/
  • NLP Interchange Format (NIF), Version 2.0

Overview

slide-3
SLIDE 3

Rome – 2013/03/13 – Page 3 http://lod2.eu

http://lod-cloud.net

slide-4
SLIDE 4

Rome – 2013/03/13 – Page 4 http://lod2.eu

http://lod-cloud.net

Linked Open Data

  • All datasets provide open access to individual records via http
  • Many are free (no payment required, as in free beer)
  • Some are openly licenced, e.g. CC-0 or CC-By-Sa
slide-5
SLIDE 5

Rome – 2013/03/13 – Page 5 http://lod2.eu

http://stack.lod2.eu/ LOD2 Gang

slide-6
SLIDE 6

Rome – 2013/03/13 – Page 6 http://lod2.eu

http://stack.lod2.eu/

slide-7
SLIDE 7

Rome – 2013/03/13 – Page 7 http://lod2.eu

http://stack.lod2.eu/

slide-8
SLIDE 8

Rome – 2013/03/13 – Page 8 http://lod2.eu

http://stack.lod2.eu/

#download the repository package wget http://stack.lod2.eu/lod2repository_current_all.deb #install the repository package sudo dpkg -i lod2repository_current_all.deb #update the repository database sudo apt-get update #lod2demo is a meta root package that installs all LOD2 components sudo apt-get lod2demo

slide-9
SLIDE 9

Rome – 2013/03/13 – Page 9 http://lod2.eu

http://stack.lod2.eu/

slide-10
SLIDE 10

Rome – 2013/03/13 – Page 10 http://lod2.eu

http://stack.lod2.eu/

slide-11
SLIDE 11

Rome – 2013/03/13 – Page 11 http://lod2.eu

http://stack.lod2.eu/

slide-12
SLIDE 12

Rome – 2013/03/13 – Page 12 http://lod2.eu

http://stack.lod2.eu/

slide-13
SLIDE 13

Rome – 2013/03/13 – Page 13 http://lod2.eu

http://stack.lod2.eu/

slide-14
SLIDE 14

Rome – 2013/03/13 – Page 14 http://lod2.eu

http://stack.lod2.eu/

slide-15
SLIDE 15

Rome – 2013/03/13 – Page 15 http://lod2.eu

http://stack.lod2.eu/

slide-16
SLIDE 16

Rome – 2013/03/13 – Page 16 http://lod2.eu

http://stack.lod2.eu/

http://geoknow.eu

slide-17
SLIDE 17

Rome – 2013/03/13 – Page 17 http://lod2.eu

http://stack.lod2.eu/

slide-18
SLIDE 18

Rome – 2013/03/13 – Page 18 http://lod2.eu

http://stack.lod2.eu/

http://demo.lod2.eu/lod2demo

slide-19
SLIDE 19

Rome – 2013/03/13 – Page 19 http://lod2.eu

http://stack.lod2.eu/

slide-20
SLIDE 20

Rome – 2013/03/13 – Page 20 http://lod2.eu

Knowledge Extraction Article in Wikipedia

slide-21
SLIDE 21

Rome – 2013/03/13 – Page 21 http://lod2.eu

LOD2 EU Project produces LOD2 Stack. Three requirements to unlock Natural Language Processing (NLP) for the project:

  • 1. NLP tool output is required to be in RDF
  • 2. Scalability (less triples, focus on usefulness)
  • 3. Common vocabulary to integrate and use NLP tools

Knowledge Extraction from Text

slide-22
SLIDE 22

Rome – 2013/03/13 – Page 22 http://lod2.eu

LOD2 EU Project produces LOD2 Stack. Three requirements to unlock Natural Language Processing (NLP) for the project:

  • 1. NLP tool output is required to be in RDF
  • 2. Scalability (less triples, focus on usefulness)
  • 3. Common vocabulary to integrate and use NLP tools

The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations.

  • Version 1.0 published in November 2011
  • Version 2.0 is scheduled for completion within 2013

NLP Interchange Format 2.0

slide-23
SLIDE 23

Rome – 2013/03/13 – Page 23 http://lod2.eu

NIF Architecture

slide-24
SLIDE 24

Rome – 2013/03/13 – Page 24 http://lod2.eu

Linguistic LOD Cloud

slide-25
SLIDE 25

Rome – 2013/03/13 – Page 25 http://lod2.eu

Adressing Primary Data

slide-26
SLIDE 26

Rome – 2013/03/13 – Page 26 http://lod2.eu

Adressing Primary Data

NIF 1.0: http://www.w3.org/DesignIssues/LinkedData.html#offset_717_729 NIF 2.0 uses RFC 5147: http://www.w3.org/DesignIssues/LinkedData.html#char=717,729 User extensions possible: http://www.w3.org/DesignIssues/LinkedData.html#your_own_scheme (but you have to link to documentation on how it was created)

slide-27
SLIDE 27

Rome – 2013/03/13 – Page 27 http://lod2.eu

As a Web Service

curl

  • -data-urlencode prefix="http://prefix.given.by/theClient#"
  • -data-urlencode input="[...]"

(--data-urlencode source=”http://www.w3.org/DesignIssues/LinkedData.html” http://nlp2rdf.lod2.eu/demo/NIFStanfordCore

slide-28
SLIDE 28

Rome – 2013/03/13 – Page 28 http://lod2.eu

  • Tibeto-Burman languages: http://purl.org/olia/tibet.owl#VNst
  • Russian TreeTagger :

http://purl.org/olia/russ.owl#partizip_prt_sg_neut_passiv_gen_langform

  • German STTS: http://purl.org/olia/stts.owl#VAPP
  • English Penn: http://purl.org/olia/penn.owl#VBG

→ all map to http://purl.org/olia/olia.owl#NonFiniteVerb Ontologies of Lingingustic Annotation (OLiA) contain mappings for over 50 Tagsets (free and open, CC-By)

Vocabulary Module: OLiA

slide-29
SLIDE 29

Rome – 2013/03/13 – Page 29 http://lod2.eu

  • NIF 2.0 tries to be compatible to (Vocabulary Module):
  • ITS 2.0
  • FISE used in Apache Stanbol (IKS-EU Project)
  • LAF/GrAF XML – ISO standard, recently published
  • Fragment Identifiers by IETF and W3C
  • Lemon ontology from Monnet EU Project
  • NERD ontology from EURECOM and LinkedTV EU Project
  • Xpointer/XPath URI scheme
  • Open Annotation

NIF 2.0 - plans

slide-30
SLIDE 30

Rome – 2013/03/13 – Page 30 http://lod2.eu

NIF 2.0 :

  • NIF is free and open (CC-0 or CC-BY)
  • All ontologies will be hosted for persistently by University

Leipzig

  • Sign up on the mailinglist at http://nlp2rdf.org
  • Provide Use Cases, Requirements, Implementations at:
  • http://wiki.nlp2rdf.org/wiki/Use_cases#Use_cases
  • http://wiki.nlp2rdf.org/wiki/Requirements#Requirements

How you can contribute:

slide-31
SLIDE 31

Rome – 2013/03/13 – Page 31 http://lod2.eu

LOD 2 Stack

  • Currently project half-time
  • Most of the tools are free and open source
  • Commercial rollout planned
  • Many webinars available
  • You can integrate your tool via Debian package

http://lod2.eu http://stack.lod2.eu/

How you can contribute:

slide-32
SLIDE 32

Rome – 2013/03/13 – Page 32 http://lod2.eu

Open Community – All feedback is welcome! http://slideshare.net/kurzum Websites: http://nlp2rdf.org http://lod2.eu

Thanks for your attention