building a high performance environment for rdf publishing
play

Building a High Performance Environment for RDF Publishing Pascal - PowerPoint PPT Presentation

Building a High Performance Environment for RDF Publishing Pascal Christoph These slides and all the graphics made by the author and those taken from https://openclipart.org/ are dedicated to the public domain :


  1. Building a High Performance Environment for RDF Publishing Pascal Christoph

  2. These slides and all the graphics made by the author and those taken from https://openclipart.org/ are dedicated to the public domain : https://creativecommons.org/about/cc0 . All marks mentioned may be trademarks or registered trademarks of their respective owners. Read about the license of „The scream“ of Edward Munch at https://en.wikipedia.org/wiki/File:The_Scream.jpg Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

  3. w e i v r e v O Publishing is for Consuming • Mandatory • Nice to have Story so far - experiences with lobid.org • What is lobid.org ? • Storing the data • Getting the data Publishing RDF through elasticsearch • Benefits • Some more details • Caveats Future prospects 3 Building a High Performance Environment for RDF Publishing

  4. w e i v r e v O Publishing is for Consuming • Mandatory • Nice to have Story so far - experiences with lobid.org • What is lobid.org ? • Storing the data • Getting the data Publishing RDF through elasticsearch • Benefits • Some more details • Caveats Future prospects 4 Building a High Performance Environment for RDF Publishing

  5. g n i m u s n o C r o f s i g n h i s i b l u P Publishing is for Consuming 5 Building a High Performance Environment for RDF Publishing

  6. g n i m u s n o C r o f s i g n h i s i b l u P Mandatory A resource: 6 Building a High Performance Environment for RDF Publishing

  7. g n i m u s n o C r o f s i g n h i s i b l u P Mandatory A resource: gets a dereferenceable URI: 7 Building a High Performance Environment for RDF Publishing

  8. g n i m u s n o C r o f s i g n h i s i b l u P Mandatory A resource: gets a dereferenceable URI: which provides RDF: <http://lobid.org/resource/HT002948556> <http://purl.org/dc/terms/title> "With reference to reference" . <http://lobid.org/resource/HT002948556> <http://purl.org/dc/terms/issued> "1983" . <http://lobid.org/resource/HT002948556> <http://purl.org/ontology/bibo/isbn13> "9780915145539" . <http://lobid.org/resource/HT002948556> <http://purl.org/dc/elements/1.1/creator><http://d-nb.info/gnd/135539897> . 8 Building a High Performance Environment for RDF Publishing

  9. g n i m u s n o C r o f s i g n h i s i b l u P Mandatory => basic LOD publishing is very simple: you just need a Webserver 9 Building a High Performance Environment for RDF Publishing

  10. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have • Dumps • Content Negotiation (different RDF serializations) • SPARQL • Human readable representation (best: RDFa in HTML) • Data searchable • Timely updates • High Availability • Versioning • Web developers want simple APIs providing JSON • ... 10 Building a High Performance Environment for RDF Publishing

  11. g n i m u s n o C r o f s i g n h i s l i b u P SPARQL Endpoint • (Dumps) • Content Negotiation (different RDF serializations) • SPARQL • Human readable representation (best: RDFa in HTML) • Data searchable • Timely updates • High Availability • Versioning • Web developers want simple APIs providing JSON • ... 11 Building a High Performance Environment for RDF Publishing

  12. g n i m u s n o C r o f s i g n h i s l i b u P SPARQL Endpoint • (Dumps): but may be painfully slow when having lots of data • Content Negotiation (different RDF serializations) • SPARQL • Human readable representation (best: RDFa in HTML) • (Data searchable) : maybe painfully slow • Timely updates • High Availability • Versioning • Web developers want simple APIs providing JSON • most triple stores provides JSON/RDF • Simple powerful API : too powerful/complex ? • ... 12 Building a High Performance Environment for RDF Publishing

  13. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have In principle, web developers already got simple APIs : LOD is the API ! 13 Building a High Performance Environment for RDF Publishing

  14. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have In principle, web developers already got simple APIs : Remember: 14 Building a High Performance Environment for RDF Publishing

  15. g n i m u s n o C r o f s i g n h i s i b l u P Mandatory A resource: gets a dereferenceable URI: which provides the data (in RDF): <http://lobid.org/resource/HT002948556> <http://purl.org/dc/terms/title> "With reference to reference" . <http://lobid.org/resource/HT002948556> <http://purl.org/dc/terms/issued> "1983" . <http://lobid.org/resource/HT002948556> <http://purl.org/ontology/bibo/isbn13> "9780915145539" . <http://lobid.org/resource/HT002948556> <http://purl.org/dc/elements/1.1/creator><http://d-nb.info/gnd/135539897> . 15 Building a High Performance Environment for RDF Publishing

  16. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have In principle, web developers already got powerful APIs : RESTful SPARQL 16 Building a High Performance Environment for RDF Publishing

  17. g n i m u s n o C r o f s i g n h i s i b l u P RESTful SPARQL example getting all data of all resources having a particular ISBN: curl -H "Accept: application/json" --data-urlencode 'query= prefix bibo: <http://purl.org/ontology/bibo/> SELECT * WHERE { ?s bibo:isbn13 "9780851706238" ; ?p ?o . } LIMIT 100 ' http://lobid.org/sparql/ 17 Building a High Performance Environment for RDF Publishing

  18. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have Building a High Performance Environment for RDF Publishing 18

  19. g n i m u s n o C r o f s i g n h i s i b l u P RESTful SPARQL example … and the JSON/RDF result: { "head": { "vars": [ "s", "p","o"] }, "results": { "bindings": [ { "o": { "type": "uri", "value": "http://openlibrary.org/works/OL2109573W" }, "p": { "type": "uri", "value": "http://rdvocab.info/RDARelationshipsWEMI/workManifested" }, "s": { "type": "uri", "value": "http://lobid.org/resource/HT007824357" } }, { "o": { ... 19 Building a High Performance Environment for RDF Publishing

  20. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have As it is, web developers don't like SPARQL web developer Building a High Performance Environment for RDF Publishing 20

  21. g n i m u s n o C r o f s i g n h i s i b l u P Nice to have Web developers want APIs like: http://lobid.org/resources/api/isbn/$isbn 21 Building a High Performance Environment for RDF Publishing

  22. Happy web developer

  23. w e i v r e v O Publishing is for Consuming • Mandatory • Nice to have Story so far - experiences with lobid.org • What is lobid.org ? • Storing the data • Getting the data Publishing RDF through elasticsearch • Benefits • Some more details • Caveats Future prospects 23 Building a High Performance Environment for RDF Publishing

  24. ? g r o . d i b o l s i t a h W lobid.org 24 Building a High Performance Environment for RDF Publishing

  25. ? g r o . d i b o l s i t a h W ● lobid := l inking o pen b ibliographic d ata ● LOD services of the hbz ● lobid-resources : ● exposes 85% of the hbz cooperative catalogue ● entries coming from > 200 scientific German libraries ● ~ 16 M records with 700 M triples ● with links to ~ 5 M other resources ● with links to ~ 32 M items (consisting of 300 M triples) ● lobid-organisations : ● exposes German Sigelverzeichnis and MARC-Isil directory ● ~ 40 k descriptions of institutions 25 Building a High Performance Environment for RDF Publishing

  26. ? g r o . d i b o l s i t a h W What's missing? • Dumps • Content Negotiation (different RDF serializations) • SPARQL • Human readable representation ( RDFa in HTML) • Data searchable • Timely updates • High Availability • Versioning • Web developers want simple APIs providing JSON • ... 26 Building a High Performance Environment for RDF Publishing

  27. w e i v r e v O Publishing is for Consuming • Mandatory • Nice to have Story so far - experiences with lobid.org • What is lobid.org ? • Storing the data • Getting the data Publishing RDF through elasticsearch • Benefits • Some more details • Caveats Future prospects 27 Building a High Performance Environment for RDF Publishing

  28. a t a d e h t g n i r o t s 2010 - 2011, lobid-organisation Filesystem : + easy to maintain + reliable + fast - no search - no SPARQL - ... 28 Building a High Performance Environment for RDF Publishing

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend