building rdf and schema based peer to peer systems
play

Building RDF- and Schema-Based Peer-to-Peer Systems / University - PDF document

Building RDF- and Schema-Based Peer-to-Peer Systems / University of Hannover Wolfgang Nej dl Germany L3S Overview Relevant L3S Proj ect Background Motivation Proj ect Background - PADLR, Edutella, et al S chema-Based Peer-to-Peer


  1. Building RDF- and Schema-Based Peer-to-Peer Systems / University of Hannover Wolfgang Nej dl Germany L3S

  2. Overview Relevant L3S Proj ect Background � Motivation � Proj ect Background - PADLR, Edutella, et al S chema-Based Peer-to-Peer Networks � Characteristics and Building Blocks � Resource Description Framework (RDF) and RDF S chema � Edutella Query S ervice / RDF Query Exchange Language RDF-QEL � Semantic Web Inferencing � S ubscriptions � Efficient Routing / HyperCuP & S uper-Peers � Integration of new Peers / Clustering � Distributed Query Processing in P2P Networks � Mediation S ummary and Conclusions Wolfgang Nej dl 14/ 10/ 03 2

  3. Motivation Distributed Peer-to-Peer Infrastructures for the S emantic Web S emantic Web Metadata How can we use distributed S tandards for describing (learning) resources in a (learning) resources and users personalized way? Personalized Environments in the Adaptive Web Wolfgang Nej dl 14/ 10/ 03 3

  4. PADLR and Edutella Personalized Access to Distributed Learning Repositories (www.learninglab.de/ english/ proj ects/ padlr.html) Most important (CS) modules - Peer-to-Peer Infrastructure (incl. Clients and Providers) - Courseware Watchdog and Metadata Extraction - Personalization and Personalized Queries based on Metadata - Web-(Learning-) Services PADLR Participants (Stanford, Hannover, Karlsruhe, Stockholm, Uppsala) + Edutella Participants (Vienna, Berlin, Darmstadt, etc.) Wolfgang Nej dl 14/ 10/ 03 4

  5. Edutella: Goal and Approach S pecify and implement a RDF-based meta-data infrastructure for P2P networks Developed as part of the open source peer-to-peer proj ect JXTA edutella.j xta.org > 60 contributors from various institutions Wolfgang Nej dl 14/ 10/ 03 5

  6. Knowledge Web 6 14/ 10/ 03 Heterogeneity Wolfgang Nej dl Languages Dynamics Scalabity EU/FP6 NoE KnowledgeWeb Semantic Web Services

  7. EU/FP6 NoE PROLEARN Working towards � innovative elearning resources � interoperable elearning resources and systems � sustainable elearning infrastructures and processes for S MEs Wolfgang Nej dl 14/ 10/ 03 7

  8. EU/FP6 NoE REWERSE Reasoning on the Web with Rules and S emantics Develop reasoning languages for advanced Web systems Test these languages on adaptive Web systems and Web-based decision support systems Bring these languages to the level of pre-standards S elected applications for proof-of-concept purposes Personalized Web systems Web-based decision support Towards a Bioinformatics S emantic Web Wolfgang Nej dl 14/ 10/ 03 8

  9. Overview Relevant L3S Proj ect Background � Motivation � Proj ect Background - PADLR, Edutella, et al S chema-Based Peer-to-Peer Networks � Characteristics and Building Blocks � Resource Description Framework (RDF) and RDF S chema � Edutella Query S ervice / RDF Query Exchange Language RDF-QEL � Semantic Web Inferencing � S ubscriptions � Efficient Routing / HyperCuP & S uper-Peers � Integration of new Peers / Clustering � Distributed Query Processing in P2P Networks � Mediation S ummary and Conclusions Wolfgang Nej dl 14/ 10/ 03 9

  10. Schema-Based Peer-to-Peer Networks User-definable schemas Database Systems Schema-based P2P Systems S tructured schemas A MOS II A NY RDBMS C HATTY W EB schema- O BJECT G LOBE Query language C ONCEPT B ASE E DUTELLA based TSIMMIS O NTO B ROKER P IAZZA T UKWILA D IRECT C ONNECT fixed G NUTELLA Decentralized control schema/ N APSTER K A Z A A keywords Node autonomy P - GRID P2P Systems Transient peers CAN S elf organization key CHORD local distributed peer-to-peer (system list not complete) Wolfgang Nej dl 14/ 10/ 03 10

  11. Building Blocks Flexible S chema Language � to describe complex and heterogeneous resources in the P2P network Expressive Query Language � to retrieve data from heterogeneous data stores Efficient Network Topology � to allow appropriate routing algorithms Mediation Facilities � to integrate and combine (possibly heterogeneous) information Wolfgang Nej dl 14/ 10/ 03 11

  12. RDF / RDF Schema for Describing Distributed Resources Basic Formalisms for the S emantic Web � URIs to identify resources � Combine resources and annotate resources with attributes, using <S ubj ect, Property, Value> Tuples � Graph as basic model, easy to translate to logic facts � RDFS allows us to define the RDF vocabulary used (classes and attributes), and thus to represent simple semantic models � Possible extensions towards more expressive semantic descriptions, e.g. description logic (DAML+OIL / OWL) Using RDF / RDFS in the P2P context � Distributed annotations for distributed resources � Flexible schema definitions, which can be uniquely identified and combined, as well as extended by additional properties Wolfgang Nej dl 14/ 10/ 03 12

  13. Characterization of Peers using RDFS S chema level � S upporting specific schemas: dc, lom, dcq Property level � S upporting specific properties: dc:subj ect, lom:type, dc:format Property value range � S upported ranges for specific properties, e.g. ccs:dbms for dc:subj ect Property values � S pecific attribute values, e.g. „ exercise“ for lom:type, „ en“ for dc:language Wolfgang Nej dl 14/ 10/ 03 13

  14. RDF-QEL: RDF Query (Exchange) Language Datalog-based Query Exchange Language (RDF-QEL) � RDF QEL1: conj unctive query up to � RDF QEL5: RDF QEL4 (SQL3) + general recursion Edutella consumer see Nejdl et al: „EDUTELLA: A P2P Networking RDF query RDF QEL 1-5 Infrastructure Based on RDF“, WWW 2002 result Datalog-based ECDM � Datalog is used as the internal data model (ECDM: Edutella Common Data Model) and provided as a set of Java classes Local query � RDF is used to represent the queries transmitted between the peers repository � Wrappers for ot her RDF query languages (RQL, Edutella Provider TRIPLE, etc.) and XML query languages (like Xpath) Edutella query data flow Wolfgang Nej dl 14/ 10/ 03 14

  15. From Querying to Reasoning World Wide Web Data as Distributed (Web) content + S emantic Web Metadata Distributed and interoperable (RDF) P2P metadata descriptions about: � Content � Relationships between the content � Learner + S emantic Web Inferencing (Logic) Programs and Rules to: � Adapt the content and Content relationships (links) � Infer new metadata Content Metadata = Declarative and Composable Web Relationships S ervices Learner Model Logic Programs siehe auch REWERS E NoE Wolfgang Nej dl 14/ 10/ 03 15

  16. P2P and Semantic Web Inferencing: Edutella as basic infrastructure for ELENA (EU/FP5) Wolfgang Nej dl 14/ 10/ 03 16

  17. Another Possibility: Don‘t query, subscribe S ubscriptions are a good idea, too (get the NYTimes each morning, get new teaching material on P2P topologies … ) Example: S elective Information Dissemination in P2P-DIET Instead of Queries and Answers we need � Profile forwarding � Notification forwarding / Filtering � Advertisement forwarding � Dynamicity of P2P network � storing notifications / rendezvous S ee e.g. Koubarakis et al: S elective Information Dissemination in P2P Networks: Problems and S olutions, S IGMOD Record, S pecial P2P Issue, S eptember 2003 Wolfgang Nej dl 14/ 10/ 03 17

  18. P2P and Efficient Routing How do peer-to-peer networks scale? Requirements: � S ymmetric topology (every node is a root) � Low network diameter (small worlds property, should be O(log n)) � Limited node degrees (number of peer-connections from a node, should be O(log n)) � Load balancing of traffic � Efficient broadcast (receive broadcast messages only once) � Adaptable to dynamic number of peers Wolfgang Nej dl 14/ 10/ 03 18

  19. HyperCuP Peer-to-Peer Topology Details: see e.g. S chlosser, S intek, Decker, Nej dl: „ HyperCuP – S haping Up Peer-to-Peer Networks“ , 2nd Intl. WS on Agents and P2P Computing, 2002 Wolfgang Nej dl 14/ 10/ 03 19

  20. Hypercube Topology Broadcast Algorithm � Annotate messages with the “ dimension” of the peer-to-peer connection, and only forward it along “ higher” dimensions Properties � Network diameter, characteristic path length and number of nodes are O(log b N) � Fault tolerant, vertex-symmetric 6 7 0 2 2 1 1 3 2 0 Step 2 4 5 0 1 1 Step 3 2 2 8 1 0 Step 1 Wolfgang Nej dl 14/ 10/ 03 20

  21. Super-Peer Networks Observation: Peers vary significantly in availability, bandwidth, processing power, etc. Create network backbone from highly available and powerful peers to distribute load better. S ee also Yang, Garcia-Molina: Improving S earch in P2P S ystems, Intl. Conf. on Distributed Computing S ystems, Vienna, 2002, or file sharing networks like KaZaa � Wolfgang Nej dl 14/ 10/ 03 21

  22. Super-Peers and Routing Indices Nejdl et al. Super-Peer-Based Routing and Clustering Strategies for RDF-Based Peer-To-Peer Networks. WWW 2003 Wolfgang Nej dl 14/ 10/ 03 22

  23. Extension to Distributed Query Processing Interleave P2P techniques and query processing � Push abstract query plans through the super peer network � S uper peers pick and expand those parts of the query plan that can be executed locally � On the fly distribution and expansion of query plans S ee Brunkhorst, Dhraief, Kemper, Nej dl, Wiesner: Distributed Queries and Query Optimization in S chema-Based P2P-S ystems, VLDB-P2P- Workshop Wolfgang Nej dl 14/ 10/ 03 23

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend