Semantic Web Adoption Ivan Herman, W3C First China Semantic Web - - PowerPoint PPT Presentation

semantic web adoption ivan herman w3c
SMART_READER_LITE
LIVE PREVIEW

Semantic Web Adoption Ivan Herman, W3C First China Semantic Web - - PowerPoint PPT Presentation

Semantic Web Adoption Ivan Herman, W3C First China Semantic Web Symposium (CSWS 2007), Beijing, China, 2007-11-19 (2) > Semantic Web adoption SW technologies go back to few years now we have a really stable set of specifications since


slide-1
SLIDE 1

Semantic Web Adoption Ivan Herman, W3C

First China Semantic Web Symposium (CSWS 2007), Beijing, China, 2007-11-19

slide-2
SLIDE 2

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (2)

(2)

> Semantic Web adoption

SW technologies go back to few years now…

− we have a really stable set of specifications since 2004

Lots of tools, software, system, etc, are at our disposal now Applications come to the fore So where are we exactly with adoption?

slide-3
SLIDE 3

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (3)

(3)

> A way of looking at it…

Semantic Technologies

slide-4
SLIDE 4

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (4)

(4)

> The 2007 Gartner predictions By 2017, we expect the vision of the Semantic Web […] to coalesce […] and the majority of Web pages are decorated with some form of semantic hypertext. By 2012, 80% of public Web sites will use some level of semantic hypertext to create SW documents […] 15% of public Web sites will use more extensive Semantic Web-based ontologies to create semantic databases

(note: “semantic hypertext” refers to, eg, RDFa, microformats with possible GRDDL, etc.)

Source: “Finding and Exploiting Value in Semantic Web Technologies on the Web”, Gartner Research Report, May 2007

slide-5
SLIDE 5

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (5)

(5)

> It often starts with small communities

The needs of a deployment application area:

− have serious problem or opportunity − have the intellectual interest to pick up new things − have motivation to fix the problem − its data connects to other application areas − have an influence as a showcase for others

The high energy physics community played this role for the Web in the 90’s

slide-6
SLIDE 6

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (6)

(6)

> Some SW deployment communities

Some examples: digital libraries, defense, eGovernment, energy sector, financial services, health care, oil and gas industry, life sciences … Health care and life science sector is now very active

− also at W3C, in the form of an Interest Group

slide-7
SLIDE 7

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (7)

(7)

> The “corporate” landscape is moving

Major companies offer (or will offer) Semantic Web tools or systems using Semantic Web: Adobe, BBN, Oracle, IBM, HP, Software AG, GE, Northrop Gruman, Altova, Vodafone, Dow Jones, … Others are using it (or consider using it) as part of their own

  • perations: Novartis, Pfizer, Telefónica, Vodafone, Elsevier, …

Some of the names of active participants in W3C SW related groups: ILOG, HP, Agfa, SRI International, Fair Isaac Corp., Oracle, Boeing, IBM, Chevron, Nokia, Merck, Pfizer, T-Mobile, AstraZeneca, Sun, Eli Lilly, …

slide-8
SLIDE 8

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (8)

(8)

> SWEO enterprise survey

W3C’s SW Education and Outreach Interest Group (“SWEO”) conducted a survey in January 2007 Around 50 responses from 10 countries One of the main results: majority of responders considered missing skills to be the biggest obstacle

− universities have a major role to play…

slide-9
SLIDE 9

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (9)

(9)

> SWEO’s use case collection

SWEO is actively collecting SW use cases and case studies

− use case: prototype applications within the enterprise − case study: deployed applications, either in an enterprise,

community, governmental, etc sites

slide-10
SLIDE 10

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (10)

(10)

> SWEO’s use case collection

At present there are

− 16 case studies and 9 use cases (early November 2007) − from 11 different countries around the globe − submitters’ activity areas include: automotive, broadcasting,

financial institution, health care, oil & gas industry, pharmaceutical, public and governmental institutions, publishing, telecommunications, …

− usage areas include: data integration, portals with improved local

search, business organization, B2B integration, …

Remember this URI:

http://www.w3.org/2001/sw/sweo/public/UseCases/

slide-11
SLIDE 11

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (11)

(11)

> So how do applications look like?

slide-12
SLIDE 12

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (12)

(12)

> Applications are not always very complex…

Eg: simple semantic annotations of data provides easy integration (eg, with MusicBrainz, Wikipedia, geographic data sets, etc) What is needed: some simple vocabularies, simple annotation

− annotation an be generated by a server automatically, or − added by the user via some user interface

“Semantic hypertext”, to use Gartner’s terminology

slide-13
SLIDE 13

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (13)

(13)

> Applications are not always very complex…

slide-14
SLIDE 14

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (14)

(14)

> Major application paradigm: data integration

Very important for large application areas (life sciences, energy sector, eGovernment, financial institutions), as well as everyday applications (eg, reconciliation of calendar data) Is the most representative usage both in the SWEO survey as well as in the use cases and cases studies Developments are under way at various companies, institutions

slide-15
SLIDE 15

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (15)

(15)

> The “HCLS Demo”

The W3C Health Care and Life Sciences Interest Group (“HCLS”) has developed few demonstrations on SW usage Goal is to show:

− the HCLS community how Semantic Web can be used − the SW community how this technology can be useful in this

application are

Prevailing paradigm is data integration

slide-16
SLIDE 16

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (16)

(16)

> HCLS demo: looking for Alzheimer’s targets

Signal transduction pathways are considered to be rich in proteins that might respond to chemical therapy CA1 Pyramidal Neurons are known to be particularly damaged in Alzheimer’s disease Can we find candidate genes known to be involved in signal transduction and active in Pyramidal Neurons?

slide-17
SLIDE 17

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (17)

(17)

> To answer: integrate datasets

W3C HCLS IG has already integrated a number of public datasets and ontologies

− assign URI-s to bio entities − data converted or made reachable in RDF − use reasoners to infer extra triples to increase expressiveness − query the data with SPARQL and visualization tools − around 400M triples so far…

slide-18
SLIDE 18

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (18)

(18)

> Use SPARQL to integrate…

prefix go: <http://purl.org/obo/owl/GO#> prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> prefix owl: <http://www.w3.org/2002/07/owl#> prefix mesh: <http://purl.org/commons/record/mesh/> prefix sc: <http://purl.org/science/owl/sciencecommons/> prefix ro: <http://www.obofoundry.org/ro/ro.owl#> select ?genename ?processname where { graph <http://purl.org/commons/hcls/pubmesh> { ?paper ?p mesh:D017966 . ?article sc:identified_by_pmid ?paper. ?gene sc:describes_gene_or_gene_product_mentioned_by ?article. } graph <http://purl.org/commons/hcls/goa> { ?protein rdfs:subClassOf ?res. ?res owl:onProperty ro:has_function. ?res owl:someValuesFrom ?res2. ?res2 owl:onProperty ro:realized_as. ?res2 owl:someValuesFrom ?process. graph <http://purl.org/commons/hcls/20070416/classrelations> {{?process <http://purl.org/obo/owl/obo#part_of> go:GO_0007166} union {?process rdfs:subClassOf go:GO_0007166 }} ?protein rdfs:subClassOf ?parent. ?parent owl:equivalentClass ?res3. ?res3 owl:hasValue ?gene. } graph <http://purl.org/commons/hcls/gene> { ?gene rdfs:label ?genename } graph <http://purl.org/commons/hcls/20070416> { ?process rdfs:label ?processname} }

Mesh: Pyramidal Neurons Pubmed: Journal Articles Entrez Gene: Genes GO: Signal Transduction

Inference required

Courtesy of Susie Stephens, Eli Lilly, Alan Ruttenberg, Science Commons, and the W3C HCLS IG

slide-19
SLIDE 19

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (19)

(19)

> Yielding results to the query…

DRD1, 1812 adenylate cyclase activation ADRB2, 154 adenylate cyclase activation ADRB2, 154 arrestin mediated desensitization of G-protein coupled receptor protein signaling pathway DRD1IP, 50632 dopamine receptor signaling pathway DRD1, 1812 dopamine receptor, adenylate cyclase activating pathway DRD2, 1813 dopamine receptor, adenylate cyclase inhibiting pathway GRM7, 2917 G-protein coupled receptor protein signaling pathway GNG3, 2785 G-protein coupled receptor protein signaling pathway GNG12, 55970 G-protein coupled receptor protein signaling pathway DRD2, 1813 G-protein coupled receptor protein signaling pathway ADRB2, 154 G-protein coupled receptor protein signaling pathway CALM3, 808 G-protein coupled receptor protein signaling pathway HTR2A, 3356 G-protein coupled receptor protein signaling pathway DRD1, 1812 G-protein signaling, coupled to cyclic nucleotide second messenger SSTR5, 6755 G-protein signaling, coupled to cyclic nucleotide second messenger MTNR1A, 4543 G-protein signaling, coupled to cyclic nucleotide second messenger CNR2, 1269 G-protein signaling, coupled to cyclic nucleotide second messenger HTR6, 3362 G-protein signaling, coupled to cyclic nucleotide second messenger GRIK2, 2898 glutamate signaling pathway GRIN1, 2902 glutamate signaling pathway GRIN2A, 2903 glutamate signaling pathway GRIN2B, 2904 glutamate signaling pathway ADAM10, 102 integrin-mediated signaling pathway GRM7, 2917 negative regulation of adenylate cyclase activity LRP1, 4035 negative regulation of Wnt receptor signaling pathway ADAM10, 102 Notch receptor processing ASCL1, 429 Notch signaling pathway HTR2A, 3356 serotonin receptor signaling pathway ADRB2, 154 transmembrane receptor protein tyrosine kinase activation (dimerization) PTPRG, 5793 transmembrane receptor protein tyrosine kinase signaling pathway EPHA4, 2043 transmembrane receptor protein tyrosine kinase signaling pathway NRTN, 4902 transmembrane receptor protein tyrosine kinase signaling pathway CTNND1, 1500 Wnt receptor signaling pathway

Many of the genes are indeed related to Alzheimer s Disease through gamma secretase (presenilin) activity

Courtesy of Susie Stephens, Eli Lilly, Alan Ruttenberg, Science Commons, and the W3C HCLS IG

slide-20
SLIDE 20

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (20)

(20)

> The “core”…

The “core” of the HCLS demo was the access/integration of public datasets via the Semantic Web

slide-21
SLIDE 21

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (21)

(21)

> Linking Open Data Project

Goal: “expose” open datasets in RDF Set RDF links among the data items from different datasets Set up SPARQL endpoints to query the data, too Over 2 billion triples, 3 million “links” (November 2007)

slide-22
SLIDE 22

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (22)

(22)

> Example data source: DBpedia

DBpedia is a community effort to

− extract structured (“infobox”) information from Wikipedia − provide a SPARQL endpoint to the dataset − interlink the DBpedia dataset with other datasets on the Web

slide-23
SLIDE 23

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (23)

(23)

> Extracting structured data from Wikipedia

http://en.wikipedia.org/wiki/Calgary <http://dbpedia.org/resource/Calgary> dbpedia:native_name “Calgary” ; dbpedia:altitude “1048” ; dbpedia:population_city “988193” ; dbpedia:population_metro “1079310” ; dpbedia:mayor_name dbpedia:Dave_Bronconnier ; dpbedia:governing_body dbpedia:Calgary_City_Council ; ...

Courtesy of Chris Bizer, Free University of Berlin

slide-24
SLIDE 24

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (24)

(24)

> Automatic links among open datasets

<http://dbpedia.org/resource/Calgary>

  • wl.sameAs <http://sws.geonames.org/5913490>;

... <http://sws.geonames.org/5913490>

  • wl:sameAs <http://DBpedia.org/resource/Calgary>

wgs84_pos:lat “51.050112282”; wgs84_pos:long “-114.085285152”; sws:population “968460” ...

DBpedia Geonames Processors can switch automatically from one to the other…

slide-25
SLIDE 25

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (25)

(25)

> Linking Open Data Project (cont)

This is a major community project

− anybody can participate; to subscribe to the list:

 http://simile.mit.edu/mailman/listinfo/linking-open-data

− or look at the project site:

 http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects

/LinkingOpenData

− if you know of open data sets: contact the project to incorporate it

with the rest!

Applications using this set of data in real-life setting should come to the fore soon

slide-26
SLIDE 26

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (26)

(26)

> Examples of real applications…

slide-27
SLIDE 27

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (27)

(27)

> Integrate knowledge for Chinese Medicine

Integration of a large number of relational databases (on traditional Chinese medicine) using a Semantic Layer

− around 80 databases, around 200,000 records each

A visual tool to map databases to the semantic layer using a specialized ontology Form based query interface for end users

Courtesy of Huajun Chen, Zhejiang University, (SWEO Case Study)

slide-28
SLIDE 28

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (28)

(28)

> Find the right experts at NASA

Expertise locater for nearly 20,000 NASA civil servants using RDF integration techniques over 6 or 7 geographically distributed databases, data sources, and web services…

Courtesy of Kendall Clark, Clark & Parsia, LLC

slide-29
SLIDE 29

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (29)

(29)

> Public health surveillance (Sapphire)

Integrated biosurveillance system (biohazards, bioterrorism, disease control, etc): integrates data from multiple disparate sources and generated reports

Courtesy of Parsa Mirhaji, School of Health Information Sciences, University of Texas (SWEO Case Study)

Different views of the data New information can be added easily

slide-30
SLIDE 30

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (30)

(30)

> Oracle’s Technology Network portal

Courtesy of Mike DiLascio, Siderean Software, and Justin Kestelyn, Oracle Corporation (SWEO Case Study)

Aggregates many source of content Re-group, categorize, etc content (using a taxonomy)

slide-31
SLIDE 31

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (31)

(31)

> Help for deep sea drilling operations

Integration of experience and data in the planning and

  • peration of deep sea drilling

processes Discover relevant experiences that could affect current or planned drilling operations

Courtesy of David Norheim and Roar Fjellheim, Computas AS (SWEO Use Case)

slide-32
SLIDE 32

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (32)

(32)

> Additional paradigm: intelligent portals

“Portals” collecting data and presenting them to users

− they can be public or behind corporate firewalls

Portal’s internal organization makes use of semantic data and

  • ntologies

− integration with external and internal data

  • these applications often extend the data integration paradigm

− better queries, often based on controlled vocabularies or

  • ntologies…
slide-33
SLIDE 33

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (33)

(33)

> Semantic portal for art collections

Courtesy of Jacco van Ossenbruggen, CWI, and Guus Schreiber, VU Amsterdam

slide-34
SLIDE 34

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (34)

(34)

> FAO Journal portal

Improved search on journal content based on an agricultural

  • ntology and thesaurus (AGROVOC)

Courtesy of Gauri Salokhe, Margherita Sini, and Johannes Keizer, FAO, (SWEO Case Study)

slide-35
SLIDE 35

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (35)

(35)

> Eli Lilly’s Target Assessment Tool

Better prioritization of possible drug target, integrating data from different sources and formats Integration, search, etc, via ontologies (proprietary and public)

Courtesy of Susie Stephens, Eli Lilly (SWEO Case Study)

slide-36
SLIDE 36

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (36)

(36)

> Digital music asset portal at NRK

Used by program production to find the right music in the archive for a specific show

Courtesy of Robert Engels, ESIS, and Jon Roar Tønnesen, NRK (SWEO Case Study)

slide-37
SLIDE 37

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (37)

(37)

> Improved Search via Ontology: GoPubMed

Improved search on top of pubmed.org

− search results are ranked using ontologies − related terms are highlighted, usable for further search

slide-38
SLIDE 38

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (38)

(38)

> Vodafone live!

Integrate various vendors’ product descriptions via RDF

− ring tones, games, wallpapers − manage complexity of handsets, binary

formats

A portal is created to offer appropriate content Significant increase in content download after the introduction

Courtesy of Kevin Smith, Vodafone Group R&D (SWEO Case Study)

slide-39
SLIDE 39

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (39)

(39)

> Other examples…

Sun’s White Paper and System Handbook collections Nokia’s S60 support portal Harper’s Online Magazine Oracle’s virtual pressroom Dow Jones’ Synaptica

slide-40
SLIDE 40

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (40)

(40)

> All kind of other types of applications…

The coming few examples are more “knowledge management” type of applications Less typical usage the “Web” aspect of the Semantic Web…

slide-41
SLIDE 41

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (41)

(41)

> Repair and Diagnostics Documentation

Store repair and diagnostic operations in a repository with an OWL model A “diagnostic engine” generates the manuals on the fly, using RDF for information exchange

Courtesy of François-Paul Servant, Renault, (SWEO Use Case)

slide-42
SLIDE 42

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (42)

(42)

> Portal to Principality of Asturias’ documents

Search through governmental documents A “bridge” is created between the users and the juridical jargon using SW vocabularies and tools

Courtesy of Diego Berrueta and Luis Polo, CTIC, U. of Oviedo, and the Principality of Asturias, (SWEO Case Study)

slide-43
SLIDE 43

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (43)

(43)

> Natural interface to business applications

Courtesy of C. Anantaram, Tata Consultancy Services Limited (SWEO Case Study)

Users interact with a business application (eg, via email) in natural language; OWL helps in the retrieval of relevant concepts

slide-44
SLIDE 44

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (44)

(44)

> Other application areas come to the fore

Content management Knowledge management Business intelligence Collaborative user interfaces Sensor-based services Linking virtual communities Grid infrastructure Multimedia data management Etc

slide-45
SLIDE 45

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (45)

(45)

> Thank you for your attention!

These slides are publicly available on:

http://www.w3.org/2007/Talks/1119-Beijing-IH/

slide-46
SLIDE 46

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (46)

(46)

> CEO guide for SW: the “DO-s”

Start small: Test the Semantic Web waters with a pilot project […] before investing large sums of time and money. Check credentials: A lot of systems integrators don't really have the skills to deal with Semantic Web technologies. Get someone who's savvy in semantics. Expect training challenges: It often takes people a while to understand the technology. […] Find an ally: It can be hard to articulate the potential benefits, so find someone with a problem that can be solved with the Semantic Web and make that person a partner.

Source: BusinessWeek Online, April 2007

slide-47
SLIDE 47

Ivan Herman, “Semantic Web Adoption”, CSWS2007, 2007-11-19, Beijing, China (47)

(47)

> CEO guide for SW: the “DON’T-s”

Go it alone: The Semantic Web is complex, and it's best to get help. […] Forget privacy: Just because you can gather and correlate data about employees doesn’t mean you should. Set usage guidelines to safeguard employee privacy. Expect perfection: While these technologies will help you find and correlate information more quickly, they’re far from perfect. Nothing can help if data are unreliable in the first place. Be impatient: One early adopter at NASA says that the potential benefits can justify the investments in time, money, and resources, but there must be a multi-year commitment to have any hope of success

Source: BusinessWeek Online, April 2007