Document Navigation: Ontologies or Knowledge Organisation Systems - - PowerPoint PPT Presentation

document navigation ontologies or knowledge organisation
SMART_READER_LITE
LIVE PREVIEW

Document Navigation: Ontologies or Knowledge Organisation Systems - - PowerPoint PPT Presentation

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Document Navigation: Ontologies or Knowledge Organisation Systems Simon Jupp - NETTAB 2007 BioHealth Informatics Group University of Manchester, UK A


slide-1
SLIDE 1

Document Navigation: Ontologies or Knowledge Organisation Systems

Simon Jupp - NETTAB 2007 BioHealth Informatics Group University of Manchester, UK

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

slide-2
SLIDE 2

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Bioinformatics relies heavily on web for IR. Introduction Ontologies are being developed as background knowledge to drive the Semantic Web. Message: Formal ontologies are not the only knowledge artefact needed, artefacts with weaker semantics have their role and are the best solution in some circumstances

slide-3
SLIDE 3

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

COHSE Navigation via Hypertext is a mainstay of WWW Problem: Links are typically embedded to Web pages; hard-coding, format restrictions, ownership, legacy resources, maintenance, Unary targets etc. Which model is best suited for Navigation? Strict semantics like OWL or something with weaker semantics like SKOS. Is ontological formality a help or a hindrances?

slide-4
SLIDE 4

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

SeaLife use case - study of disease

NeLI: National electronic Library of Infection portal. Range of users but few links.

User Group Question Targets Family Doctor (GP) Tuberculosis drugs and side effects? British National Formulary (BNF) Clinicians Tuberculosis treatments guidelines? Public Health Observatories (PHO) Molecular Biologists Drug resistant tuberculosis species? PubMed General Public What is tuberculosis? Health Protection Agency (HPA)

  • r the NHS direct online website.

Given a document about Tuberculosis, where would users want to navigate to next?

http://www.neli.org.uk

slide-5
SLIDE 5

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

SeaLife background knowledge

To cover molecular biology through to medicine we need a large knowledge artefact to serve as background knowledge for

  • SeaLife. This artefact must support sensible navigation between

documents on the web.

Luckily…

slide-6
SLIDE 6

Phenotype

Sequence Proteins Gene products Transcript Pathways Cell type BRENDA tissue / enzyme source Development Anatomy Phenotype Plasmodium life cycle

  • Sequence types

and features

  • Genetic Context
  • Molecule role
  • Molecular Function
  • Biological process
  • Cellular component
  • Protein covalent bond
  • Protein domain
  • UniProt taxonomy
  • Pathway ontology
  • Event (INOH pathway
  • ntology)
  • Systems Biology
  • Protein-protein

interaction

  • Arabidopsis development
  • Cereal plant development
  • Plant growth and developmental stage
  • C. elegans development
  • Drosophila development FBdv fly

development.obo OBO yes yes

  • Human developmental anatomy, abstract

version

  • Human developmental anatomy, timed version
  • Mosquito gross anatomy
  • Mouse adult gross anatomy
  • Mouse gross anatomy and development
  • C. elegans gross anatomy
  • Arabidopsis gross anatomy
  • Cereal plant gross anatomy
  • Drosophila gross anatomy
  • Dictyostelium discoideum anatomy
  • Fungal gross anatomy FAO
  • Plant structure
  • Maize gross anatomy
  • Medaka fish anatomy and development
  • Zebrafish anatomy and development
  • NCI Thesaurus
  • Mouse pathology
  • Human disease
  • Cereal plant trait
  • PATO PATO attribute and value.obo
  • Mammalian phenotype
  • Habronattus courtship
  • Loggerhead nesting
  • Animal natural history and life history

eVOC (Expressed Sequence Annotation for Humans)

slide-7
SLIDE 7

1603 1975 ICD9 1900 ICD 1855 1785 Synopsis Nosologiae Methodicae

OPCS SNOP CPT MESH ICD

MeSH OPCS EmTree SNOP CPT 1700 OPCS3 CTV3 READ 1975 2005 1985 1995 ICPC SNOMED-2 SNOMED International SNOMED-RT SNOMED-CT GALEN DM&D UMLS FMA OPCS4 OPCS4.3

History of Medical Vocabularies

slide-8
SLIDE 8

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

What do we need for navigation?

The bio-medical domain is rich in vocabularies and ontologies. Large lexical resource including textual definitions and synonyms There is a varying degree of semantics, expressivity and formality in these vocabularies (e.g. MeSH) and ontologies (e.g FMA). Most include some form of hierarchy. Hierarchies are well suited for driving navigation. Question: Do we want strict sub/super class relationships? Or, do we want looser notations such as broader/narrower?

slide-9
SLIDE 9

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Ontology or Vocabulary?

Initial approach to COHSE and SeaLife was to represent everything in OWL The strict semantics of OWL do not always lend themselves to sensible navigation, conversion from vocabularies to OWL are

  • difficult. It’s hard to model some things in OWL (e.g.

sometimes/always, probabilities etc) MeSH OBO/OWL

Head <-- Ear <-- Nose Accident <-- Traffic Accident <-- Accident Prevention

Nucleus part_of Cell Cell has_part Nucleus - Not always True

PolioVirus causes some PolioDisease

slide-10
SLIDE 10

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

SKOS (Simple Knowledge Organisation System)

Purpose: Subject Metadata and information retrieval RDF/XML representation - Semantic Web friendly. Importantly for us, the semantics are more suitable for document navigation

e.g. This document is about tuberculosis e.g. broader, narrower, related. http://www.w3.org/2004/02/skos/

Model for representing concept schemes, thesauri, classification system, taxonomies etc…

slide-11
SLIDE 11

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Conversion to SKOS

  • relationship:part_of --> skos:broader ( e.g. finger part_of hand)
  • relationship:contains --> skos:narrower ( e.g. skull contains brain)
  • relationship:causes --> skos:related ( e.g. PolioVirus causes PolioDisease)

Sub properties: skos:broader skos:narrower inverse

  • bo:part_of
  • bo:has_part

Leaves us open to migration back to OWL

slide-12
SLIDE 12

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Conversion to SKOS SKOS

MeSH OBO ontologies OWL ontologies SNOMED NeLI Taxonomies Concept Schemas Thesauri Other..

slide-13
SLIDE 13

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Advantage of this approach

For a given concept e.g. “Polio Virus”, we can query multiple resources and bring related concepts together.

Source Terms found SKOS relation to “Poliovirus” MeSH Brunhilde Virus skos:altTerm Disease Ontology Spinal cord disease skos:broaderThan Postpoliomyelitis Syndrome skos:narrowerThan SNOMED Microorganism skos:broaderThan Enterovirus skos:broaderThan

  • Rapid (and cheap!) generation of knowledge artefact
  • Take advantage efforts in multiple biomedical communities
  • We don’t have to make any strong ontological distinctions
slide-14
SLIDE 14

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Disadvantage of this approach

Trade off: Lose the inferential power when querying a knowledge resource Inability to do inconsistency checking Potentially large redundancy in our knowledge base Maintenance and scalability (>1000000 concepts) - especially for dynamic hyper- linking. Unwanted concepts & relationship - especially from OWL conversion e.g. ‘Physical Entity’, ‘Continuant’ etc…. Linking overload!

slide-15
SLIDE 15

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Plug for Manchester’s SKOS plug-in - Protégé 4

  • Instance hierarchy viewer
  • OBO or OWL --> SKOS wizards
  • Various rendering options
slide-16
SLIDE 16

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Conclusion

What does your semantic web application need?

Taken from Alistair Miles, XMLUK: “Ontologies and XML” 2005, slide

slide-17
SLIDE 17

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Acknowledgments

Manchester Robert Stevens Sean Bechhofer Yeliz Yesilada NeLI Patty Kostkova Other COHSE developers SeaLife project

Thank you.