Open Ontology Repository Initiative Frank Olken Lawrence Berkeley - - PowerPoint PPT Presentation

open ontology repository initiative
SMART_READER_LITE
LIVE PREVIEW

Open Ontology Repository Initiative Frank Olken Lawrence Berkeley - - PowerPoint PPT Presentation

Open Ontology Repository Initiative Frank Olken Lawrence Berkeley National Laboratory National Science Foundation folken@nsf.gov presented to CENDI/NKOS Workshop World Bank Sept. 11, 2008 Version 6.0 DISCLAIMER Remarks, opinions, etc.


slide-1
SLIDE 1

Open Ontology Repository Initiative

Frank Olken

Lawrence Berkeley National Laboratory National Science Foundation folken@nsf.gov presented to CENDI/NKOS Workshop World Bank

  • Sept. 11, 2008

Version 6.0

slide-2
SLIDE 2
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 2

DISCLAIMER

  • Remarks, opinions, etc. about the Open

Ontology Repository (OOR) Initiative are solely my own and do not reflect the position(s) of either Lawrence Berkeley National Laboratory

  • r the National Science Foundation.
slide-3
SLIDE 3
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 3

Joint Work

This is joint work by many persons: Peter Yim, Leo Obrst, Mike Dean, Michelle Raymond, Mark Musen, Barry Smith, Fabian Neuhaus, Michael Gruninger, Pat Hayes, Steve Ray, Ravi Sharma, Natasha Noy, Deborah McGuiness, Pat Cassidy, Elisa Kendall, Evan Wallace, John Sowa, myself (Frank Olken), et al. Many of these slides have been adapted from the OOR Communique without attribution.

slide-4
SLIDE 4
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 4

What is the OOR Initiative?

  • Open = accessible, minimal intellectual property

encumbrances on the ontologies, preferably open source code for the repository

  • Ontology = formal conceptualization (degree of

formalization may vary: frames, graphs (RDF), logic (OWL-DL, Common Logic, ...)

  • Repository = collection of ontologies, related

materials, support for storage, retrieval, integration, etc.

  • Initiative = group/effort to create OOR
slide-5
SLIDE 5
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 5

Definition of Ontology Repository

"An ontology repository is a facility where ontologies and related information artifacts can be stored, retrieved and managed."

slide-6
SLIDE 6
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 6

Goals of OOR Initiative

  • To promote global use and sharing of
  • ntologies by:

– Establishing a hosted registry-repository; – Enabling open, federated, collaborative ontology

repositories, and

– Establishing best practices for expressing

interoperable ontologies and taxonomies in repositories.

slide-7
SLIDE 7
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 7

Why OOR?

  • To provide access to good quality ontologies for

use in a variety of software, and for education

  • To provide a place for persons to contribute

new ontologies

  • To provide a means for ontology queries,
  • ntology integration
  • To facilitate development of new, composite
  • ntologies
slide-8
SLIDE 8
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 8

Use Cases

  • Store a new ontology about time.
  • Find me a good ontology about time.
  • Find me a concept “duration” in ontology X
  • What is relationship between “event” in ontology X and

“occurrence” in ontology Y?

  • Find all “subclasses” of “event” in ontology Z
  • Find immediate superclasses of “event” in ontology W
  • Find a mapping between SIC and NAIC classifications
slide-9
SLIDE 9
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 9

Ontology Formality?

  • Degree and nature of formalization of
  • ntologies will vary:

– Frames, SKOS, RDF, Description Logics, First

Order Logic, Higher Order Logics

  • Formalizable:

– KR language has known syntax

  • Formal:

– KR language with well specified syntax – KR language with well specified semantics (e.g., a

model theory)

slide-10
SLIDE 10
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 10

Macro-level Operations

  • Macro-level = level of entire ontologies
  • Support storage, retrieval, query, annotation at

the ontology level

  • Relationships among ontologies:

– Extends, specializes, revises, ...

slide-11
SLIDE 11
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 11

Micro-level Operations

  • Micro-level:

– Partitions of ontologies – Individual concepts, axioms, ...

  • Micro-level Operations:

– Storage, querying, retrieval, updates, annotation – Inference – Mapping between concepts in different ontologies – Composition, integration of ontologies, portions of

  • ntologies
slide-12
SLIDE 12
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 12

Ontology Admission Criteria

  • Required attributes of an ontology for inclusion

in OOR:

– in a publicly described language and format. – read accessible. – in a formal language with a well-defined syntax. – has the required metadata. – has a clearly specified and clearly delineated

scope.

– versions of the ontology are clearly identified. – appropriately named.

slide-13
SLIDE 13
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 13

Quality Control

  • Build a repository of “good quality” ontologies
  • There will be an editorial function by which
  • ntologies will be reviewed to assess their

“quality”

  • Ontologies will be annotated with “quality

assessments”.

  • Quality assessments are intend as a guide to

users, ontology developers, integrators

  • Details yet to be settled, cf. ISO 11179
slide-14
SLIDE 14
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 14

Metadata Uses

  • determine whether an ontology is suitable for a user

purpose;

  • capture the design rationales that underlie the
  • ntology;
  • find information about author, author credentials, and

source of ontology reference material

  • retrieve ontologies for use in domain applications;
  • retrieve ontologies to be integrated with other
  • ntologies;
  • retrieve ontologies that will be extended to create new
  • ntologies;
slide-15
SLIDE 15
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 15

Types of Metadata

  • Knowledge Representation Language
  • Modularity (module structure)
  • Relationships among ontologies
  • Provenance
  • Version
  • Existing applications of the ontology (e.g.

interoperability, search, decision support)

  • Domain-specificity
slide-16
SLIDE 16
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 16

Provenance Metadata

  • Many ontologies are based on definitions taken

from government regulations, legislation, court decisions, standards, ...

  • Need fine grained provenance, citations for

individual concepts, axioms which cite authoritative sources for each item in ontology

  • This is especially important for ontologies used

for administrative or regulatory applications in federal agencies (e.g., EPA, FDA, OSHA, IRS, ...)

slide-17
SLIDE 17
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 17

Repository Architecture

  • Federated
  • Service Oriented Architecture (SOA)
  • Scalable.
  • Optimized for sharing, collaboration and reuse.
  • Supporting ontologies in multiple formats and levels of

formalism.

  • Distributed repositories.
  • Explicit machine usable/accessible formal semantics

for the meta-model of the repository.

slide-18
SLIDE 18
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 18

Repository Architecture (cont.)

  • A mechanism to address intellectual property and

related legal issues/problems.

  • Support for adding, searching and mapping across
  • ntologies and data related to the stored ontologies.
  • Support additional services both directly within the

province of the repository and as external services.

  • Support all phases of the ontology life cycle.
slide-19
SLIDE 19
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 19

Repository Services

  • Ontology creation tools
  • Ontology editors
  • Ontology differencing tools
  • Ontology modularization tools (clustering, etc.)
  • Ontology export
  • Ontology visualization (e.g., graph visualization)
  • Version management
  • Access control
  • Ontology syntax checking
  • Ontology consistency checking – no cycles in taxonomies
slide-20
SLIDE 20
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 20

Status of OOR Initiative

  • Ontology Summit in April of 2008
  • Ongoing teleconference discussions of:

– Integration of component technologies, gathering use cases

& requirements, OOR architecture, open source code development, knowledge representation language, specification of ontology mappings, outreach to early adopters, ...

  • Early discussions about:

– collaboration between contributing institutions, preparation

  • f OOR research proposal(s) to funding agencies

– Producing a joint OOR-Ontolog panel series

  • See: http://ontolog.cim3.net/cgi-bin/wiki.pl?

OpenOntologyRepository

slide-21
SLIDE 21
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 21

How to participate?

  • Read the OOR Wiki:

– http://ontolog.cim3.net/cgi-bin/wiki.pl?

OpenOntologyRepository

  • Look at Ontology Summit 2008

– http://ontolog.cim3.net/cgi-bin/wiki.pl?

OntologySummit2008

  • Listen to previous teleconferences, read mailing list

archive

  • Contact peter.yim@cim3.com to join mailing list
  • Join the teleconferences
  • Read / Contribute to email discussion
  • Help write the OOR proposal(s)
slide-22
SLIDE 22
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 22

Supplementary Material

slide-23
SLIDE 23
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 23

Repository Goals

  • Supporting the Open Ontology Repository (OOR) Initiative

that will promote the global use of ontologies, their instance bases, rules, and services, and mappings among these.

  • Enabling and facilitating open, federated, collaborative
  • ntology repositories.
  • Establishing best practices for expressing interoperable
  • ntology work in open registries/repositories.
  • Enabling and facilitating the development of common

services to support the repository and to extend the capabilities available to providers, users, and developers who use the repository.

slide-24
SLIDE 24
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 24

Repository Management

  • enforce access policies
  • enforce submission policies
  • enforce governance policies
  • enforce change management policies
  • control user and administrator access
slide-25
SLIDE 25
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 25

Registry vs. Repository

  • Registry:

– Contains a list of ontologies and some related

metadata about the ontologies

  • Repository:

– Contains actual ontologies, not just a list of

  • ntologies

– Supports storage, retrieval, updates, integration of

  • ntologies
slide-26
SLIDE 26
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 26

Repository Functions

  • create usage reports
  • validate syntax
  • check logical consistency
  • automatically categorize a submission
slide-27
SLIDE 27
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 27

Repository Search Criteria

  • domain
  • author/creator/source
  • version
  • language
  • terminology and controlled vocabularies
  • quality
  • mapping
  • inference
slide-28
SLIDE 28
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 28

Related Work

  • XMDR project – metadata repositories (LBNL,

UC Berkeley, et al.) ISO 11179 prototype

  • Bioportal, Protege [medical ontologies,
  • ntology editor, etc. ](Stanford)
  • Swoogle (UMBC) ontology search engine
  • OBO [biological ontologies, frame based]
  • Ecocyc, etc. [microbial ontologies, frame based]

(SRI)

  • OMG ODM (Ontology Definition Metamodel)
slide-29
SLIDE 29
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 29

Knowledge Representation Languages

  • Resource Description Framework (RDF)
  • OWL-DL (description logic)
  • OWL-Full (first order logic)
  • Common Logic (first order logic)
  • OBO (frame based)
slide-30
SLIDE 30
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 30

Additional Resources

  • http://ontolog.cim3.net/cgi-bin/wiki.pl?

OntologySummit2008_Communique

  • http://ontolog.cim3.net/cgi-bin/wiki.pl?

OntologySummit2008

  • http://ontolog.cim3.net/cgi-bin/wiki.pl?

OpenOntologyRepository

slide-31
SLIDE 31
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 31

Contact Info

  • Frank Olken

– folken@nsf.gov – 703-292-7350 – National Science Foundation

  • Computer and Information Science and Engineering

Directorate

  • Intelligent Information Systems Division
  • Information Integration and Informatics Program
  • 4201 Wilson Blvd., Suite 1125, Arlington, VA 22230
slide-32
SLIDE 32
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 32

OOR Initiative Contacts

  • Peter Yim

– peter.yim@cim3.com

  • Leo Obrst (MITRE)

– lobrst@mitre.org

  • Michelle Raymond (Honeywell)

– michellearaymond@gmail.com

  • Mark Musen (Stanford)

– musen@stanford.edu

  • Mike Dean (BBN)

– mdean@bbn.com

slide-33
SLIDE 33
  • Sept. 11, 2008

OOR Talk to CENDI/NKOS, F. Olken 33

Acknowledgments

  • My portion of this work has been supported by

the National Science Foundation under the Individual Research Plan, as a part of my IPA appointment to NSF from Lawrence Berkeley National Laboratory.