


SLIDE 1

THE NATIONAL LIBRARY OF FINLAND – Library Network Services

Building a National Ontology Infrastructure

Matias Frosterus, Mirja Anttila, Mikko Lappalainen, Susanna Nykyri, Tuomas Palonen, Sini Pessala SWIB 2013

SLIDE 2

This presentation

  • Overview of the ONKI project
  • Linked ontology approach
  • Trilingual ontology
SLIDE 3

ONKI project

  • A joint project of the National Library of Finland, the Ministry of Finance and the Ministry of Education and Culture
  • The aim is to build a reliable, centralized, national ontology service named Finto

SLIDE 4

ONKI project

  • What does the ONKI project offer?
  • Publication of ontologies
  • Using ontologies in applications through various interfaces
  • The development of the General Finnish Upper Ontology YSO
  • Coordination of ontology work on a national scale
  • Improving interoperability across the spectrum by harmonizing annotations

SLIDE 5

ONKI project

  • Based on the FinnONTO research project, which ran at Aalto University and the University of Helsinki from 2003 to 2012
  • Focus on light-weight SKOS ontologies intended for annotations

  • Powered by ONKI Light
  • Open source
  • https://code.google.com/p/onki-light/
SLIDE 6

[Diagram: the Finto ontology service and its users]

Finto ontology service: ontology publication, browsing, interfaces, guidelines and support, General Upper Ontology, ONKI Light

Users: ontology developers, annotators, application developers, end users

SLIDE 7

The second part

  • Linked ontology approach
SLIDE 8

Linked ontology approach

[Diagram: two separate silos, each containing data, metadata and a thesaurus]

  • What we have:
  • Silos
  • Expert-made thesauri
  • A large amount of data and annotations

SLIDE 9

Linked ontology approach

[Diagram: the same two sets of data, metadata and thesaurus, shown without the silos]

  • What we want:
  • Eliminate the silos
  • Harmonize the annotations
SLIDE 10

Linked ontology approach

[Diagram: two sets of data and metadata, each with an ontology in place of the thesaurus]

  • How?
  • Ontologies are much easier to link together than thesauri
  • Concepts as opposed to terms
  • Explicit relations
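
The difference between comparing terms and linking concepts can be sketched in a few lines of Python. This is an illustrative model only, not code from ONKI or Finto: the `Concept` class, its field names, the example.org URIs and the labels are all assumptions made for the example.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Concept:
    """A concept is identified by its URI; terms are only labels attached to it."""
    uri: str
    labels: dict[str, str] = field(default_factory=dict)  # language code -> preferred label
    broader: set[str] = field(default_factory=set)        # explicit hierarchy (URIs)
    exact_match: set[str] = field(default_factory=set)    # explicit links to other ontologies


# Two hypothetical ontologies that use different terms for the same idea.
a = Concept("http://example.org/a/c1", labels={"en": "automobiles"})
b = Concept("http://example.org/b/c9", labels={"en": "cars"})

# A term-based comparison finds nothing in common ...
same_term = a.labels["en"] == b.labels["en"]

# ... while an explicit relation between the concepts records the link.
a.exact_match.add(b.uri)
linked = b.uri in a.exact_match

print(same_term, linked)  # False True
```

Because the link is stored against URIs rather than strings, renaming a label on either side would leave the mapping intact.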
SLIDE 11

Linked ontology approach

[Diagram: many sets of data and metadata, each with its own ontology, all linked to one another]

  • The problem:
SLIDE 12

Linked ontology approach

[Diagram: many sets of data and metadata, each with its own ontology, all linked to one another]

  • The problem:
  • A lot of work!

[Diagram callouts: "I have an update!", "I must react!", "Me too! Me too! Me too!"]

SLIDE 13

Linked ontology approach

[Diagram: each set of data and metadata has its own domain ontology, and the domain ontologies connect to a single General Upper Ontology]

  • The approach:
  • Limit the links between the ontologies
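
A rough way to see why limiting the links helps is to count them. The sketch below is a simplification that assumes one mapping per pair of ontologies; the function names are made up for the example, and 14 is used only because it matches the number of domain ontologies listed for KOKO later in the deck.

```python
def pairwise_links(n: int) -> int:
    """Mappings needed if every domain ontology is linked directly to every other."""
    return n * (n - 1) // 2


def hub_links(n: int) -> int:
    """Mappings needed if each domain ontology is linked only to one upper ontology."""
    return n


for n in (5, 14):
    print(n, pairwise_links(n), hub_links(n))
# 5 10 5
# 14 91 14
```

With pairwise linking, an update in one ontology may require checking every other ontology; with a shared upper ontology, each domain ontology only has to track one neighbour.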

SLIDE 14

Linked ontology approach

[Diagram: sets of data and metadata linked to the domain ontologies JUHO, TERO, LIITO, MAO, … and AFO, with YSO and KOKO also shown]

  • In practice:
SLIDE 15

Linked ontology approach: KOKO

Ontology  Domain                  Concepts
YSO       General upper ontology    24 800
MAO       Museum artifacts           6 800
MUSO      Music                      1 000
TAO       Design                     3 000
TERO      Health                     6 500
VALO      Photography                2 000
AFO       Agriculture                7 000
JUHO      Government                 6 300
KAUNO     Literature                 5 000
KTO       Linguistics                  900
KITO      Literary research            850
KULO      Cultural research          1 500
LIITO     Economics                  3 000
MERO      Seafaring                  1 300
PUHO      Military                   2 000

SLIDE 16

Challenges to be tackled

  • Propagating the changes in the general upper ontology to the domain ontologies
  • Locating the overlapping concepts between the domain ontologies
  • Not always simple
  • Labels might be misleading
  • Ontological structure can help
  • Coordinating the use and development of ontologies on a national level
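
One way to picture how ontological structure can help when labels are misleading is the sketch below. It is not the project's matching method: the `Concept` class, the `candidate_overlaps` function, the URIs and the sample concepts are all hypothetical, and comparing the labels of the broader concepts is just one simple structural check.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Concept:
    uri: str
    label: str
    broader_label: str | None  # label of the parent concept, if any


def candidate_overlaps(onto_a: list[Concept], onto_b: list[Concept]) -> list[tuple[str, str, bool]]:
    """Pair concepts with equal labels and flag whether their parents agree too."""
    by_label: dict[str, list[Concept]] = {}
    for c in onto_b:
        by_label.setdefault(c.label.lower(), []).append(c)

    pairs = []
    for a in onto_a:
        for b in by_label.get(a.label.lower(), []):
            structure_agrees = (
                a.broader_label is not None
                and b.broader_label is not None
                and a.broader_label.lower() == b.broader_label.lower()
            )
            pairs.append((a.uri, b.uri, structure_agrees))
    return pairs


# Hypothetical sample data: "bank" means different things in the two domains.
economics = [
    Concept("http://example.org/econ/bank", "bank", "financial institutions"),
    Concept("http://example.org/econ/harbour", "harbour", "transport infrastructure"),
]
seafaring = [
    Concept("http://example.org/sea/bank", "bank", "seabed formations"),
    Concept("http://example.org/sea/harbour", "harbour", "transport infrastructure"),
]

for pair in candidate_overlaps(economics, seafaring):
    print(pair)
```

Pairs whose labels match but whose parents differ are the ones a human editor would most want to review before declaring the concepts equivalent.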

SLIDE 17

The third part

  • Trilingual ontology
SLIDE 18

Ontology design

  • The relations between concepts can be designed in several ways

  • What affects these choices?
  • Corpora
  • Language
  • Culture
SLIDE 19

Trilingual ontology

  • In practice
  • YSO: General Finnish Upper Ontology
  • ”Finnish” as a culture
  • Finland has two official languages: Finnish and Swedish
  • Very different from one another
  • Lingua franca: English
SLIDE 20

YSO

  • Topmost hierarchy is inspired by DOLCE
  • Offers the general concepts needed for annotation in many domains
  • Complemented with a number of domain ontologies for specific use cases
  • Based on the General Finnish Thesaurus YSA
  • Used and developed for decades in the annotation of all Finnish non-fiction literature

SLIDE 21

Language affects the hierarchy

  • The Finnish word ’siirto’ means transfer

[Diagram: maansiirto, hiustensiirto and voimansiirto each point to siirto via skos:broader]

SLIDE 22

Language affects the hierarchy

  • The Finnish word ’siirto’ means transfer

[Diagram: the same hierarchy with English labels: earthmoving, hair transplantation and power transmission each point to transfer via skos:broader]
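
The two diagrams can be put side by side in a small sketch. The hierarchy and labels are taken from the slides; the dictionary layout and the helper names are assumptions for illustration, and checking for a shared word ending is only a crude stand-in for how natural the grouping feels to a speaker.

```python
# child -> parent, following the skos:broader arrows in the diagrams
BROADER = {
    "maansiirto": "siirto",
    "hiustensiirto": "siirto",
    "voimansiirto": "siirto",
}

# English labels for the same concepts, as shown on the second diagram
ENGLISH = {
    "siirto": "transfer",
    "maansiirto": "earthmoving",
    "hiustensiirto": "hair transplantation",
    "voimansiirto": "power transmission",
}


def narrower(parent: str) -> list[str]:
    """Concepts whose skos:broader is the given parent, sorted for stable output."""
    return sorted(child for child, p in BROADER.items() if p == parent)


def share_head_word(child: str, parent: str) -> bool:
    """True if the child label ends with the parent label."""
    return child.endswith(parent)


finnish = [share_head_word(c, "siirto") for c in narrower("siirto")]
english = [share_head_word(ENGLISH[c], ENGLISH["siirto"]) for c in narrower("siirto")]

print(finnish)  # [True, True, True]
print(english)  # [False, False, False]
```

In Finnish every narrower concept carries the parent word in its label, so the grouping looks obvious; in English the same structure has no such lexical support.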

SLIDE 23

Language affects the hierarchy

  • Finnish has a single concept for rivers
  • Swedish has three
  • Älv = Scandinavian river situated north of Göta älv (a specific river)
  • Å = Scandinavian river situated south of Göta älv
  • Flod = non-Scandinavian river
  • Flod = non-Scandinavian river
  • A distinction not used in Finland
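
The Swedish distinction, as defined on this slide, can be written as a small decision function. This is only a sketch of the slide's simplified definitions: the function name and parameters are invented for the example, and the Finnish word "joki" is supplied here as an assumption, since the slide does not name it.

```python
FINNISH_RIVER_TERM = "joki"  # assumption: the slide only says Finnish has a single concept


def swedish_river_term(scandinavian: bool, north_of_gota_alv: bool = False) -> str:
    """Pick the Swedish term using the three definitions given on the slide."""
    if not scandinavian:
        return "flod"
    return "älv" if north_of_gota_alv else "å"


def finnish_river_term(scandinavian: bool, north_of_gota_alv: bool = False) -> str:
    """Finnish makes no such distinction, so the arguments are ignored."""
    return FINNISH_RIVER_TERM


cases = [(True, True), (True, False), (False, False)]
print([swedish_river_term(*c) for c in cases])  # ['älv', 'å', 'flod']
print({finnish_river_term(*c) for c in cases})  # {'joki'}
```

Three Swedish concepts collapse into one Finnish concept, which is the kind of mismatch a trilingual ontology has to resolve one way or the other.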
SLIDE 24

Culture before language

  • Looking beyond the language
  • Realizing that language does affect the way we perceive the world
  • Building an ontology for a specific cultural sphere
  • Key to the harmonization of different annotations in different domains

SLIDE 25

The development of YSO

  • Mapping to other ontologies
  • Mapping to LCSH is underway
  • Building the guidelines for the development
  • How to choose the correct approach when a language clash leads to a concept clash?

SLIDE 26

Thank you!

matias.frosterus@helsinki.fi

onki-posti@helsinki.fi