Entitifying Europeana: Building an ecosystem of networked references - - PowerPoint PPT Presentation

entitifying europeana building an ecosystem of networked
SMART_READER_LITE
LIVE PREVIEW

Entitifying Europeana: Building an ecosystem of networked references - - PowerPoint PPT Presentation

Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects Hugo Manguinhas, Valentine Charles, Antoine Isaac, Tim Hill| Eur uropeana na F Found undat ion What at is European ana? a? The Platform for E


slide-1
SLIDE 1

Entitifying Europeana: Building an ecosystem of networked references for Cultural Objects

Hugo Manguinhas, Valentine Charles, Antoine Isaac, Tim Hill|

Eur uropeana na F Found undat ion

slide-2
SLIDE 2

What at is European ana? a?

CC BY-SA

We aggregate metadata:

  • From all E

U countries

  • ~3,500 galleries, libraries,

archives and museums

  • More than 53M objects
  • In about 50 languages
  • Huge amount of references to

places, agents, concepts, time

Eur uropeana na aggregat ion n inf nfrast ruc uct ur ure Europeana| | CC BY-SA

The Platform for E urope’s Digital Cultural Heritage

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-3
SLIDE 3

European ana a Linked Dat at a a St rat at egy

Our e efforts and l lines of

  • f w

wor

  • rk

CC BY-SA

  • Europeana Data Model (EDM) offers a base for linking data
  • We apply automatic enrichment to link source data to reference

data

  • We encourage data providers to contribute their own

vocabularies so that we can benefit from data links made at data providers’ level

  • We encourage alignment activities between domain

vocabularies

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

Significant progress have been made, most of it presented in past SWIB!

slide-4
SLIDE 4

European ana a Linked Dat at a a St rat at egy

A strategy f for E Entities

CC BY-SA

As a cornerstone for our strategy we are building an "Entity Collection"

  • A service that acts as a centralized point of reference and

access to data about contextual entities

  • Caching and curating data from the wider Linked Open Data

cloud

  • A sort of Europeana "knowledge graph"

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-5
SLIDE 5

European ana a Linked Dat at a a St rat at egy

Motiv ivatio ion

CC BY-SA

  • Improve user experience
  • S

upport better ways of searching and navigating through the collections, eliminating ambiguity and clarifying the meaning of descriptions

  • Adapt better to the language of the user
  • by improving the interlinking of data
  • Brings more context to the objects
  • Alleviates polysemy issues
  • E

xpands language coverage

  • Contributes to build a web of data ('knowledge graph') that

third parties can use to improve their users' experience

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-6
SLIDE 6

The E Ent it it y Colle llect io ion

Use C Cases

CC BY-SA

Europeana Collections Portal

  • Findability: users can look for entities, not
  • nly records (Entity-Based Search)
  • Understandability: Entity Pages group and

present all assertions about an entity

  • Exploration: Navigation along relationships

becomes possible

Crowdsourcing

  • Objects can be annotated with references to

entities

  • A controlled vocabulary for client applications

Enrichment of Provider’s Data

  • A controlled vocabulary to help identify

named references to entities

Republication for Re-use

  • Entities can be republished as an open

source to the community

Entity Collection

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-7
SLIDE 7

The E Ent it it y Colle llect io ion

What c can it e enable?

CC BY-SA

Semantic auto- completion Semantic and Metadata annotations Entity Pages Entity based facets

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

Google Knowledge Card Pundit Annotation Client Food & Drink Project

slide-8
SLIDE 8

The E Ent it it y Colle llect io ion

How d do we choos

  • se o
  • ur t

target v vocabularies?

CC BY-SA

As defined in the recent E uropeana Tech Task Force on enrichment and evaluation (presented last year), we consider the following criteria when selecting a vocabulary:

  • Properly documented and supported by a community
  • Technically available on the web according to the Linked Data best

practices and recipes

  • Available under an open licence
  • Multilingual
  • Abide to a minimal ontological commitment principle
  • Apply the best practices and standards for the representation, structure

and description of vocabularies

  • Well-connected internally and externally to other vocabularies (preferably

spine vocabularies)

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-9
SLIDE 9

The E Ent it it y Colle llect io ion

Which target vocabularies are w we u using?

CC BY-SA

For historical reasons, the target vocabularies correspond to the ones being used for S emantic E nrichment (as of November 2016):

  • Places

a subset of Ge Geonames, corresponding to places which are part of E uropean countries and of some specific feature classes.

  • Agents

a subset of DBp Bpedia corresponding to most of the instances of dbp:Artist with some exceptions, and integrated from 49 DBpedia language editions.

  • Concepts

a subset of DBp Bpedia corresponding to a handful of concepts matching the needs from E uropeana Collections.

  • Time S

pans

The chronological periods from Semi miumT mTime me.

214,307

resources

274

resources

165,008

resources

2,566

resources

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-10
SLIDE 10

The E Ent it it y Colle llect io ion

Cont ntribut ution t n to mul ultilingua ual c coverage

CC BY-SA

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

Entities effectively used to enrich Europeana Objects Entities present in the Entity Collection

slide-11
SLIDE 11

The E Ent it it y Colle llect io ion

Are t these target vocabularies es en enough?

CC BY-SA

  • Not enough coreferencing information to other vocabularies
  • particularly to the ones we receive from data providers (e.g.

musical instruments, MIMO)

  • Labels and values are not always accurate and normalized
  • need for better reference data (e.g. VIAF)
  • Missing relevant information
  • e.g. roles and professions
  • Need to expand coverage to other types of entities
  • namely Works and E

vents

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-12
SLIDE 12

The E Ent it it y Colle llect io ion

Challenges

CC BY-SA

Investigate and design strategies for:

  • Integrating new vocabularies that can further improve
  • entity descriptions and multilingual coverage (e.g. VIAF)
  • linking between entities (e.g. Wikidata)
  • Integrating alignments, in particular:
  • links between local/domain vocabularies to pivot vocabularies
  • S

upporting manual curation of existing and new entities

  • Keeping up-to-date the information collected from external sources

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-13
SLIDE 13

The E Ent it it y Colle llect io ion

Our r roadmap f for t the next y years

CC BY-SA

  • Mint E

uropeana UR Is for E ntities and update internal references

  • Make entity services and data available via an API
  • Make use of the API in the Collections Portal
  • Implement support for new vocabularies and entity types

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

✔ ✔

slide-14
SLIDE 14

The E Ent it it y Colle llect io ion

Alpha r relea ease of our n new Entity API

CC BY-SA

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

More methods will come, for:

Creation, Update and Delete; UR I resolution to E uropeana E ntities

slide-15
SLIDE 15

The E Ent it it y Colle llect io ion

DBpedia resource for “ “Mozart” i in o

  • ur d

data

CC BY-SA

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

Coreference links to 6 other datasets

(e.g. Freebase, Wikidata)

Inter-linking information… still need to switch references to link to Europeana Entities Preferred labels for 48 languages

slide-16
SLIDE 16

The E Ent it it y Colle llect io ion

Entity A API - suggest m method

CC BY-SA

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

/entity/suggest.json?text=neo&lang=en&rows=6

slide-17
SLIDE 17

Con

  • nclusion
  • n

CC BY-SA

  • A S

trategy for E ntities is a “must” for E uropeana

  • There is no “one fits all” vocabulary
  • We have a long way to go…

...but we are making progress

SWIB16 16 - Ent nt it ifying ng Eur uropeana na: Bui uilding ng an n ecosyst em of ne net worked referenc nces for Cul ult ur ural Object s

slide-18
SLIDE 18

Thank you!

hugo.manguinhas@europeana.eu