: on the Semantic Web : on the Semantic Web Building a Semantic - - PowerPoint PPT Presentation

on the semantic web
SMART_READER_LITE
LIVE PREVIEW

: on the Semantic Web : on the Semantic Web Building a Semantic - - PowerPoint PPT Presentation

Family History Research Family History Research : on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a Semantic Prototype for Danish Genealogical Research Genealogical Research By By Charla Woodbury


slide-1
SLIDE 1

Family History Research Family History Research

  • n the Semantic Web
  • n the Semantic Web:

:

Building a Semantic Prototype for Danish Building a Semantic Prototype for Danish Genealogical Research Genealogical Research

By By

Charla Woodbury and David W. Embley Charla Woodbury and David W. Embley BYU Computer Science Department BYU Computer Science Department

charlajw@cs.byu charlajw@cs.byu.edu .edu embley@cs.byu.edu embley@cs.byu.edu

Family History Technology Institute Family History Technology Institute March 24, 2005 March 24, 2005

Supported in part by NSF Supported in part by NSF

slide-2
SLIDE 2

2

Semantic Web Semantic Web Machine “Understandable” Web Machine “Understandable” Web DATA INFORMATION KNOWLEDGE MEANING

slide-3
SLIDE 3

3

Need for Semantic Web Need for Semantic Web

“The Semantic Web: … content that is meaningful to computers [and that] will unleash a revolution of new possibilities … Properly designed, the Semantic Web can assist the evolution of human knowledge …”

(Tim Berners-Lee, …, Weaving the Web)

slide-4
SLIDE 4

4

Semantic Web Semantic Web ‘ ‘DATE DATE’ ’

Calendar date To date an artifact A fruit A romantic experience To go on a romantic experience with someone

slide-5
SLIDE 5

5

Also a Also a SURNAME SURNAME – –

  • Mr. C. J. Date
  • Mr. C. J. Date**

**

The semantic web will make it possible for machines to know the difference!

** Edgar F. Codd and C. J. Date are famous in the area of databases for defining levels of normal forms

slide-6
SLIDE 6

6

Real Problem Real Problem

A person decides to do family history research for the first time

  • n their Danish family lines.
  • Where do they go?
  • What records do they look for?
  • How do they handle records in Danish?
  • How can they tell when the records they have match their

search family?

slide-7
SLIDE 7

7

Semantic Web Semantic Web Ideal for Family History Ideal for Family History

SOLUTION – PROTOTYPE

 The heart of a one-stop web site for naïve

researchers

 So many records have been extracted into

digitized forms and are often available on the Web

 Limited geographically – parish and probate

records from Nim District, Skanderborg, Denmark

  • 100% probates

100% probates

  • 100% marriages

100% marriages

slide-8
SLIDE 8

8

Semantic Web Prototype

Ontology – semantic model

(BYU Ontos)

Annotated web pages

(Web Ontology Language OWL proposed W3C Feb 2004)

Solutions for special genealogical problems

slide-9
SLIDE 9

9

Ontology Model Ontology Model

slide-10
SLIDE 10

10

Person Matching Person Matching in genealogical research in genealogical research

 NAMES  DATES  PLACES  RELATIONS

slide-11
SLIDE 11

11

Ontology Entities Ontology Entities

FIND and MARK UP relevant web pages by:

  • NAME

NAME <NAME> <NAME>

  • DATE

DATE <DATE> <DATE>

  • PLACE

PLACE <PLACE> <PLACE>

  • RELATIONSHIP

RELATIONSHIP <RELATION> <RELATION>

  • OCCUPATION

OCCUPATION <OCCUPATION> <OCCUPATION>

  • RECORD_TYPE

RECORD_TYPE <RTYPE> <RTYPE>

  • SOURCE

SOURCE <SOURCE> <SOURCE>

slide-12
SLIDE 12

12

Partial Danish Partial Danish GIVEN NAME GIVEN NAME LEXICON LEXICON

MALE

  • And.

And.

  • Anders

Anders

  • Andreas

Andreas

  • Christen

Christen

  • Christian

Christian

  • Eric

Eric

  • Erik

Erik

  • Gregers

Gregers

  • Hans

Hans

  • Ib

Ib

  • Jacob

Jacob

  • Jens

Jens

  • Jep

Jep

FEMALE

  • Ane

Ane

  • Anna

Anna

  • Anne

Anne

  • Birthe

Birthe

  • Birte

Birte

  • Bodil

Bodil

  • Caroline

Caroline

  • Dorte

Dorte

  • Dorthe

Dorthe

  • Elene

Elene

  • Ellen

Ellen

  • Elisabeth

Elisabeth

  • Elsbeth

Elsbeth

slide-13
SLIDE 13

13

Partial Partial DATE DATE Lexicon Lexicon

(actual lexicon is a single list in alphabetic order) (actual lexicon is a single list in alphabetic order)

MONTHS

January –Jan –Januar -11br

Februrary –Feb –Februar -12br

March –Mar –Marts

April – Apr –Apl

May –Mai

June –Jun –Juni

July –Jul –Juli -5br

August –Aug –Augst -6br

September –Sep –Sept -7br –Septembre

October –Oct -8br –Octobre

November –Nov -9br –Novembre

December –Dec -10br -Decembre

TIME

Year –yr –aar –år

Month –mo –maaned –måned –m.

Week –uge –ug.

Day –dag –dg.

Hour – h. –hr.

FEAST DATES (partial)

Easter – Paaske –Påske –Paasche –Påsche

Pentecost – Pent –Pinse -Pin

Trinity –Tr –Trin –Trinitatis

DAYS OF WEEK

Sunday –Dominico –Dom.

Monday –Mondag –Mond.

Tuesday –Tirsdag –Tirsd.

Wednesday -Onsdag –Onsd.

Thursday –Tørsdag –Tørsd.

Friday –Fredag –Fred.

Saturday –Lørsdag –Lørs.

slide-14
SLIDE 14

14

Original Record Original Record

FHL Film#052,236 Tvilum Parish FHL Film#052,236 Tvilum Parish

slide-15
SLIDE 15

15

Web Page Web Page

  • SOURCE URL -

SOURCE URL -Tvilum Sogne Kirkebog Tvilum Sogne Kirkebog

  • [PAGE HEADER]

[PAGE HEADER] Fødde 1751 3 Fødde 1751 3

  • [BODY]

[BODY] Truust Dom. 23 p: Trinit: laest over Niels Truust Dom. 23 p: Trinit: laest over Niels Baches SØREN fadd. Johannes Michelsens og Niels Baches SØREN fadd. Johannes Michelsens og Niels Mollers hustruer af Søebyevad, Peder Rasmussen af Mollers hustruer af Søebyevad, Peder Rasmussen af Søebyevad, Jens Bachis søn Peder og Niels Thylkes s. Søebyevad, Jens Bachis søn Peder og Niels Thylkes s. Peder af Truust Peder af Truust

slide-16
SLIDE 16

16

Ontology Entities Ontology Entities

FIND and MARK UP relevant web pages by:

  • NAME

NAME <NAME> <NAME>

  • DATE

DATE <DATE> <DATE>

  • PLACE

PLACE <PLACE> <PLACE>

  • RELATIONSHIP

RELATIONSHIP <RELATION> <RELATION>

  • OCCUPATION

OCCUPATION <OCCUPATION> <OCCUPATION>

  • RECORD_TYPE

RECORD_TYPE <RTYPE> <RTYPE>

  • SOURCE

SOURCE<SOURCE> <SOURCE> Colors only represent OWL annotation mark-ups Colors only represent OWL annotation mark-ups automatically placed in the web page using the ontology automatically placed in the web page using the ontology

slide-17
SLIDE 17

17

Annotated Web Page Annotated Web Page

  • SOURCE -

SOURCE -Tvilum Parish Register Tvilum Parish Register

  • [PAGE HEADER]

[PAGE HEADER] Fødde Fødde 1751 1751 3 3

  • [BODY]

[BODY] Truust Truust

  • Dom. 23 p: Trinit:
  • Dom. 23 p: Trinit: laest over

laest over Niels Niels Baches Baches SØREN SØREN fadd. fadd. Johannes Michelsens Johannes Michelsens og

  • g Niels

Niels Mollers Mollers hustruer hustruer af af Søebyevad Søebyevad, , Peder Rasmussen Peder Rasmussen af af Søebyevad Søebyevad, , Jens Bachis Jens Bachis søn søn Peder Peder og

  • g Niels Thylkes

Niels Thylkes s. s. Peder Peder af af Truust Truust

slide-18
SLIDE 18

18

Results Listing Results Listing

TARGET – Jens Pedersen Bach Truust, Tvilum Parish, Gjern District, Skanderborg Date Range - born 1693 to died 1778 Name Date Place Relation Occupation Record Type Source (URL)

Jens Bachis

  • Dom. 23 p:

Trinit: 1751

(14 Nov 1751)

Truust fadd: fadd: Fødde Fødde

Tvilum Parish Register SOURCE - SOURCE -Tvilum Parish Register Tvilum Parish Register [PAGE HEADER] [PAGE HEADER] Fødde Fødde 1751 1751 3 3 [BODY] [BODY] Truust Truust

  • Dom. 23 p: Trinit:
  • Dom. 23 p: Trinit: laest over

laest over Niels Baches Niels Baches SØREN SØREN fadd. fadd. Johannes Michelsens Johannes Michelsens og

  • g Niels

Niels Mollers Mollers hustruer hustruer af af Søebyevad Søebyevad, , Peder Rasmussen Peder Rasmussen af af Søebyevad Søebyevad, , Jens Bachis

Jens Bachis

søn søn Peder Peder og

  • g

Niels Thylkes Niels Thylkes s. s. Peder Peder af af Truust Truust

slide-19
SLIDE 19

19

Conversion Functions Conversion Functions inside the ontology inside the ontology

  • Compute birthdate from age at death

Compute birthdate from age at death Death – 22 Mar 1743 Death – 22 Mar 1743 Age - 23 yr 2 m Age - 23 yr 2 m

  • >
  • > BIRTH

BIRTH Jan 1720 Jan 1720

  • Compute dates from feast dates

Sunday 23rd after Trinity 1751

  • > 14 Nov 1751
slide-20
SLIDE 20

20

Solutions for Special Problems Solutions for Special Problems

RULES FOR

  • Matching different name forms

Matching different name forms

  • Matching place names to appropriate records

Matching place names to appropriate records

slide-21
SLIDE 21

21

RULE RULE - Match different name forms

  • Match different name forms

as ONE PERSON as ONE PERSON

  • JENS PEDERSEN

JENS PEDERSEN

  • JENS PEDERSEN BACH

JENS PEDERSEN BACH

  • JENS BACH

JENS BACH

  • JENS BACHIS

JENS BACHIS

slide-22
SLIDE 22

22

PLACES - County Map of DENMARK

slide-23
SLIDE 23

23

Parish and District Map of Parish and District Map of SKANDERBORG SKANDERBORG

slide-24
SLIDE 24

24

Road Map Road Map www.expedia.com www.expedia.com

slide-25
SLIDE 25

25

Matching Places to Records Matching Places to Records

Farm name Parish District County Record Links

Molger Tamdrup Nim Skanderborg

PARISH Tamdrup 1684-1912 PROBATE Nim Herred Provisti Rask Skanderborg Rytterdistrikt

Tamdrup Nim Skanderborg

List of URL’s Includes Molger URL’s Adds Parish specific records

Nim Skanderborg

List of URL’s Includes Tamdrup URL’s Adds District specific records

Skanderborg

List of URL’s Includes all district URL’s Adds County specific records

slide-26
SLIDE 26

26

MAJOR CONTRIBUTIONS MAJOR CONTRIBUTIONS

 First genealogical prototype for the

semantic web

  • FOCUS on primary records

FOCUS on primary records

  • Not just an index of the records

Not just an index of the records

 Practical demonstration of the superiority of

the semantic web for research

 Portal for family history research that could

be easily expanded:

  • Maps

Maps

  • Look-ups

Look-ups

  • Helps

Helps

  • Research training

Research training

  • Other countries and states

Other countries and states

slide-27
SLIDE 27

27

QUESTIONS? QUESTIONS?