Linking the TEI
Approaches, Limitations, Use Cases
Christian Chiarcos & Maxim Ionov
{chiarcos|ionov}@cs.uni-frankfurt.de
Applied Computational Linguistics, Goethe Universität Frankfurt, Germany
DH2019, Utrecht, 2019-07-11
inline XML solutions ?
breaks if we have dynamic TEI content
generate (textual) documentation
generate actual schemas
interoperable.
interpretation
For problems not documented in the TEI documentation, different customizations will not be interoperable.
E.g., when trying to encode RDF triples in the TEI ;)
interesting consequences for extension and interchange.
(Bauman & Flanders 2004, bullet points by us)
text (URI): perseus-nt:tlg002.perseus-grc1:9.35
prefix declarations:
PREFIX perseus-nt: <http://www.perseus.tufts.edu/hopper/text?doc=urn:cts:greekLit:tlg0031.>
PREFIX saws-nt: <http://www.ancientwisdoms.ac.uk/cts/urn:cts:greekLit:tlg3017.Syno298.sawsGrc01:divedition.>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX saws: <http://purl.org/saws/ontology#>
RDFS URL: machine-readable representation of a particular vocabulary
graph (RDF), in Turtle:
perseus-nt:tlg002.perseus-grc1:9.35 rdfs:label "... πάντων ἔσχατος ..." .
saws-nt:divsection1.o14.a107 rdfs:label "ἔσχατος πάντων" .
saws-nt:divsection1.o14.a107 saws:isVariantOf perseus-nt:tlg002.perseus-grc1:9.35 .
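To make the role of the prefix declarations concrete, here is a minimal sketch in plain Python (standard library only) that expands the prefixed names from the example into full IRIs and prints the three triples as N-Triples. The prefix table and the triples are copied from the slides. The helper names `expand` and `ntriple` are hypothetical, and the sketch assumes literals that need no escaping. It is an illustration of prefix expansion, not a replacement for an RDF library.

```python
# Prefix table copied from the prefix declarations on the slides.
PREFIXES = {
    "perseus-nt": "http://www.perseus.tufts.edu/hopper/text?doc=urn:cts:greekLit:tlg0031.",
    "saws-nt": "http://www.ancientwisdoms.ac.uk/cts/urn:cts:greekLit:tlg3017.Syno298.sawsGrc01:divedition.",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "saws": "http://purl.org/saws/ontology#",
}


def expand(pname: str) -> str:
    """Expand a prefixed name into a full IRI (hypothetical helper).

    Only the first colon separates prefix and local name; later colons,
    as in '9.35' citations, stay part of the local name.
    """
    prefix, local = pname.split(":", 1)
    return PREFIXES[prefix] + local


def ntriple(subj: str, pred: str, obj: str, obj_is_literal: bool = False) -> str:
    """Render one triple as an N-Triples line (hypothetical helper).

    Assumption: literals contain no quotes, backslashes or newlines,
    so no escaping is performed.
    """
    rendered = f'"{obj}"' if obj_is_literal else f"<{expand(obj)}>"
    return f"<{expand(subj)}> <{expand(pred)}> {rendered} ."


# The three statements of the final graph on the slides.
TRIPLES = [
    ("perseus-nt:tlg002.perseus-grc1:9.35", "rdfs:label", "... πάντων ἔσχατος ...", True),
    ("saws-nt:divsection1.o14.a107", "rdfs:label", "ἔσχατος πάντων", True),
    ("saws-nt:divsection1.o14.a107", "saws:isVariantOf", "perseus-nt:tlg002.perseus-grc1:9.35", False),
]

for triple in TRIPLES:
    print(ntriple(*triple))
```

Each printed line carries full IRIs, which shows that the prefixes are only an abbreviation mechanism: the graph itself consists of globally resolvable identifiers.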
a machine-readable, formal definition
validation
https://www.w3.org/DesignIssues/LinkedData.html
(version of July 2017) http://linguistic-lod.org/
snippets
* other URI-bearing attributes do exist, too
property: <g> "is instance of glyph type"
subject: @xml:id
target: @ref
... this would be a natural application of reification and established RDF vocabularies! The colleagues will certainly invent something, but ...
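The mapping above (subject from @xml:id, target from @ref, the element <g> itself as the property) can be sketched with Python's standard library. Everything concrete in this sketch is an assumption for illustration: the sample TEI fragment, the base URI, and the property IRI `http://example.org/vocab#isInstanceOfGlyphType` are hypothetical, since the slides do not name a vocabulary for this relation.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin

TEI = "{http://www.tei-c.org/ns/1.0}"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hypothetical property IRI and document base URI, for illustration only.
GLYPH_PROPERTY = "http://example.org/vocab#isInstanceOfGlyphType"
BASE = "http://example.org/edition.xml"

# Hypothetical TEI fragment: the second <g> lacks an @xml:id.
SAMPLE = """<ab xmlns="http://www.tei-c.org/ns/1.0">
  <seg>ana<g xml:id="g1" ref="#long-s"/>tomie</seg>
  <seg>no id here <g ref="#long-s"/></seg>
</ab>"""


def glyph_triples(xml_text, base=BASE):
    """Read every <g> as a triple: @xml:id (subject), <g> (property), @ref (target)."""
    root = ET.fromstring(xml_text)
    triples = []
    for g in root.iter(TEI + "g"):
        xml_id, ref = g.get(XML_ID), g.get("ref")
        if xml_id is None or ref is None:
            continue  # without both attributes there is no complete triple
        subject = urljoin(base, "#" + xml_id)
        target = urljoin(base, ref)  # resolves relative pointers such as '#long-s'
        triples.append((subject, GLYPH_PROPERTY, target))
    return triples


for s, p, o in glyph_triples(SAMPLE):
    print(f"<{s}> <{p}> <{o}> .")
```

The second <g> in the sample is skipped because it has no @xml:id, which illustrates one practical limit of this reading: the triple only exists where the encoder has supplied an identifier for the subject.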
child of <seg> and <ab>
Tittel, Bermúdez-Sabel & Chiarcos (2018)
:8 rdfs:label "anathomie" ; rdfs:seeAlso deaf:anatomie ; skos:definition "structure ..." .
dictionary entries
Display sense definitions for every lexical unit Display notes from critical apparatus
http://www.deaf-page.de/guichaulmTel/edition.html
https://www.w3.org/2012/pyRdfa/extract?uri=http://www.deaf-page.de/guichaulmTel/edition.html&format=turtle
as target graph
(FROM) URI
query=PREFIX+rdfs%3A+%3Chttp%3A...
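The URLs above follow a simple pattern: the pyRdfa extractor takes the page address as a `uri` parameter, and the resulting extractor URL can serve as the FROM graph of a SPARQL query that is itself passed URL-encoded as `query=...`. The sketch below builds both strings with Python's standard library and performs no network request. The query body is an assumption (the slide elides it after the rdfs prefix), and no endpoint is named here because the slides do not show one. Note that `urlencode` percent-encodes the inner page URL, whereas the slide shows it unencoded.

```python
from urllib.parse import urlencode

EDITION_URL = "http://www.deaf-page.de/guichaulmTel/edition.html"
EXTRACTOR = "https://www.w3.org/2012/pyRdfa/extract"


def extractor_url(page_url, fmt="turtle"):
    """Build the extractor URL for a page that carries RDFa."""
    return EXTRACTOR + "?" + urlencode({"uri": page_url, "format": fmt})


def query_parameter(graph_url):
    """URL-encode a SPARQL query that reads from graph_url.

    The SELECT clause is a hypothetical example; only the rdfs prefix
    and the use of FROM are taken from the slides.
    """
    query = (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT ?s ?label\n"
        f"FROM <{graph_url}>\n"
        "WHERE { ?s rdfs:label ?label }"
    )
    return urlencode({"query": query})


graph = extractor_url(EDITION_URL)
print(graph)
print(query_parameter(graph))
```

The second printed string begins with `query=PREFIX+rdfs%3A+%3Chttp%3A`, the same encoded form as the fragment shown on the slide.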
https://github.com/TEIC/TEI/issues/1860
http://tei-l.970651.n3.nabble.com/template/NamlServlet.jtp?macro=search_page&node=1692902&query=RDFa
ambiguous
largely to be considered tag abuse
If you want to provide a generic RDF view on, say, a digital edition or electronic data set:
Do you plan to update your TEI data?
Do you need W3C compliancy?
Do you want to combine human-readable and machine-readable views in multiple output documents?
Do you have the resources to create and maintain your own converters?
Do you want your data to be interoperable or interpretable?
Do specific TEI markup elements sufficiently cover your use case?
(decision diagram; branch labels: yes / no; interoperable / just interpretable (not interoperable))
(outcomes shown in the diagram: TEI+RDFa; TEI-XML + WebAnnotation (JSON-LD); native TEI/XML; TEI <relation>)
N3 your own converters
...
RDF/TTL, HTML+RDFa, ePub+RDFa
publication
RDF/HDT, JSON-LD, RDF/XML, RDF-Thrift
TEI <relation>
A proper follow-up publication is in preparation, stay tuned! Also, please share your thoughts and criticisms.