Using Semantic Mapping to Manage Heterogeneity in XLIFF - - PowerPoint PPT Presentation

using semantic mapping to manage heterogeneity in xliff
SMART_READER_LITE
LIVE PREVIEW

Using Semantic Mapping to Manage Heterogeneity in XLIFF - - PowerPoint PPT Presentation

Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan OSullivan CNGL Centre for Global Intelligent Content at Trinity College Dublin Outline Localization industry


slide-1
SLIDE 1

Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability

by

Dave Lewis, Rob Brennan, Alan Meehan, Declan O’Sullivan

CNGL Centre for Global Intelligent Content at Trinity College Dublin

slide-2
SLIDE 2

Outline

  • Localization industry – interoperability issues
  • Linked Data representation of localization content
  • Still has interoperability issues
  • Language Technology retraining workflow - use case
  • Our mapping representation
  • Evaluation
  • Conclusions
slide-3
SLIDE 3

Localization Industry

Document Store

Extract & Segment Named Entity Recognition Identify terms and translation Prioritise PE based on QE Post edit Machine Translate

HTML source Annotated XLIFF source Src XLIFF + glossary Src/Tgt XLIFF Prioritised XLIFF PE‘d XLIFF XLIFF source

Translation Workflow

slide-4
SLIDE 4

Linked Data Representation – L3 Data

Document Store Triple Store

Extract & Segment Named Entity Recognition Identify terms and translation Prioritise PE based on QE Post edit Machine Translate

HTML source Annotated XLIFF source Src XLIFF + glossary Src/Tgt XLIFF Prioritised XLIFF PE‘d XLIFF L3 data XLIFF source

Translation Workflow

XSLT Mapper

slide-5
SLIDE 5

LT Retraining Workflow

Document Store Triple Store

Extract & Segment Named Entity Recognition Identify terms and translation Prioritise PE based on QE Post edit Machine Translate

HTML source Annotated XLIFF source Src XLIFF + glossary Src/Tgt XLIFF Prioritised XLIFF PE‘d XLIFF L3 data (GLOBIC) New training data

Train & deploy MT Tool (GLOBIC unaware) Analyse and select

Retrain? XLIFF source

Retraining Workflow Translation Workflow

Mapping (GLOBIC to ITS) L3 data (ITS)

XSLT Mapper

slide-6
SLIDE 6

Architecture Diagram of the Process

Triple Store Application SPARQL processor SPIN API

  • 1. Application search for resources in the Triple Store
  • 2. None in application’s vocabulary, search for mappings
  • 3. If mappings exist, then retrieve the SPIN representation
  • 4. Convert the SPIN representation to SPARQL syntax via a call to the

SPIN API

  • 5. Execute the SPARQL query via the SPARQL processor
  • 6. Consume the newly created data
slide-7
SLIDE 7

Mapping Requirements

  • 1. A mapping entity must be expressed as RDF, with a unique URI, allowing it to be

published as Linked Data

  • 2. The executable statement must be a SPARQL query
  • 3. The executable statement must be expressed as RDF and linked to a mapping

entity

  • 4. A mapping entity is to be modeled with associated meta-data
slide-8
SLIDE 8

Meta-data and SPIN

  • Meta-data properties from the GLOBIC and W3C PROV vocabularies:

gic:wasCreatedBy, gic:mapDescription, prov:generatedAtTime, prov:wasRevisionOf

  • SPIN vocabulary to express SPARQL queries as RDF:

SELECT ?subject ?predicate ?object WHERE { ?subject ?predicate ?object } [] a sp:Select ; sp:templates ([ sp:object _:b1 ; sp:predicate _:b2 ; sp:subject _:b3 ]); sp:where ([ sp:object _:b1 ; sp:predicate _:b2 ; sp:subject _:b3 ]). _:b3 sp:varName “subject”^^xsd:string . _:b2 sp:varName “predicate"^^xsd:string . _:b1 sp:varName “object"^^xsd:string . SPARQL Query SPIN Representation

slide-9
SLIDE 9

Mapping Representation Example

ex:globic_to_its_mtScore_map_1_1 a gic:Mapping ; gic:hasRepresentation ex:globic_to_its_mtScore_sp_2 ; gic:wasCreatedBy ex:person_1 ; prov:generatedAtTime “2014-01-01”^^xsd:date ; gic:mapDescription “Used to map MT confidence data from -------------------

  • ---------------- GLOBIC to ITS vobabulary” ;

gic:version “1.1”^^xsd:float ; prov:wasRevisionOf ex:globic_to_its_mtScore_map_1 . ex:globic_to_its_mtScore_sp_2 a sp:Construct ; sp:templates ([ sp:object _:b1 ; sp:predicate itsrdf:mtConfidence ; sp:subject _:b2 ]) ; sp:where ([ sp:object _:b1 ; sp:predicate gic:qualityAssessment ; sp:subject _:b2 ]) . _:b2 sp:varName "s"^^xsd:string . _:b1 sp:varName "val"^^xsd:string .

Mapping Entity + Meta-data SPIN Representation of SPARQL Query

slide-10
SLIDE 10

Evaluation

  • Two initial experiments:

1. Test the mapping capabilities of SPARQL construct queries

  • R2R Framework – 70* test mappings
  • Reproduced R2R Evaluation
  • R2R test mappings as SPARQL construct queries
  • Compared results – SPARQL construct queries as expressive as R2R Framework

2. Test the expressiveness of SPIN vocabulary with regard to expressing SPARQL construct queries as RDF

  • Carried out using online SPIN RDF Converter and TopBraid composer
  • Input the SPARQL construct queries from first evaluation
  • SPIN could represent all queries in RDF
  • Suitable vocabulary to use
slide-11
SLIDE 11

Conclusions

  • Mapping representation to increase interoperability within heterogeneous workflows
  • All aspects of mapping representation published as Linked Data
  • Discovery of the mappings through SPARQL queries - ultimately executed through SPARQL

processor

  • Evaluation – Capabilities of SPARQL construct queries and expressiveness of SPIN
  • Not just relevant to localization workflows, useful in other Linked Data scenarios
slide-12
SLIDE 12

Thank You

Questions?