using semantic mapping to manage heterogeneity in xliff
play

Using Semantic Mapping to Manage Heterogeneity in XLIFF - PowerPoint PPT Presentation

Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan OSullivan CNGL Centre for Global Intelligent Content at Trinity College Dublin Outline Localization industry


  1. Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan O’Sullivan CNGL Centre for Global Intelligent Content at Trinity College Dublin

  2. Outline • Localization industry – interoperability issues • Linked Data representation of localization content • Still has interoperability issues • Language Technology retraining workflow - use case • Our mapping representation • Evaluation • Conclusions

  3. Localization Industry Document Store Annotated Src XLIFF + Prioritised XLIFF HTML PE‘d Src/Tgt XLIFF glossary source XLIFF source XLIFF XLIFF source Identify Named Entity Extract & Machine Prioritise PE Post terms and Segment Recognition Translate based on QE edit translation Translation Workflow

  4. Linked Data Representation – L3 Data Document Store Annotated Src XLIFF + Prioritised XLIFF HTML PE‘d Src/Tgt XLIFF glossary source XLIFF source XLIFF XLIFF source Identify Named Entity Extract & Machine Prioritise PE Post XSLT terms and Segment Recognition Translate based on QE edit Mapper translation Translation Workflow Triple Store L3 data

  5. LT Retraining Workflow Document Store Annotated Src XLIFF + Prioritised XLIFF HTML PE‘d Src/Tgt XLIFF glossary source XLIFF source XLIFF XLIFF source Identify Named Entity Extract & Machine Prioritise PE Post XSLT terms and Segment Recognition Translate based on QE edit Mapper translation Translation Workflow Retrain? Train & deploy MT Triple Store Analyse Tool and select L3 data (GLOBIC unaware) (GLOBIC) New Mapping (GLOBIC training to ITS) L3 data (ITS) data Retraining Workflow

  6. Architecture Diagram of the Process 1. Application search for resources in the Triple Store SPARQL Application 2. None in application’s vocabulary, search for mappings processor 3. If mappings exist, then retrieve the SPIN representation 4. Convert the SPIN representation to SPARQL syntax via a call to the Triple Store SPIN API 5. Execute the SPARQL query via the SPARQL processor 6. Consume the newly created data SPIN API

  7. Mapping Requirements 1. A mapping entity must be expressed as RDF, with a unique URI, allowing it to be published as Linked Data 2. The executable statement must be a SPARQL query 3. The executable statement must be expressed as RDF and linked to a mapping entity 4. A mapping entity is to be modeled with associated meta-data

  8. Meta-data and SPIN • Meta-data properties from the GLOBIC and W3C PROV vocabularies: gic:wasCreatedBy, gic:mapDescription, prov:generatedAtTime, prov:wasRevisionOf • SPIN vocabulary to express SPARQL queries as RDF: SPIN Representation [] a sp:Select ; sp:templates ([ sp:object _:b1 ; SPARQL Query sp:predicate _:b2 ; sp:subject _:b3 ]); SELECT ?subject ?predicate ?object sp:where ([ sp:object _:b1 ; WHERE { ?subject ?predicate ?object } sp:predicate _:b2 ; sp:subject _:b3 ]). _:b3 sp:varName “subject”^^ xsd:string . _:b2 sp:varName “predicate"^^xsd:string . _:b1 sp:varName “object"^^ xsd:string .

  9. Mapping Representation Example Mapping Entity + Meta-data SPIN Representation of SPARQL Query ex:globic_to_its_mtScore_sp_2 a sp:Construct ; ex:globic_to_its_mtScore_map_1_1 a gic:Mapping ; sp:templates ([ sp:object _:b1 ; gic:hasRepresentation ex:globic_to_its_mtScore_sp_2 ; sp:predicate itsrdf:mtConfidence ; gic:wasCreatedBy ex:person_1 ; sp:subject _:b2 ]) ; prov:generatedAtTime “2014 -01-01 ”^^ xsd:date ; sp:where ([ sp:object _:b1 ; gic:mapDescription “Used to map MT confidence data from ------------------- sp:predicate gic:qualityAssessment ; ----------------- GLOBIC to ITS vobabulary ” ; sp:subject _:b2 ]) . gic:version “1.1”^^ xsd:float ; prov:wasRevisionOf ex:globic_to_its_mtScore_map_1 . _:b2 sp:varName "s"^^xsd:string . _:b1 sp:varName "val"^^xsd:string .

  10. Evaluation • Two initial experiments: 1. Test the mapping capabilities of SPARQL construct queries • R2R Framework – 70* test mappings • Reproduced R2R Evaluation • R2R test mappings as SPARQL construct queries • Compared results – SPARQL construct queries as expressive as R2R Framework 2. Test the expressiveness of SPIN vocabulary with regard to expressing SPARQL construct queries as RDF • Carried out using online SPIN RDF Converter and TopBraid composer • Input the SPARQL construct queries from first evaluation • SPIN could represent all queries in RDF • Suitable vocabulary to use

  11. Conclusions • Mapping representation to increase interoperability within heterogeneous workflows • All aspects of mapping representation published as Linked Data • Discovery of the mappings through SPARQL queries - ultimately executed through SPARQL processor • Evaluation – Capabilities of SPARQL construct queries and expressiveness of SPIN • Not just relevant to localization workflows, useful in other Linked Data scenarios

  12. Thank You Questions?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend