Interactive alignment of Parallel Texts a cross browser experience - - PowerPoint PPT Presentation

interactive alignment of parallel texts a cross browser
SMART_READER_LITE
LIVE PREVIEW

Interactive alignment of Parallel Texts a cross browser experience - - PowerPoint PPT Presentation

Interactive alignment of Parallel Texts a cross browser experience (standards in practice) Gavin Brelstaff (gjb@ crs4.it) CRS4 09010 Pula (CA) Sardinia, Italy Francesca Chessa University of Sassari, Italy Multilingual Web Workshop Pisa


slide-1
SLIDE 1

MLW Pisa 2011 G.Brelstaff & F.Chessa 1

Interactive alignment of Parallel Texts – a cross browser experience

Gavin Brelstaff (gjb@ crs4.it) CRS4 09010 Pula (CA) – Sardinia, Italy Francesca Chessa University of Sassari, Italy Multilingual Web Workshop Pisa April 2011

(standards in practice)

slide-2
SLIDE 2

MLW Pisa 2011 G.Brelstaff & F.Chessa 2

Introduction Alignment of parallel texts; multi-lingual; minority languages; poetry But why?

Dante’s was a minority language.

slide-3
SLIDE 3

MLW Pisa 2011 G.Brelstaff & F.Chessa 3

Genius loci the creative spirits

  • f place – geolocated.

Minority language a seed-bed for poetic expression, beyond mere communication.

He was the cat that walked by himself and all places were alike to him. Kipling

“Think global, act local” “Think local, act global”

Whenever we lose a language the “genetic basis” for such expression diminishes, globally

slide-4
SLIDE 4

MLW Pisa 2011 G.Brelstaff & F.Chessa 4

Echo Chamber Minority language Island language (song,verse,prose)

slide-5
SLIDE 5

MLW Pisa 2011 G.Brelstaff & F.Chessa 5

Echo Chamber in poet’s head

slide-6
SLIDE 6

MLW Pisa 2011 G.Brelstaff & F.Chessa 6

Echo Chamber inside the head (ear,tongue,thought)

slide-7
SLIDE 7

MLW Pisa 2011 G.Brelstaff & F.Chessa 7

  • Echo Chamber

inside the head (ear,tongue,thought, eye)

slide-8
SLIDE 8

MLW Pisa 2011 G.Brelstaff & F.Chessa 8

  • Language Barrier
slide-9
SLIDE 9

MLW Pisa 2011 G.Brelstaff & F.Chessa 9

Language Barrier Cultural context B Cultural context A

cf R.Jakobson

d i f f u s i

  • n

d i f f u s i

  • n

d i f f u s i

  • n

d i f f u s i

  • n
slide-10
SLIDE 10

MLW Pisa 2011 G.Brelstaff & F.Chessa 10

Minority language Global language

  • smosis

diffusion diffusion diffusion “cellular membrane” d i f f u s i

  • n

Language Barrier Assist

avoiding dilution, shrivelling, bursting.

d i f f u s i

  • n
slide-11
SLIDE 11

MLW Pisa 2011 G.Brelstaff & F.Chessa 11

  • Language Barrier
slide-12
SLIDE 12

MLW Pisa 2011 G.Brelstaff & F.Chessa 12

  • Parallel text alignment ↔ to communicate semantics
  • standards-based markup
  • web delivery, cross-browser
  • non-verbal interactvity
  • beyond GoogleTranslate

  • Midway

Nel mezzo

Translator

slide-13
SLIDE 13

MLW Pisa 2011 G.Brelstaff & F.Chessa 13

Beyond GoogleTranslate:

  • SMT not going to translate poetry well any time soon.
  • We allow the translator to clarify by alignment
  • Point-&-click interface to modify standard markup
  • Colour-code: formal & dynamic equivalence [Nida-Taber]
  • Demo

Parallel text alignment web interface

slide-14
SLIDE 14

MLW Pisa 2011 G.Brelstaff & F.Chessa 14

Demo (a desktop browser: IE8-9,FF3-4,Opera11,Chrome,Safari)

slide-15
SLIDE 15

MLW Pisa 2011 G.Brelstaff & F.Chessa 15

Demo: selection by click

slide-16
SLIDE 16

MLW Pisa 2011 G.Brelstaff & F.Chessa 16

Demo: selection & alignment

slide-17
SLIDE 17

MLW Pisa 2011 G.Brelstaff & F.Chessa 17

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL not w3cRange

Standards in practice

Pros & Cons

slide-18
SLIDE 18

MLW Pisa 2011 G.Brelstaff & F.Chessa 18

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL

Cons: #1

We can’t interact directly with Semantics Browsers only bind events to XHTML (why not XML?) elements Incurs two degrees of messy indirection.

not w3cRange

slide-19
SLIDE 19

MLW Pisa 2011 G.Brelstaff & F.Chessa 19

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL

Cons: #2

w3cRange is not “road worthy”. We resort to Click to Select Selection within words still lacking.

not w3cRange

slide-20
SLIDE 20

MLW Pisa 2011 G.Brelstaff & F.Chessa 20

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL

Cons: #3

TEI-p5 must be subsetted to avoid

  • verlapping markup

We prioritise alignment tags over {verse-line,paragraph} hierarchy.

not w3cRange

slide-21
SLIDE 21

MLW Pisa 2011 G.Brelstaff & F.Chessa 21

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL

Pros: #1

Unicode in XML attributes permits our novel alignment scheme: The verbatim source text is simply assigned as an attributed of an enclosing tag in the translated text

not w3cRange

slide-22
SLIDE 22

MLW Pisa 2011 G.Brelstaff & F.Chessa 22

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL

Pros: #2

CSS selection mechanism as embraced in jQuery helps tame the complexity of cross-browser DOM programming.

not w3cRange

slide-23
SLIDE 23

MLW Pisa 2011 G.Brelstaff & F.Chessa 23

Presentation Content Structure Semantics eXist XML db

XML CSS XHTML

not RDF

Unicode

http put

REST/ajax

DOM Javascript jQuery

  • TEI-p5

XMLSchema XSL XQL

Pros: #3

RESTful archiving is a reality due to:

  • Ajax in the browser
  • Http PUT on the wire, &
  • eXist XML db on the server

not w3cRange

slide-24
SLIDE 24

MLW Pisa 2011 G.Brelstaff & F.Chessa 24

Conclusion

Cons

  • Can’t bind to XML
  • W3cRange not ready
  • Must subset TEI-p5

Pros

  • Unicode in attributes
  • CSS&jQuery v. DOM
  • RESTful reality

Standards in practice

slide-25
SLIDE 25

MLW Pisa 2011 G.Brelstaff & F.Chessa 25

Browser issues

  • Opera: no transparent cursor in text
  • Firefox: synchronous scoll down bug
  • IE: onselectstart issue
  • Google Chrome: run from disk fix
  • Safari/Chrome/IE Form Enctype: validation
slide-26
SLIDE 26

MLW Pisa 2011 G.Brelstaff & F.Chessa 26

That’s all folks:

Gavin Brelstaff (gjb@ crs4.it) CRS4 09010 Pula (CA) – Sardinia, Italy Francesca Chessa University of Sassari, Italy

L'Amor che move il sole e l'altre stelle