Multilingual mark-up of text-audio synchronization at a word-by-word level, how HTML5 may assist in-browser solutions
Gavin Brelstaff (gjb@crs4.it) CRS4, Sardinia, Italy
Francesca Chessa University of Sassari, Italy
Multilingual Web Workshop Rome March 2013
MLW Rome 2013 G.Brelstaff & F.Chessa

First, to the movies

Movie subtitles

HTML5 video
<video src="Cyrano.ogv">
  <track kind="subtitles" label="English" src="Cyrano_en.vtt" srclang="en" default />
</video>
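The slide refers to a WebVTT file without showing its contents. As a rough sketch of how such a file could be generated from a list of cues, the helpers below format times as `HH:MM:SS.mmm` and emit the `WEBVTT` header followed by one block per cue. The function names and the `{start, end, text}` cue shape are assumptions made for illustration, not part of the presentation.

```javascript
// Hypothetical helper: format a time in seconds as a WebVTT timestamp.
function toVttTime(seconds) {
  const ms = Math.round(seconds * 1000);
  const h = Math.floor(ms / 3600000);
  const m = Math.floor((ms % 3600000) / 60000);
  const s = Math.floor((ms % 60000) / 1000);
  const pad = (n, width) => String(n).padStart(width, '0');
  return pad(h, 2) + ':' + pad(m, 2) + ':' + pad(s, 2) + '.' + pad(ms % 1000, 3);
}

// Hypothetical helper: build WebVTT text from cues of the assumed
// shape { start, end, text }, with times in seconds.
function cuesToVtt(cues) {
  const blocks = cues.map(function (cue) {
    return toVttTime(cue.start) + ' --> ' + toVttTime(cue.end) + '\n' + cue.text;
  });
  return 'WEBVTT\n\n' + blocks.join('\n\n') + '\n';
}
```

The resulting text could be saved as the file named in the `src` attribute of the `<track>` element.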

HTML5 audio
<audio>
  <source src="file-RU.ogg" type="audio/ogg" />
  <source src="file-RU.mp3" type="audio/mpeg" />
  <track kind="subtitles" label="English" src="file_en.vtt" srclang="en" default />
  Your browser does not support the audio element.
</audio>
Simply supply the timed-text file and the browser handles the cue timing for you, line by line. Browsers parse WebVTT (vtt) natively; an srt file needs converting to WebVTT first.
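SubRip (srt) and WebVTT (vtt) differ mainly in the `WEBVTT` header and in the decimal separator used in timestamps. A minimal conversion sketch, assuming well-formed srt input, might look like this; `srtToVtt` is a hypothetical helper name, and the timestamp replacement would also touch any subtitle text that happens to contain the same pattern.

```javascript
// Hypothetical helper: convert SubRip (srt) text to WebVTT text.
function srtToVtt(srt) {
  const body = srt
    .replace(/^\uFEFF/, '')                            // drop a byte-order mark if present
    .replace(/\r\n?/g, '\n')                           // normalise line endings
    .replace(/(\d{2}:\d{2}:\d{2}),(\d{3})/g, '$1.$2')  // comma -> dot in timestamps
    .trim();
  // Numeric srt counters are kept; WebVTT accepts them as cue identifiers.
  return 'WEBVTT\n\n' + body + '\n';
}
```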

Timed-text audio on the web: http://commons.wikimedia.org/wiki/TimedText:GraziaDeledda.ogg.en.srt

Timed-text audio srt:
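For scripts that drive their own synchronization, an srt file can be read into plain cue objects. The sketch below assumes the usual srt layout (counter line, timing line, one or more text lines, blank-line separator); `parseSrt` and `srtTimeToSeconds` are illustrative names, not an existing library API.

```javascript
// Hypothetical helper: convert "HH:MM:SS,mmm" to seconds.
function srtTimeToSeconds(stamp) {
  const m = /^(\d+):(\d{2}):(\d{2})[,.](\d{3})$/.exec(stamp.trim());
  if (!m) throw new Error('Bad SRT timestamp: ' + stamp);
  return Number(m[1]) * 3600 + Number(m[2]) * 60 + Number(m[3]) + Number(m[4]) / 1000;
}

// Hypothetical helper: parse srt text into [{ start, end, text }, ...].
function parseSrt(srt) {
  return srt
    .replace(/\r\n?/g, '\n')       // normalise line endings
    .trim()
    .split(/\n{2,}/)               // one block per cue
    .map(function (block) {
      const lines = block.split('\n');
      const timing = lines.findIndex(function (l) { return l.includes('-->'); });
      if (timing < 0) return null; // skip blocks without a timing line
      const parts = lines[timing].split('-->');
      return {
        start: srtTimeToSeconds(parts[0]),
        end: srtTimeToSeconds(parts[1]),
        text: lines.slice(timing + 1).join('\n')
      };
    })
    .filter(Boolean);
}
```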

Speech to Text: digital spectrogram
Speech analysis credit: Carlo Schirru, Univ. Sassari, Aspetti fonetico-fonologici introduttivi all’analisi strumentale sull’intonazione del sardo (2006)

Demo + timed-text

Multilingual markup - recap
A human marks up the equivalences between bilingual texts at three different levels: word, phrase, idea. Equivalence is colour-coded at each level.
Web-based alignment and presentation of semantic equivalence [XHTML + CSS + jQuery]

HTML under the hood
<audio> ... </audio> ... Astonished was I: by the hush over water ...
Timed Text Markup (TTML), 31 Jan 2013

Archive format: XML TEI
<text><body>
<div type="poem" xml:lang="en"> …
<s><phr>
<milestone unit="stanza"/> <lb/>
<milestone unit="cue" n="22.9s"/>
<w n="ru: И _ удивило " type="parap"> Astonished was </w>
<milestone unit="cue" n="24.8s"/>
<w n="ru: меня "> I: </w>
</phr><phr> <lb/>
<milestone unit="cue" n="25.5s"/>
<w n="ru: как " type="parap"> by </w>
<milestone unit="cue" n="25.9s"/>
<w n="ru: спокойны "> the hush over </w>
<milestone unit="cue" n="26.7s"/>
<w n="ru: воды "> water </w>
</phr> <lb/> ...
Add one TEI milestone "anchor" per audio cue-point.
Text Encoding Initiative P5, 2012
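To illustrate how the milestone anchors might feed a player script, the sketch below pulls each cue time and the word group that follows it out of a TEI fragment shaped like the one on this slide. It is a regex-based approximation under that assumption, not a TEI parser; `extractCues` is a hypothetical name, and treating the next cue's start as the current cue's end is also an assumption.

```javascript
// Hypothetical helper: pair each <milestone unit="cue"> with the <w> that follows it.
function extractCues(tei) {
  const cueRe = /<milestone\s+unit="cue"\s+n="([\d.]+)s"\s*\/>\s*<w\s+([^>]*)>([^<]*)<\/w>/g;
  const cues = [];
  let m;
  while ((m = cueRe.exec(tei)) !== null) {
    const nAttr = /(?:^|\s)n="([^"]*)"/.exec(m[2]);
    cues.push({
      start: parseFloat(m[1]),            // cue time in seconds
      text: m[3].trim(),                  // word group shown to the reader
      gloss: nAttr ? nAttr[1].trim() : '' // equivalent in the other language
    });
  }
  // Assumption: a word group stays active until the next cue begins.
  cues.forEach(function (cue, i) {
    cue.end = i + 1 < cues.length ? cues[i + 1].start : null;
  });
  return cues;
}
```

Applied to the first two cues of the fragment, this would yield start times 22.9 and 24.8 with the texts "Astonished was" and "I:".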

HTML5 audio tag
HTML code (no <track> subtitles here):
<audio id="audio"> <!-- no controls attribute, so no player controls are shown -->
  <source src="01-RU.ogg" type="audio/ogg">
  <source src="01-RU.mp3" type="audio/mpeg">
  Your browser does not support the audio element.
</audio>
JavaScript audio play:
var myAudio = $('#audio');          // jQuery selector
myAudio.get(0).currentTime = 15.5;  // seconds
myAudio.get(0).play();              // start HTML5 audio
JavaScript text sync (scary stuff instead):
setTimeout(function () { switch_on(...); }, start_ms);  // times in
setTimeout(function () { switch_off(...); }, end_ms);   // milliseconds
See also: westonruter-html5-audio-read-along on github
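Timers set with setTimeout keep running when playback is paused or the listener seeks. One alternative, sketched here as an assumption rather than the authors' method, is to look up the active cue from `currentTime` whenever the audio element reports a time update. The cue shape `{start, end}`, the `active` class name and both function names are hypothetical; cues are assumed sorted and non-overlapping.

```javascript
// Binary search for the cue containing time t (seconds); returns -1 if none.
function findActiveCue(cues, t) {
  let lo = 0;
  let hi = cues.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    if (t < cues[mid].start) hi = mid - 1;
    else if (t >= cues[mid].end) lo = mid + 1;
    else return mid;
  }
  return -1;
}

// Toggle a CSS class on the word element that matches the current cue.
// wordElements[i] is assumed to be the DOM element for cues[i].
function attachSync(audio, cues, wordElements) {
  let current = -1;
  audio.addEventListener('timeupdate', function () {
    const next = findActiveCue(cues, audio.currentTime);
    if (next === current) return;
    if (current >= 0) wordElements[current].classList.remove('active');
    if (next >= 0) wordElements[next].classList.add('active');
    current = next;
  });
}
```

The timeupdate event may fire only a few times per second, so for short words a finer-grained poll (for example with requestAnimationFrame) might be needed.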

Cue-point mark-up tools? www.nikse.dk/subtitleedit

Cue-point mark-up tools? www.fon.hum.uva.nl/praat

Cue-point mark-up (visual interface)
Insert & nudge cue-points directly on the web-page while listening
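The slide does not show how nudging is implemented. One plausible sketch, given purely as an assumption, keeps the cue-points as a sorted array of seconds and clamps each nudge between the neighbouring cue-points so their order is preserved. `nudgeCue` is a hypothetical name.

```javascript
// Hypothetical helper: move cuePoints[index] by deltaSeconds, clamped between
// its neighbours, and return a new array with the original left untouched.
function nudgeCue(cuePoints, index, deltaSeconds) {
  const lower = index > 0 ? cuePoints[index - 1] : 0;
  const upper = index < cuePoints.length - 1 ? cuePoints[index + 1] : Infinity;
  const moved = Math.min(upper, Math.max(lower, cuePoints[index] + deltaSeconds));
  const result = cuePoints.slice();
  result[index] = Math.round(moved * 1000) / 1000; // keep millisecond precision
  return result;
}
```

A key or button handler on the page could call this with a small step, such as 0.1 s, and replay the audio from just before the adjusted cue-point.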

Our aim: to activate poetic memory
Involve ear, tongue and eyes to reinforce memory/appreciation across the language divide.
Nel mezzo del cammin di nostra vita
mi ritrovai per una selva oscura
ché la diritta via era smarrita

"We preferred poems that make a powerful impact when they are heard aloud - not because they are theatrical, but because they dramatise experiences that surprise us into a new apprehension of ourselves and our capacity for imagining, thinking and marvelling."
Mr Gove said the project would ensure that more children would be captivated by great poetry and it would help "pass our cultural legacy on to the next generation".

Caesar’s Europe: poetic memory
http://www.perseus.tufts.edu/hopper/text?doc=Perseus%3Atext%3A1999.02.0001%3Abook%3D6%3Achapter%3D14
The Druids … learn by heart a great number of verses; … . Nor do they regard it lawful to commit these to writing …

Poetic memory: internal, external
Internal: Immediately available to society (in cache)
External: Available on demand (in digital archive)
Appreciation, comprehension across the language divide?
Juliane Stiller, Marlies Olensky MLW Dublin 2012

• Information is not knowledge
• knowledge is not wisdom
• wisdom is not truth …
F.Zappa 1979

Poetic memory informs society
1562 Arthur Brooke: Prose plot (information). Lost on us.
1597 William Shakespeare: Poetic language (information plus). Able to inform society.

Back to the movies – an extreme social network
Learning by rote or Learning by heart?

http://www.youtube.com/watch?v=ZriW3CPU9G4&list=PLGGjdQw3TIx9Dk0CYaHS9R6LyI_HmtG9i

That’s all folks:
Gavin Brelstaff (gjb@crs4.it) CRS4 09010 Pula (CA) – Sardinia, Italy
Francesca Chessa University of Sassari, Italy
