A Micro Crowdsourcing Architecture to Localize A Micro Crowdsourcing - - PowerPoint PPT Presentation

a micro crowdsourcing architecture to localize a micro
SMART_READER_LITE
LIVE PREVIEW

A Micro Crowdsourcing Architecture to Localize A Micro Crowdsourcing - - PowerPoint PPT Presentation

A Micro Crowdsourcing Architecture to Localize A Micro Crowdsourcing Architecture to Localize Web Content for Less-Resourced Languages Asanka Wasala Chris Exton Ruvan Weerasinghe Reinhard Schler Adapted from:


slide-1
SLIDE 1

A Micro Crowdsourcing Architecture to Localize A Micro Crowdsourcing Architecture to Localize Web Content for Less-Resourced Languages

Asanka Wasala Chris Exton Ruvan Weerasinghe Reinhard Schäler

slide-2
SLIDE 2

Adapted from: http://w3techs.com/technologies/overview/content_language/all

slide-3
SLIDE 3

Adapted from: http://w3techs.com/technologies/overview/content_language/all

slide-4
SLIDE 4

Scenario

You’ve come across a Sinhala web site..

slide-5
SLIDE 5

http://si.wikipedia.org/wiki/ංහල__අ_අද

ංහල අ අද

slide-6
SLIDE 6

www.translate.google.com Google Translate http://si.wikipedia.org/wiki/ංහල__අ_අද

slide-7
SLIDE 7

http://www.microsofttranslator.com/

Bing Translator http://si.wikipedia.org/wiki/ංහල__අ_අද

slide-8
SLIDE 8

http://babelfish.yahoo.com/

Yahoo Babel Fish –Text Tra…

http://si.wikipedia.org/wiki/ංහල__අ_අද

slide-9
SLIDE 9
slide-10
SLIDE 10

Web site Localisation..

Web site l10n Automatic (MT) Manual

slide-11
SLIDE 11

Internet

slide-12
SLIDE 12

English Web

slide-13
SLIDE 13

Web site localisation.. “English dominated web pages on the “English dominated web pages on the Internet have resulted in businesses losing sales revenues of $10 million per year”

  • Harvard-based Forrester Research
slide-14
SLIDE 14
slide-15
SLIDE 15
slide-16
SLIDE 16

Home Cut Paste Format Painter Insert Page Layout References Mailings Review View Paste Clipboard Font Paragraph Calibri (Body) 11

වස

වස

Real Time Localisation of Software

“Micro Crowdsourcing: A new Model for Software Localisation”. (2009). Localisation Focus, The International Journal of Localisation ,Vol 8 Issue 1 Chris Exton, Asanka Wasala, Reinhard Schäler, Jim Buckley

slide-17
SLIDE 17

Real Time Web Content Localisation

slide-18
SLIDE 18

Web site Localisation..

Web site l10n Automatic (MT) Manual + TM

slide-19
SLIDE 19

http://www.translationautomation.com/perspectives/lets-make-music-together.html

Let’s Make Music Together! What is interoperability? Why is it important? How do we achieve it?

A few years ago, I converted my garage into a music studio to support my musical hobbies and interests. The initial investment I made in time and money has paid off in that I have spent countless hours of creative bliss experimenting with music on my own, but although I enjoy my hobbies immensely, I always come to the same conclusion after each session—music is social; I need

  • ther musicians to help me fully experience the

phenomenon of creative expression through music. I need a group. I need a band.

slide-20
SLIDE 20

http://www.translationautomation.com/perspectives/lets-make-music-together.html

Let’s Make Music Together! What is interoperability? Why is it important? How do we achieve it?

සැෙවම එක ගය

A few years ago, I converted my garage into a music studio to support my musical hobbies and interests. The initial investment I made in time and money has paid off in that I have spent countless hours of creative bliss experimenting with music on my own, but although I enjoy my hobbies immensely, I always come to the same conclusion after each session—music is social; I need

  • ther musicians to help me fully experience the

phenomenon of creative expression through music. I need a group. I need a band.

slide-21
SLIDE 21

TM

Extension

slide-22
SLIDE 22

<xliff version='1.2' xmlns='urn:oasis:names:tc:xliff:document:1.2'> <file original='site/e1/1223' source-language='en' target-language='si' datatype='plaintext'> <body>

TM

<body> <trans-unit id='124i8j'> <source>Let’s Make Music Together!</source> <target>සැෙවම එක ගය </target> <context-group> <context context-type="element"> xpath:html/body/div#ja-a-content/div#ja-clearfix/a </context>

Extension

</context> </context-group> </trans-unit> </body> </file> </xliff>

slide-23
SLIDE 23

TM

Extension

slide-24
SLIDE 24

http://www.translationautomation.com/perspectives/lets-make-music-together.html

Let’s Make Music Together!

සැෙවම එක ගය

අත !"#හනය ය% &ම' ද? එය වැදග වෙ ෙකෙස)ද? How do we achieve it?

සැෙවම එක ගය

A few years ago, I converted my garage into a music studio to support my musical hobbies and interests. The initial investment I made in time and money has paid off in that I have spent countless hours of creative bliss experimenting with music on my own, but although I enjoy my hobbies immensely, I always come to the same conclusion after each session—music is social; I need

  • ther musicians to help me fully experience the

phenomenon of creative expression through music. I need a group. I need a band.

slide-25
SLIDE 25

Issues

Legal issues L10n of non-textual content L10n of non-textual content Constant updates Style/Formatting issues Translation voting issues Deployment issues

slide-26
SLIDE 26

Benefits

A practical web translation method for less-resourced languages languages Web site independent framework Non-intrusive mechanism for local language users to benefit from the vast knowledge embodied in the Internet A method to facilitate l10n of web pages (in early stage of l10n cycle) e.g. in context l10n./; l10n cycle) e.g. in context l10n./; TM based – more accurate translations Helps building of resources such as parallel corpora

slide-27
SLIDE 27

Standardization considerations

Different extension mechanisms

slide-28
SLIDE 28

Standardization considerations

Different rendering mechanisms/engines (esp. mobile)

slide-29
SLIDE 29

Standardization Considerations

Browser Extensions Rendering Engines and Fonts Rendering Engines and Fonts

slide-30
SLIDE 30

Summery

Multilingual Web is not just the top10 languages Browser extensions Browser extensions

Enable a majority to obtain the most out of the vast body of knowledge encompassed in the largely English

Novel TM based web site localisation methodology Standardization considerations

Browser Extensions Rendering Engines Rendering Engines

slide-31
SLIDE 31

Thank you