SLIDE 1
Unsupervised Word Translation
Kira Selby
University of Waterloo
SLIDE 2
Can we train a model to translate a language we know nothing about?
SLIDE 3
Yes we can!
- Near the end of 2017, FAIR (Facebook AI Research) published a model called MUSE (Multilingual UnSupervised word Embeddings)
- MUSE can learn to translate between languages without any cross-lingual information!
- It achieves state-of-the-art accuracy on hundreds of languages, even coming close to or surpassing supervised models!
SLIDE 4
Word Embeddings
- Word embeddings are models that map every word in a language to a fixed-size vector
- The idea is to map words in such a way that the resulting vector space captures something about the relationships between words
- Most famous example: Word2Vec (Mikolov et al., 2013)
- King – Man + Woman = Queen (sketched in code below)
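A minimal Python sketch of that analogy arithmetic, using hand-made toy vectors rather than real trained embeddings (the words, dimensions, and values below are purely illustrative):

```python
import numpy as np

# Toy 3-dimensional "embeddings" chosen by hand purely for illustration;
# real Word2Vec vectors are learned from text and have 100+ dimensions.
emb = {
    "king":  np.array([0.9, 0.8, 0.1]),
    "queen": np.array([0.9, 0.1, 0.8]),
    "man":   np.array([0.1, 0.9, 0.1]),
    "woman": np.array([0.1, 0.1, 0.9]),
}

def nearest(vec, exclude):
    # Return the word whose embedding has the highest cosine similarity to vec.
    cos = lambda a, b: a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    return max((w for w in emb if w not in exclude),
               key=lambda w: cos(emb[w], vec))

target = emb["king"] - emb["man"] + emb["woman"]
print(nearest(target, exclude={"king", "man", "woman"}))  # -> "queen"
```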
SLIDE 5
MUSE
- We start with a fixed set of word embeddings in each language, typically learned from a large corpus of text
- Given target vectors Y and source vectors X, we want to learn a mapping Y = XW between the two spaces (see the sketch below)
- We want to do this in such a way that the distribution of vectors in each of the two languages is the same
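For intuition about the mapping Y = XW: when a small dictionary of matched word pairs is available (MUSE itself builds such a synthetic dictionary during its refinement step), the best orthogonal W has a closed-form answer, the orthogonal Procrustes solution. A minimal numpy sketch with randomly generated stand-in embeddings:

```python
import numpy as np

def procrustes(X, Y):
    # Closed-form orthogonal W minimizing ||XW - Y||_F
    # subject to W being orthogonal (orthogonal Procrustes).
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt

rng = np.random.default_rng(0)
W_true = np.linalg.qr(rng.normal(size=(300, 300)))[0]     # hidden rotation
X = rng.normal(size=(1000, 300))                          # "source" embeddings
Y = X @ W_true + 0.01 * rng.normal(size=(1000, 300))      # noisy "target" embeddings

W = procrustes(X, Y)
print(np.abs(X @ W - Y).max())  # small residual: the two spaces are aligned
```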
SLIDE 6
SLIDE 7
GANs
- MUSE does this by using a GAN (Generative Adversarial Network)
- We train a discriminator to try to tell whether two vectors are from the same language, and a generator to map the vectors from one language into the other (see the sketch below)
- The discriminator and the generator are adversaries: they each train to try to beat the other
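A minimal PyTorch sketch of this adversarial setup; the random tensors, layer sizes, batch size, and learning rates are illustrative stand-ins for real pre-trained embeddings and tuned hyperparameters, and the actual MUSE implementation adds refinements (label smoothing, keeping W near-orthogonal) omitted here:

```python
import torch
import torch.nn as nn

dim = 300
X = torch.randn(10000, dim)  # stand-in for pre-trained source embeddings
Y = torch.randn(10000, dim)  # stand-in for pre-trained target embeddings

W = nn.Linear(dim, dim, bias=False)                # generator: the linear map XW
D = nn.Sequential(nn.Linear(dim, 512), nn.LeakyReLU(0.2),
                  nn.Linear(512, 1))               # discriminator: source-vs-target logit
bce = nn.BCEWithLogitsLoss()
opt_W = torch.optim.SGD(W.parameters(), lr=0.1)
opt_D = torch.optim.SGD(D.parameters(), lr=0.1)

for step in range(1000):
    xb = X[torch.randint(len(X), (128,))]
    yb = Y[torch.randint(len(Y), (128,))]

    # 1) Discriminator step: label mapped source vectors 0, real target vectors 1.
    logits = torch.cat([D(W(xb).detach()), D(yb)])
    labels = torch.cat([torch.zeros(128, 1), torch.ones(128, 1)])
    opt_D.zero_grad(); bce(logits, labels).backward(); opt_D.step()

    # 2) Generator step: update W so mapped source vectors fool the discriminator.
    opt_W.zero_grad(); bce(D(W(xb)), torch.ones(128, 1)).backward(); opt_W.step()
```

Alternating the two updates forces W to make the mapped source distribution indistinguishable from the target distribution, which is exactly the "same distribution" goal from the previous slide.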
SLIDE 8
MUSE
- MUSE has been incredibly successful, and set a new standard for word translation
- Many papers have been published following up on MUSE's techniques, but there are still open problems in the area
- One of the most important is to improve the performance on highly dissimilar languages and low-resource languages
- This is an area that could be an exciting direction for future research