the sockeye neural mt toolkit at amta 2018
play

The Sockeye Neural MT Toolkit at AMTA 2018 Felix Hieber, Tobias - PowerPoint PPT Presentation

mt @ The Sockeye Neural MT Toolkit at AMTA 2018 Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post github.com/awslabs/sockeye Why Sockeye? Sockeye is: A production-ready framework for training


  1. mt @ The Sockeye Neural MT Toolkit at AMTA 2018 Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post github.com/awslabs/sockeye

  2. Why Sockeye? Sockeye is: A production-ready framework for training state-of-the-art models • A flexible experimentation platform for researchers • Motivation: rapid evolution of Neural MT—different toolkits with different features No single toolkit with everything we need at Amazon • Nothing mature for MXNet, our framework of choice • Decision: build such a toolkit—Sockeye Highly scalable (multiple GPUs, large data) • Free and open source software (Apache 2.0) • Named after the Sockeye salmon found in the Northern Pacific Ocean (Favorite fish around Seattle, WA) 2

  3. Quick Start A translation system in 3 slides 3

  4. Sequence-to-Sequence Modeling the white house <EOS> Language model conditioned on source sentence ! = # 1 , … , #' : / ( )|# = + (() , |) .:,2. , #) D , encoder ? @AB ,-. Encode source sentence 8 7 : ; : < : = : > 8 9 8 . 8 5 8 6 Decode target sentence Attention connects states across steps ! . ! 6 4 . 4 5 4 7 ! 5 4 6 Many instantiations: Recurrent • casa blanca white house la <BOS> the Convolutional • Self-attentional • decoder ? C@B 4

  5. Data Pre-Processing Given raw parallel text: The shares closed almost unchanged at 187.35 dollars. The question comes alone: Collserola? Park or mountain? Step 1 – Tokenize: The shares closed almost unchanged at 187.35 dollars . The question comes alone : Collserola ? Park or mountain ? Step 2 – Sub-word encode: The share@@ s closed a@@ lmost un@@ chang@@ ed at 18@@ 7@@ .@@ 35 dollar@@ s . The question comes alone : Co@@ ll@@ s@@ er@@ ola ? Park or mountain ? Ready for training! 5

  6. Running Sockeye Install Sockeye: pip install sockeye Train with default settings: python -m sockeye.train \ --source train-corpus.de \ --target train-corpus.en \ --validation-source dev-corpus.de \ --validation-target dev-corpus.en \ --output model.de-en Decode with default settings: python -m sockeye.translate \ --models model.de-en Customization? 6

  7. Architectures & Features Customizing translation systems 7

  8. Base Architectures Sockeye supports 3 prominent architectures: Mix and match with --encoder and --decoder options • Attentional Recurrent Fully Convolutional Self-Attentional Transformer [Bahdanau et al., 2014, Luong et al., 2015] [Gehring et al., 2017] [Vaswani et al., 2017] 8

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend