The Sockeye Neural MT Toolkit at AMTA 2018
Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post github.com/awslabs/sockeye
Why Sockeye?
Sockeye is: A production-ready framework for training
Motivation: rapid evolution of Neural MT—different toolkits with different features
Decision: build such a toolkit—Sockeye
Named after the Sockeye salmon found in the Northern Pacific Ocean (favorite fish around Seattle, WA)
A translation system in 3 slides
Language model conditioned on source sentence $x = x_1, \ldots, x_n$:
$$p(y \mid x) = \prod_{t=1}^{m} p(y_t \mid y_{1:t-1}, x)$$
Encode source sentence
Decode target sentence
Attention connects states across steps
Many instantiations:
[Figure: encoder-decoder with attention. The encoder reads the source "la casa blanca"; the decoder reads "<BOS> the white house" and outputs "the white house <EOS>".]
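The factorization above can be illustrated with a minimal sketch: the probability of a whole target sentence is the product of per-step conditional probabilities, usually accumulated in log space. The per-step values below are made-up numbers for illustration, not output from any real model, and `sequence_log_prob` is a hypothetical helper name.

```python
import math

def sequence_log_prob(step_probs):
    """Sum the per-step terms log p(y_t | y_{1:t-1}, x) over all target positions."""
    return sum(math.log(p) for p in step_probs)

# Hypothetical conditional probabilities for "the white house <EOS>".
step_probs = [0.9, 0.8, 0.7, 0.95]

log_p = sequence_log_prob(step_probs)
# Exponentiating the summed log-probabilities recovers the product of the steps.
prob = math.exp(log_p)
```

Working in log space avoids numerical underflow when sentences are long and the individual probabilities are small.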
Given raw parallel text:
The shares closed almost unchanged at 187.35 dollars.
The question comes alone: Collserola? Park or mountain?
Step 1 – Tokenize:
The shares closed almost unchanged at 187.35 dollars .
The question comes alone : Collserola ? Park or mountain ?
Step 2 – Sub-word encode:
The share@@ s closed a@@ lmost un@@ chang@@ ed at 18@@ 7@@ .@@ 35 dollar@@ s .
The question comes alone : Co@@ ll@@ s@@ er@@ ola ? Park or mountain ?
Ready for training!
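The two preprocessing steps can be sketched roughly as follows. This is a simplified illustration, not the tokenizer or sub-word tool used by the authors: `tokenize` is a hypothetical regex-based stand-in that only separates punctuation from words while keeping decimal numbers intact, and `undo_bpe` shows how the `@@ ` continuation markers are removed again after translation.

```python
import re

# Decimal numbers first, then runs of word characters, then single punctuation marks.
TOKEN_RE = re.compile(r"\d+(?:\.\d+)*|\w+|[^\w\s]")

def tokenize(text):
    """Split punctuation from words and join the tokens with single spaces."""
    return " ".join(TOKEN_RE.findall(text))

def undo_bpe(text):
    """Merge sub-word units back into words by dropping the '@@ ' markers."""
    return text.replace("@@ ", "")

tokenized = tokenize("The shares closed almost unchanged at 187.35 dollars.")
restored = undo_bpe(
    "The share@@ s closed a@@ lmost un@@ chang@@ ed at 18@@ 7@@ .@@ 35 dollar@@ s ."
)
```

Learning the sub-word vocabulary itself requires a separate training step on the parallel data, which this sketch does not cover.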
Install Sockeye:
pip install sockeye
Train with default settings:
python -m sockeye.train \
Decode with default settings:
python -m sockeye.translate \
Customization?
Customizing translation systems
Sockeye supports 3 prominent architectures:
Attentional Recurrent
[Bahdanau et al., 2014, Luong et al., 2015]
Fully Convolutional
[Gehring et al., 2017]
Self-Attentional Transformer
[Vaswani et al., 2017]
Sockeye supports a range of attention models (currently limited to RNN encoders/decoders):

Name        Reference                 Formula
MLP         [Bahdanau et al., 2014]   $v_a^\top \tanh(W_u s + W_v h)$
Dot         [Luong et al., 2015]      $s^\top h$
Location    [Luong et al., 2015]      $v_a^\top s$
Bilinear    [Luong et al., 2015]      $s^\top W h$
Coverage    [Tu et al., 2015]         $v_a^\top \tanh(W_u s + W_v h + W_c C)$
Multi-head  [Vaswani et al., 2017]    $\mathrm{softmax}\left(\frac{s W^Q_u (h W^K_u)^\top}{\sqrt{d_u}}\right) h W^V_u$
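As a rough illustration of how such attention scores are used, the sketch below computes dot and bilinear scores between a decoder state `s` and each encoder state `h`, normalizes them with a softmax, and forms a weighted context vector. It uses plain Python lists and made-up values; all function names are hypothetical and none of this is taken from Sockeye's code.

```python
import math

def softmax(scores):
    """Normalize scores into weights that sum to one (shifted for stability)."""
    m = max(scores)
    exps = [math.exp(x - m) for x in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot_score(s, h):
    """Dot attention: s^T h."""
    return sum(si * hi for si, hi in zip(s, h))

def bilinear_score(s, W, h):
    """Bilinear attention: s^T W h, with W given as a list of rows."""
    Wh = [dot_score(row, h) for row in W]
    return dot_score(s, Wh)

def attend(s, encoder_states, score_fn):
    """Return attention weights and the weighted sum of encoder states."""
    weights = softmax([score_fn(s, h) for h in encoder_states])
    dim = len(encoder_states[0])
    context = [
        sum(w * h[i] for w, h in zip(weights, encoder_states)) for i in range(dim)
    ]
    return weights, context

# Toy decoder state and three encoder states (illustrative values only).
s = [1.0, 0.0]
H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
weights, context = attend(s, H, dot_score)
```

With an identity matrix for `W`, the bilinear score reduces to the dot score, which is one way to see how the variants relate.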
Recommended model training recipe:
Monitor training with standalone TensorBoard:
Primary decoding features:
Visualize beam search history
Adding new features?
Adding your code to Sockeye
Fast and scalable deep learning framework
Flexible programming model
Bindings for various languages (Python, C++, Scala, R, Julia, Perl)
Officially supported by Amazon/AWS
Imperative
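from mxnet.ndarray import zeros, FullyConnected, SoftmaxActivation
# Batch of 64 inputs with 12 features each
x = zeros((64, 12))
# Weight matrix for 128 hidden units
weights = zeros((128, 12))
# no_bias=True because no bias array is passed
x = FullyConnected(x, weights, num_hidden=128, no_bias=True)
pred = SoftmaxActivation(x)
# Copy the result into a NumPy array
pred = pred.asnumpy()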
Symbolic
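from mxnet.symbol import Variable, FullyConnected, SoftmaxOutput
from mxnet.module import Module
# Declare placeholders; nothing is computed until the graph is bound
y = Variable('y')
x = Variable('x')
weights = Variable('w')
x = FullyConnected(x, weights, num_hidden=128)
pred = SoftmaxOutput(x, y)
# Name the data and label inputs to match the variables above
model = Module(pred, data_names=['x'], label_names=['y'])
model.fit(…)
model.forward_backward(data)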
Training - Symbolic:
memory and parameters
Inference - Symbolic and Imperative:
k-best hypotheses at each step until <EOS>
[Figure: batches laid out along "sequence length" and "batch size" axes]
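Keeping the k best hypotheses at each step until <EOS> is the idea behind beam search. The following is a minimal sketch of that idea, not Sockeye's decoder: `step_fn` is a hypothetical interface that returns next-token probabilities for a prefix, `TOY_MODEL` is an invented lookup table, and no length normalization is applied.

```python
import math

def beam_search(step_fn, beam_size, max_len, bos="<BOS>", eos="<EOS>"):
    """Keep the beam_size best partial hypotheses at each step until <EOS>."""
    beams = [(0.0, [bos])]   # (log-probability, tokens so far)
    finished = []
    for _ in range(max_len):
        candidates = []
        for score, prefix in beams:
            for token, prob in step_fn(prefix).items():
                if prob > 0.0:
                    candidates.append((score + math.log(prob), prefix + [token]))
        # Rank all one-token extensions and keep only the best beam_size.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for score, hyp in candidates[:beam_size]:
            if hyp[-1] == eos:
                finished.append((score, hyp))
            else:
                beams.append((score, hyp))
        if not beams:
            break
    finished.extend(beams)  # hypotheses cut off at max_len still compete
    return max(finished, key=lambda c: c[0])[1]

# Invented next-token distributions keyed by the previous token only.
TOY_MODEL = {
    "<BOS>": {"the": 0.7, "a": 0.3},
    "the": {"white": 0.7, "house": 0.3},
    "a": {"house": 1.0},
    "white": {"house": 0.9, "<EOS>": 0.1},
    "house": {"<EOS>": 1.0},
}

def toy_step(prefix):
    return TOY_MODEL[prefix[-1]]

best = beam_search(toy_step, beam_size=2, max_len=5)
```

A real decoder would score whole batches of hypotheses at once on the GPU; the list-based loop here only shows the bookkeeping.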
Official Amazon software on GitHub
Developer guidelines for reliable, understandable code:
Public code review process; community feedback welcome!
System          Architecture   EN→DE   LV→EN
FairSeq         CNN            23.37   15.38
Marian          RNN            25.93   16.19
Marian          Transformer    27.41   17.58
Nematus         RNN            23.78   14.70
Neural Monkey   RNN            13.73   10.54
OpenNMT         RNN            22.69   13.85
OpenNMT-py      RNN            21.95   13.55
Tensor2Tensor   Transformer    26.34   17.67
Sockeye         CNN            24.59   15.82
Sockeye         RNN            25.55   15.92
Sockeye         Transformer    27.50   18.06
WMT BLEU (cased)