Improving Neural Language Models with Weight Norm Initialization and Regularization
Christian Herold∗ Yingbo Gao∗ Hermann Ney
<surname>@i6.informatik.rwth-aachen.de
October 31st, 2018
Human Language Technology and Pattern Recognition
Computer Science Department
RWTH Aachen University
Third Conference on Machine Translation (WMT18)
Brussels, Belgium
∗ Equal contribution
Herold et al.: Improving NLMs with Weight Norm Initialization and Regularization 1/13 31.10.2018