IN5550: Neural Methods in Natural Language Processing Lecture 11/1 Contextualized embeddings
Andrey Kutuzov
University of Oslo
14 April 2020
1
IN5550: Neural Methods in Natural Language Processing Lecture 11/1 - - PowerPoint PPT Presentation
IN5550: Neural Methods in Natural Language Processing Lecture 11/1 Contextualized embeddings Andrey Kutuzov University of Oslo 14 April 2020 1 Contents Brief Recap 1 Problems of static word embeddings 2 Solution: contextualized
1
1
2
3
3
4
5
5
6
7
8
9
◮ ...actually, they are UTF-8 code units (bytes), not characters per se. 10
11
12
13
14
◮ conceptually, the same workflow as with ‘static’ word embeddings
◮ Potentially more powerful. More on that later today.
15
16
◮ http://vectors.nlpl.eu/repository/
◮ https://huggingface.co/transformers/pretrained_models.html
◮ Takes about 24 hours to train one ELMo epoch on 1 billion words using
◮ Much more for BERT!
17
18
19
20
21