Mikolov's Language Models: Distributed Representations of Sentences and Documents



SLIDE 1

Mikolov’s Language Models:
Distributed Representations of Sentences and Documents

Recurrent Neural Language Model

Tomas Mikolov (Google Inc.)
May 16, 2014

SLIDE 2

Table of contents

1 Motivation
2 Introduction and Background
3 Paragraph Embeddings
4 Performance
5 Linguistic Regularities in Continuous Space Word Representations

SLIDE 3

Motivation

Quoth Tomas Mikolov (http://www.fit.vutbr.cz/~imikolov/rnnlm/google.pdf):

  • Statistical language models assign probabilities to word sequences.
  • Meaningful sentences should be more likely than ambiguous ones.

Language modeling is an artificial intelligence problem.

SLIDE 4

Classical N-gram Models

Figure: Text Modeling using Markov Chains, Claude Shannon (1948)

  max P(w_i | w_{i−1}, ...)    (1)

where each w_i is represented as a 1-of-N (one-hot) encoding.
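For concreteness, a maximum-likelihood bigram instance of equation (1) can be estimated by counting; this toy sketch (corpus and names invented here, not from the slides) shows the idea:

```python
from collections import Counter

# Toy corpus; a real model is estimated from a large text collection.
corpus = "the cat sat on the mat the cat ran".split()

bigrams = Counter(zip(corpus, corpus[1:]))   # counts of (w_{i-1}, w_i) pairs
contexts = Counter(corpus[:-1])              # counts of each context word w_{i-1}

def bigram_prob(prev, word):
    """Maximum-likelihood estimate of P(w_i | w_{i-1})."""
    return bigrams[(prev, word)] / contexts[prev] if contexts[prev] else 0.0

print(bigram_prob("the", "cat"))  # 2/3: "the" is followed by "cat" twice and "mat" once
```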

SLIDE 5

Neural Representation of Words

Neural Language Model, Bengio et al., 2006.

Figure: Word2Vec, Tomas Mikolov

SLIDE 6

Beyond Word Embeddings

Recursive Deep Tensor Models, Socher et al.

Figure: Recursive Tree Structure, Richard Socher 2013

SLIDE 7

Beyond Word Embeddings

Recurrent Neural Network Language Model, Mikolov et al.

Figure: Recurrent NN, Tomas Mikolov 2010

SLIDE 8

Beyond Word Embeddings

Character-Level Recognition

Figure: Text Understanding from Scratch, Zhang and LeCun 2015

SLIDE 9

Algorithm Overview

Figure: Paragraph Embedding Learning Model, Le and Mikolov 2014

SLIDE 10

Algorithmic Overview

Part 1. Word embeddings. Given a sentence w_1, w_2, w_3, ..., maximize the average log probability

  max (1/T) Σ_{t=k}^{T−k} log p(w_t | w_{t−k}, ..., w_{t+k})    (2)

where the prediction is a softmax over the vocabulary:

  p(w_t | w_{t−k}, ..., w_{t+k}) = e^{y_{w_t}} / Σ_i e^{y_i}    (3)

SLIDE 11

Algorithmic Overview

Parameters for Step 1: U, b, where

  y = b + U·h(w_{t−k}, ..., w_{t+k}; W)    (4)
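A minimal numpy sketch of equations (2)–(4), assuming toy dimensions and a concatenating h (the model can also average the context vectors); all names and sizes here are illustrative, not from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
N, p, k = 1000, 50, 2                 # vocabulary size, embedding size, half-window (toy values)
W = rng.normal(size=(p, N))           # word matrix: column W[:, w] is the embedding of word w
U = rng.normal(size=(N, 2 * k * p))   # softmax weights over the concatenated context
b = np.zeros(N)                       # softmax bias

def h(context_ids):
    """Concatenate the embeddings of the 2k context words."""
    return np.concatenate([W[:, w] for w in context_ids])

def predict(context_ids):
    """Equations (3)-(4): y = b + U h(...), then softmax over the vocabulary."""
    y = b + U @ h(context_ids)
    e = np.exp(y - y.max())           # shift by max for numerical stability
    return e / e.sum()

# One term of objective (2): log p(w_t | w_{t-k}, ..., w_{t+k})
context = [3, 14, 159, 265]           # ids of w_{t-2}, w_{t-1}, w_{t+1}, w_{t+2}
w_t = 42
log_p = float(np.log(predict(context)[w_t]))
```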

SLIDE 12

Algorithmic Overview

Part II. Joint word and paragraph embeddings:

  y = b + U·h(w_{t−k}, ..., w_{t+k}; W, D)    (5)

where W ∈ R^{p×N} is the word matrix and D ∈ R^{p×M} is the paragraph matrix, for p × (M + N) embedding parameters in total.
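Extending the sketch above to equation (5), the paragraph vector from D enters h as one extra context input, matching the distributed memory model shown on the next slide; again a hedged illustration with made-up dimensions:

```python
import numpy as np

rng = np.random.default_rng(1)
N, M, p, k = 1000, 200, 50, 2              # vocab size, paragraph count, embedding size, half-window
W = rng.normal(size=(p, N))                # word matrix, p x N
D = rng.normal(size=(p, M))                # paragraph matrix, p x M -> p * (M + N) embedding parameters
U = rng.normal(size=(N, (2 * k + 1) * p))  # softmax weights: 2k words plus one paragraph vector
b = np.zeros(N)

def h(paragraph_id, context_ids):
    """PV-DM: the paragraph vector acts like an extra word shared by all windows of that paragraph."""
    return np.concatenate([D[:, paragraph_id]] + [W[:, w] for w in context_ids])

def predict(paragraph_id, context_ids):
    y = b + U @ h(paragraph_id, context_ids)  # equation (5)
    e = np.exp(y - y.max())
    return e / e.sum()
```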

SLIDE 13

Algorithm Overview

Figure: Distributed Memory Model

SLIDE 14

Algorithm Overview

Figure: Distributed Bag of Words Model

SLIDE 15

Sentiment Analysis

Figure: Stanford Sentiment Treebank Dataset

SLIDE 16

Sentiment Analysis

Figure: IMDB Dataset

SLIDE 17

Model

Figure: Recurrent NN, Tomas Mikolov 2010

SLIDE 18

Components:

  • input:  x(t) = w(t) + s(t−1)
  • hidden: s_j(t) = f( Σ_i x_i(t) u_{ji} )
  • output: y_k(t) = g( Σ_j s_j(t) v_{kj} )

where f is the sigmoid and g is the softmax.
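A hedged numpy sketch of one recurrent step, reading x(t) = w(t) + s(t−1) as concatenation of the 1-of-N word vector with the previous hidden state, as described in Mikolov's 2010 paper; sizes are toy values:

```python
import numpy as np

rng = np.random.default_rng(2)
V, H = 1000, 100                         # vocabulary size and hidden size (toy values)
U = rng.normal(size=(H, V + H)) * 0.01   # input-to-hidden weights u_ji
Vw = rng.normal(size=(V, H)) * 0.01      # hidden-to-output weights v_kj

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def rnn_step(word_id, s_prev):
    """One step: x(t) = [w(t); s(t-1)], s(t) = f(U x(t)), y(t) = g(Vw s(t))."""
    w = np.zeros(V); w[word_id] = 1.0    # 1-of-V encoding of the current word
    x = np.concatenate([w, s_prev])      # input layer
    s = sigmoid(U @ x)                   # hidden (context) layer
    y = softmax(Vw @ s)                  # distribution over the next word
    return s, y

s = np.zeros(H)
for wid in [5, 17, 42]:                  # a toy word-id sequence
    s, y = rnn_step(wid, s)
```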

SLIDE 19

Spatial Meaning:

Vector offset method for linguistic analogy questions ("a is to b as c is to ?"):

  y = x_b − x_a + x_c

  w* = argmax_w (x_w · y) / (‖x_w‖ ‖y‖)
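A small sketch of this method; the embedding table here is a random stand-in, so only the mechanics (offset plus cosine argmax) carry over to real trained vectors:

```python
import numpy as np

rng = np.random.default_rng(3)
words = ["king", "man", "woman", "queen", "apple"]
emb = {w: rng.normal(size=50) for w in words}    # stand-in vectors; real runs use trained embeddings

def analogy(a, b, c):
    """Solve a : b :: c : ? via y = x_b - x_a + x_c, then argmax over cosine similarity."""
    y = emb[b] - emb[a] + emb[c]
    best, best_sim = None, -np.inf
    for w, x in emb.items():
        if w in (a, b, c):                       # exclude the query words, as is standard
            continue
        sim = (x @ y) / (np.linalg.norm(x) * np.linalg.norm(y))
        if sim > best_sim:
            best, best_sim = w, sim
    return best

# With trained word2vec vectors this famously returns "queen".
print(analogy("man", "king", "woman"))
```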

SLIDE 20

Results
