

SLIDE 1

CS365 Course Project

Billion Word Imputation

Guide: Prof. Amitabha Mukherjee
Group 20: Aayush Mudgal [12008], Shruti Bhargava [13671]

SLIDE 2

Problem Statement

Problem description: https://www.kaggle.com/c/billion-word-imputation

Task: each sentence in the test set has had exactly one word removed; the goal is to find the location of the missing word and predict what it was.

SLIDE 3

Examples:

1. "Michael described Sarah to a at the shelter ."

  • "Michael described Sarah to a __________ at the shelter ."

2. “He added that people should not mess with mother nature , and let sharks be .”

SLIDE 4

Basic Approach

Two sub-questions: where is the missing word (location), and what is the missing word? (A minimal search sketch follows the list below.)

  • 1. Language modelling using Word2Vec
  • 2. Strengthening using HMM / NLP Parser
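
As a rough illustration of the two questions above (not the project's actual implementation), the sketch below brute-forces every (position, word) insertion and keeps the highest-scoring completion; a simple add-one-smoothed bigram scorer stands in here for the Word2Vec/HMM language model.

    # Minimal sketch: find the missing word's location and identity by scoring
    # every possible insertion with a language model (toy bigram model here).
    from collections import Counter
    from itertools import chain
    import math

    corpus = [
        "michael described sarah to a volunteer at the shelter".split(),
        "he described the dog to a volunteer at the shelter".split(),
    ]

    unigrams = Counter(chain.from_iterable(corpus))
    bigrams = Counter(chain.from_iterable(zip(s, s[1:]) for s in corpus))
    vocab = list(unigrams)

    def log_prob(tokens):
        # Add-one smoothed bigram log-probability of a token sequence.
        score = 0.0
        for a, b in zip(tokens, tokens[1:]):
            score += math.log((bigrams[(a, b)] + 1) / (unigrams[a] + len(vocab)))
        return score

    def impute(tokens):
        # Try every (position, word) insertion and keep the best-scoring one.
        return max(
            ((i, w) for i in range(1, len(tokens)) for w in vocab),
            key=lambda iw: log_prob(tokens[:iw[0]] + [iw[1]] + tokens[iw[0]:]),
        )

    print(impute("michael described sarah to a at the shelter".split()))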
SLIDE 5

Skip-Gram vs. N-Gram

  • Data is Sparse
  • Example Sentence : “I hit the tennis ball”
  • Word level trigrams: “I hit the”, “hit the tennis” and “the tennis ball”
  • But skipping the word "tennis" yields an equally informative trigram: "hit the ball"

Words as atomic units → distributed representations
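
To make the sparsity point concrete, the toy function below (an illustrative sketch, not from the slides) enumerates k-skip-n-grams: with one allowed skip, the trigram "hit the ball" appears even though those words never occur contiguously.

    # Minimal sketch: k-skip-n-grams for the example sentence.
    from itertools import combinations

    def skip_ngrams(tokens, n, k):
        # All n-grams allowing up to k skipped tokens inside the gram.
        grams = set()
        for idxs in combinations(range(len(tokens)), n):
            if idxs[-1] - idxs[0] <= (n - 1) + k:  # span limited by k skips
                grams.add(tuple(tokens[i] for i in idxs))
        return grams

    sentence = "I hit the tennis ball".split()
    print(skip_ngrams(sentence, n=3, k=0))  # ordinary trigrams
    print(skip_ngrams(sentence, n=3, k=1))  # adds e.g. ('hit', 'the', 'ball')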

SLIDE 6

Word2Vec by Mikolov et al. (2013)

Two architectures

  • 1. Continuous Bag-of-Words (CBOW)
  • Predict the word given the context
  • 2. Skip Gram
  • Predict the context given the word
  • The training objective is to find word representations that are useful for predicting the surrounding words in a sentence or a document.
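
A hedged sketch of the two architectures in practice, assuming the gensim library (the library choice and gensim 4.x parameter names such as vector_size are assumptions; the slides do not name an implementation): the sg flag switches between CBOW and Skip-gram.

    # Minimal sketch: training both Word2Vec architectures on a toy corpus.
    from gensim.models import Word2Vec

    sentences = [
        "michael described sarah to a volunteer at the shelter".split(),
        "i hit the tennis ball".split(),
    ]

    # sg=0 -> CBOW: predict the centre word from its context.
    cbow = Word2Vec(sentences, vector_size=50, window=2, sg=0, min_count=1)
    # sg=1 -> Skip-gram: predict the context from the centre word.
    skip = Word2Vec(sentences, vector_size=50, window=2, sg=1, min_count=1)

    print(skip.wv["tennis"][:5])         # learned distributed representation
    print(skip.wv.most_similar("ball"))  # nearest words in the embedding space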

SLIDE 7

Skip-Gram Method

Given a sequence of training words $w_1, w_2, w_3, \ldots, w_T$, the objective of the Skip-gram model is to maximize the average log probability

$\frac{1}{T} \sum_{t=1}^{T} \sum_{-c \le j \le c,\, j \ne 0} \log p(w_{t+j} \mid w_t)$

where $c$ is the size of the training context (which can be a function of the center word $w_t$).
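
Worked as code for clarity (an illustrative sketch, not the project's code): the loop below evaluates that average log probability for a toy sentence, with a placeholder p standing in for the model's conditional probabilities.

    # Minimal sketch: the Skip-gram objective for a toy token sequence.
    import math

    tokens = ["i", "hit", "the", "tennis", "ball"]
    c = 2  # context window size

    def p(context_word, center_word):
        # Placeholder conditional probability; a real model supplies this.
        return 0.1

    T = len(tokens)
    objective = 0.0
    for t in range(T):
        for j in range(-c, c + 1):
            if j != 0 and 0 <= t + j < T:
                objective += math.log(p(tokens[t + j], tokens[t]))
    objective /= T
    print(objective)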

SLIDE 8

Skip-Gram Method

The basic Skip-gram formulation defines $p(w_{t+j} \mid w_t)$ using the softmax function

$p(w_O \mid w_I) = \frac{\exp\left({v'_{w_O}}^{\top} v_{w_I}\right)}{\sum_{w=1}^{W} \exp\left({v'_{w}}^{\top} v_{w_I}\right)}$

where $v_w$ and $v'_w$ are the "input" and "output" vector representations of $w$, and $W$ is the number of words in the vocabulary. This is IMPRACTICAL because the cost of computing $\nabla \log p(w_O \mid w_I)$ is proportional to $W$, which is often large ($10^5$–$10^7$ terms).
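
A small numpy sketch of this softmax (toy sizes, not project code): the denominator needs one dot product per vocabulary word, which is exactly the cost proportional to W that makes the plain formulation impractical.

    # Minimal sketch: full-softmax probability p(w_O | w_I) over a toy vocabulary.
    import numpy as np

    W, dim = 10, 4                      # toy vocabulary size and embedding size
    rng = np.random.default_rng(0)
    v_in = rng.normal(size=(W, dim))    # "input" vectors v_w
    v_out = rng.normal(size=(W, dim))   # "output" vectors v'_w

    def p_softmax(o, i):
        scores = v_out @ v_in[i]        # one dot product per vocabulary word
        scores -= scores.max()          # numerical stability
        probs = np.exp(scores) / np.exp(scores).sum()
        return probs[o]

    print(p_softmax(o=3, i=1))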

SLIDE 9

Sub-Sampling of Frequent Words

  • The most frequent words (e.g., "in", "the", "a") can easily occur hundreds of millions of times.

  • Such words usually provide less information value than rare words.
  • Example: observing the co-occurrence of "France" and "Paris" is much more informative than the frequent co-occurrence of "France" and "the".
  • Vector representations of frequent words do not change significantly after training on several million examples.
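
The cited paper implements this idea by discarding each occurrence of word $w_i$ with probability $P(w_i) = 1 - \sqrt{t / f(w_i)}$, where $f(w_i)$ is its corpus frequency and $t$ a small threshold. The sketch below applies that rule to a toy corpus (the threshold is enlarged here so the effect is visible; real corpora typically use around 1e-5).

    # Minimal sketch: sub-sampling of frequent words.
    import math, random
    from collections import Counter

    tokens = "the cat sat on the mat the cat slept".split()
    freq = Counter(tokens)
    total = len(tokens)
    t = 0.05  # toy threshold; roughly 1e-5 is typical on large corpora

    def keep(word):
        f = freq[word] / total
        p_discard = max(0.0, 1 - math.sqrt(t / f))
        return random.random() >= p_discard

    random.seed(0)
    print([w for w in tokens if keep(w)])  # frequent words like "the" are thinned out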

SLIDE 10

Skip-Gram Model: Limitation

  • Word representations are limited by their inability to represent idiomatic phrases that are not compositions of the individual words.
  • For example, "Boston Globe" is a newspaper, not the composition of the meanings of "Boston" and "Globe".

Therefore, using vectors to represent whole phrases makes the Skip-gram model considerably more expressive.
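
The cited paper finds such phrases with a simple bigram score, $\mathrm{score}(w_i, w_j) = \frac{\mathrm{count}(w_i w_j) - \delta}{\mathrm{count}(w_i) \cdot \mathrm{count}(w_j)}$; the sketch below (toy counts and thresholds, not project code) merges high-scoring bigrams such as "boston globe" into single tokens before training.

    # Minimal sketch: phrase detection by bigram scoring and merging.
    from collections import Counter

    sentences = [
        "the boston globe reported the story".split(),
        "she reads the boston globe every day".split(),
    ]

    uni = Counter(w for s in sentences for w in s)
    bi = Counter(b for s in sentences for b in zip(s, s[1:]))
    delta = 1.0       # discount that penalises very rare bigrams
    threshold = 0.2   # toy threshold chosen for this tiny corpus

    def merge_phrases(tokens):
        out, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens):
                a, b = tokens[i], tokens[i + 1]
                score = (bi[(a, b)] - delta) / (uni[a] * uni[b])
                if score > threshold:
                    out.append(a + "_" + b)   # e.g. "boston_globe"
                    i += 2
                    continue
            out.append(tokens[i])
            i += 1
        return out

    print(merge_phrases("the boston globe reported the story".split()))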

SLIDE 11

Questions?

SLIDE 12

References

  • 1. Mikolov, Tomas, et al. "Distributed representations of words and phrases and their compositionality." Advances in Neural Information Processing Systems. 2013.
  • 2. Mnih, Andriy, and Koray Kavukcuoglu. "Learning word embeddings efficiently with noise-contrastive estimation." Advances in Neural Information Processing Systems. 2013.
  • 3. Guthrie, David, Ben Allison, W. Liu, Louise Guthrie, and Yorick Wilks. "A closer look at skip-gram modelling." Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-2006), Genoa, Italy. 2006.
  • 4. Mikolov, Tomas, et al. "Efficient estimation of word representations in vector space." arXiv preprint arXiv:1301.3781 (2013).

Challenge description and data: https://www.kaggle.com/c/billion-word-imputation

SLIDE 13

Hidden Markov Models

  • 1. States: parts of speech
  • 2. Combine Word2Vec with HMM (a minimal sketch of how POS states can re-rank candidate words follows below)
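
A minimal, illustrative sketch of the idea (hand-made toy probabilities, not a trained model or the project's implementation): an HMM over part-of-speech states re-ranks candidate words for a gap between two known tags; the candidates would normally come from the Word2Vec language model.

    # Minimal sketch: POS-based re-ranking of candidate words for a gap.
    transitions = {          # P(next POS | current POS), toy values
        ("DET", "NOUN"): 0.7, ("DET", "VERB"): 0.05,
        ("NOUN", "PREP"): 0.4, ("VERB", "PREP"): 0.3,
    }
    emissions = {            # P(word | POS), toy values
        ("NOUN", "volunteer"): 0.01, ("VERB", "volunteer"): 0.001,
        ("NOUN", "ran"): 0.0005, ("VERB", "ran"): 0.02,
    }

    def gap_score(prev_tag, next_tag, word, tag):
        # Score a candidate (word, tag) placed between two known POS tags.
        return (transitions.get((prev_tag, tag), 1e-6)
                * emissions.get((tag, word), 1e-6)
                * transitions.get((tag, next_tag), 1e-6))

    # "... to a ____ at the shelter": previous tag DET ("a"), next tag PREP ("at").
    candidates = ["volunteer", "ran"]
    best = max(
        ((w, t) for w in candidates for t in ("NOUN", "VERB")),
        key=lambda wt: gap_score("DET", "PREP", wt[0], wt[1]),
    )
    print(best)   # ('volunteer', 'NOUN') under these toy numbers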
SLIDE 14

Skip-Gram Method

  • Vocabulary size is V
  • Hidden layer size is N
  • Input vector: a one-hot encoded vector, i.e. only one node of $\{x_1, x_2, \ldots, x_V\}$ is 1 and all others are 0
  • The weights between the input layer and the hidden layer are represented by a $V \times N$ matrix $W$ (the sketch below shows how the one-hot input simply selects one row of $W$)
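
A tiny numpy sketch (toy sizes, not project code) of what the one-hot input does: multiplying by $W$ simply selects the input word's row.

    # Minimal sketch: a one-hot input selects one row of the V x N matrix W.
    import numpy as np

    V, N = 5, 3
    rng = np.random.default_rng(0)
    W = rng.normal(size=(V, N))     # input -> hidden weights

    x = np.zeros(V)
    x[2] = 1.0                      # one-hot vector for word index 2

    h = W.T @ x                     # hidden layer activation
    print(np.allclose(h, W[2]))     # True: h is simply the 3rd row of W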

SLIDE 15

Skip-Gram Method

  • $h = W^{\top} x = v_{w_I}$
  • $v_{w_I}$ is the vector representation of the input word $w_I$: the row of $W$ selected by the one-hot input
  • $u_k = {v'_{w_k}}^{\top} h$
  • $u_k$ is the score of each word $k$ in the vocabulary, where $v'_{w_k}$ is the $k$-th column of the hidden-to-output weight matrix $W'$
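
Putting the equations above together as a toy numpy forward pass (illustrative only): $h$ is the input word's row of $W$, the scores $u_k$ come from the columns of $W'$, and the softmax from Slide 8 turns the scores into probabilities.

    # Minimal sketch: Skip-gram forward pass with toy dimensions.
    import numpy as np

    V, N = 5, 3
    rng = np.random.default_rng(1)
    W = rng.normal(size=(V, N))        # input -> hidden weights
    W_prime = rng.normal(size=(N, V))  # hidden -> output weights

    x = np.zeros(V)
    x[2] = 1.0                         # one-hot input word

    h = W.T @ x                        # = row 2 of W, the input word's vector
    u = W_prime.T @ h                  # u_k = (k-th column of W')^T h
    p = np.exp(u - u.max()) / np.exp(u - u.max()).sum()
    print(p, p.sum())                  # probabilities over the V vocabulary words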