Date
Presenter: Zheng ZHANG Supervisors: Pierre ZWEIGENBAUM & Yue MA
Graph-Based Word Embeddings Learning
(Post-) Doctoral Seminar of Group ILES, LIMSI 10/04/2018
1
Graph-Based Word Embeddings Learning Presenter: Zheng ZHANG - - PowerPoint PPT Presentation
(Post-) Doctoral Seminar of Group ILES, LIMSI 10/04/2018 Graph-Based Word Embeddings Learning Presenter: Zheng ZHANG Supervisors: Pierre ZWEIGENBAUM & Yue MA 1 Date One year ago Our plan: Using graph-of-words for
Date
Presenter: Zheng ZHANG Supervisors: Pierre ZWEIGENBAUM & Yue MA
(Post-) Doctoral Seminar of Group ILES, LIMSI 10/04/2018
1
2
28/03/2017
terms of the document and whose edges represent co-
sliding window.
→ negative examples
co-occurrences for the context word selection, but not for the negative examples selection. 3
https://safetyapp.shinyapps.io/GoWvis/
!"# $(&'|&)) term in the Skip-gram objective.
distribution using logistic regression, where there are k negative examples for each data sample.
4
word_id word_count
5
word_id word_id lg($
%(&))
word count ((&) Heat map of the negative examples distribution $
%(&)
Same !
word_id word_id lg($%&' (% − %((*&&+,(+)
training words contexts distribution
6 Heat map of the word co-occurrence distribution
trained on the server prevert (50 threads used):
Word co-occurrence network (matrix) generation word2vec training 7
preprocessing, POS-tagging, weighted word co-
tool for that !
sentences, extract word pairs and define edges weights.
graph-tool) as a front end providing data to boost network generation speed. 8
9
28/03/2017
a short paper to ACL 2018)
Networks Using corpus2graph (to appear in TextGraphs 2018)
be available in GitHub (https://github.com/zzcoolj/corpus2graph) by the end of this week.
10