SLIDE 1

Enhanced Universal Dependency Parsing with Second-Order Inference and Mixture of Training Data

Xinyu Wang, Yong Jiang, Kewei Tu

School of Information Science and Technology, ShanghaiTech University
DAMO Academy, Alibaba Group

SLIDE 2

Our Parser

  • A second-order semantic dependency parser based on Wang et al. (2019)
  • Equips the parser with state-of-the-art contextual multilingual embeddings: XLM-R (Conneau et al., 2019); see the extraction sketch after the references below
  • Improves accuracy for the low-resource language (Tamil) by mixing its training set with another language (English/Czech)
  • Performs 0.6 ELAS better than the best parser in the official results after fixing the graph connectivity issues

[1]: Xinyu Wang, Jingxian Huang, and Kewei Tu. 2019. Second-order semantic dependency parsing with end-to-end neural networks.
[2]: Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Unsupervised cross-lingual representation learning at scale.
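A minimal sketch (not the authors' code) of extracting contextual XLM-R token embeddings with the Hugging Face transformers library; the model name and first-subword pooling are common choices assumed here:

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-large")
model = AutoModel.from_pretrained("xlm-roberta-large")

words = ["Enhanced", "dependency", "parsing"]  # a pre-tokenized sentence
encoding = tokenizer(words, is_split_into_words=True, return_tensors="pt")
with torch.no_grad():
    hidden = model(**encoding).last_hidden_state[0]  # (num_subwords, dim)

# Pool subwords back to words: keep each word's first subword vector
# (one common convention; the fast tokenizer tracks the alignment).
word_vecs, seen = [], set()
for pos, wid in enumerate(encoding.word_ids(0)):
    if wid is not None and wid not in seen:
        seen.add(wid)
        word_vecs.append(hidden[pos])
word_vecs = torch.stack(word_vecs)  # (num_words, dim), input to the parser
```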

SLIDE 3

Preprocessing: Empty Nodes

SLIDE 4

Preprocessing: Repeated Edges

SLIDE 5

Preprocessing

  • Tokenization: Stanza (Qi et al., 2020)
  • Multiple Treebanks: concatenate the datasets
  • Data split: each development set is split into halves, used as internal validation and test sets (see the sketch after the reference below)

[1]: Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton, and Christopher D. Manning. 2020. Stanza: A Python natural language processing toolkit for many human languages.
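A sketch of this preprocessing, assuming Stanza's standard pipeline API; the raw input text and helper name are illustrative:

```python
import stanza

stanza.download("ta")  # one-time model download for Tamil
nlp = stanza.Pipeline(lang="ta", processors="tokenize")

doc = nlp("raw text of the input document")
sentences = [[token.text for token in sent.tokens] for sent in doc.sentences]

def split_dev(dev_sentences):
    # Halve a development set: first half as internal validation,
    # second half as internal test.
    mid = len(dev_sentences) // 2
    return dev_sentences[:mid], dev_sentences[mid:]
```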

SLIDE 6

Approach (Wang et al., 2019)
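The slide's figure is not reproduced here. As a rough, simplified sketch of the underlying idea: second-order inference treats each candidate edge as a Bernoulli variable and runs a few mean-field iterations combining first-order edge scores with second-order scores over edge pairs. The single sibling-style factor and all names below are illustrative assumptions, not the paper's full factorization:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def mean_field(s_edge, s_sib, iterations=3):
    """s_edge[i, j]: first-order score for edge i -> j.
    s_sib[i, j, k]: second-order score for edges i -> j and i -> k
    sharing the head i (one of several factor types in the paper)."""
    q = sigmoid(s_edge)  # initial posterior from unary scores only
    for _ in range(iterations):
        # Message to edge (i, j): second-order scores of co-occurring
        # edges, weighted by their current posterior probabilities.
        pairwise = np.einsum("ik,ijk->ij", q, s_sib)
        q = sigmoid(s_edge + pairwise)
    return q  # approximate posterior probability of each edge
```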

SLIDE 7

Mixture of Training Data For Tamil

  • Problem: Tamil is low-resource, with only 400 training sentences
  • Solution: utilize a rich-resource language corpus
  • Multilingual embedding: XLM-R
  • Rich-resource languages: English (12k sents) or Czech (100k sents)
  • Remove the labels of dependency edges in the rich-resource training data
  • New training data: upsampled Tamil training data + rich-resource training data (see the mixing sketch after the references below)
  • Additional language-specific embeddings: Flair (Akbik et al., 2018) and fastText (Bojanowski et al., 2017)

[1]: Alan Akbik, Duncan Blythe, and Roland Vollgraf. 2018. Contextual string embeddings for sequence labeling.
[2]: Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information.
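A minimal sketch of the data mixture described above; the sentence-record format, upsampling factor, and function names are assumptions for illustration:

```python
import random

def strip_labels(sentences):
    # Drop dependency labels from the rich-resource data, keeping only
    # the unlabeled edge structure (head, dependent).
    return [
        {**s, "edges": [(h, d, None) for (h, d, _) in s["edges"]]}
        for s in sentences
    ]

def mix_training_data(tamil, rich, upsample=10, seed=0):
    # Upsample the ~400-sentence Tamil set, then concatenate it with
    # the unlabeled English or Czech training data and shuffle.
    mixed = tamil * upsample + strip_labels(rich)
    random.Random(seed).shuffle(mixed)
    return mixed
```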

SLIDE 8

Graph Connection

  • Original submission: graphs could be non-connected (the output kept all potential edges with probability > 0.5)
  • New solution: tree algorithms, namely Maximum Spanning Tree (MST) or Eisner's algorithm
  • First use MST or Eisner's algorithm to keep the graphs connected, then add the remaining potential edges with probabilities larger than 0.5 (see the sketch below)
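A sketch of this repair step, using NetworkX's maximum spanning arborescence as a stand-in for the MST option (the authors' implementation and Eisner's projective variant may differ):

```python
import networkx as nx

def connect_graph(n, prob):
    # prob[i][j]: model probability of dependency edge i -> j;
    # node 0 is the root, which no edge may enter.
    g = nx.DiGraph()
    for i in range(n):
        for j in range(1, n):
            if i != j:
                g.add_edge(i, j, weight=prob[i][j])
    # Step 1: a maximum spanning arborescence (Chu-Liu-Edmonds)
    # guarantees every word is connected to the root.
    tree = nx.maximum_spanning_arborescence(g)
    edges = set(tree.edges())
    # Step 2: add back every edge the model scored above 0.5,
    # recovering the multi-head edges of the original output.
    edges |= {(i, j) for i, j, w in g.edges(data="weight") if w > 0.5}
    return edges
```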

SLIDE 9

Results

SLIDE 10

Results

SLIDE 11

Mixture of Data Comparison

SLIDE 12

First-Order vs. Second-Order and Concatenating Other Embeddings

*: We use the labeled F1 score here, which is the metric for SDP (semantic dependency parsing)

SLIDE 13

Comparisons of Graph Connection Approaches (Treebank Level)

SLIDE 14

Comparisons of Graph Connection Approaches (Language Level)

SLIDE 15

Thank you

  • Paper: https://arxiv.org/abs/2006.01414