Multiword expressions:
Getting the taste of things to come MWE 2017 Workshop — Panel discussion
Multiword expressions: Getting the taste of things to come MWE 2017 - - PowerPoint PPT Presentation
Multiword expressions: Getting the taste of things to come MWE 2017 Workshop Panel discussion Outline 1. Announcements 2. SIGLEX MWE Section 3. Shared Task 2 Announcements 3 Phraseology and Multiword Expressions (PMWE)
Getting the taste of things to come MWE 2017 Workshop — Panel discussion
2
3
4/XTOTALX
http://langsci-press.org/catalog/series/pmwe Α new series with Language Science Press
Editors
“MWE representation and parsing” Yannick Parmentier and Yakub Waszczuk (editors) “Mutliword Expressions: Insights from a Multi-lingual Perspective” Manfred Sailer and Stella Markantonatou (editors)
5
6/XTOTALX
7
8
○ Organising and endorsing events:
■ *SEM, SemEval, MWE workshop, MUMTTT wroskhop
○ Adam Kilgarriff prize ○ 2 sections: SemEval, MWE
○ 8 people elected for 3 years ○ One representative per section ○ Skype meeting every 3 months
9
○ multiword-expressions@lists.sourceforge.net
○ Integration of PARSEME outcomes into a larger international context
○ MWE workshop (yearly) ○ Stabilizing the MUMTTT workshop ○ Others (shared tasks, books, joints events with other SIGs)?
10
be led by a single person
○ SIGLEX-MWE representative + 3-4 other people
○ naming organizers of the annual MWE workshop (to be approved by the SIGLEX board) ○ animating the community ○ maintaining the website and the mailing list
11
○ By the SIGLEX board upon a proposal of the previous members (?) ○ Advantage: balance can be ensured (of continents, language families, gender, age, CS/Ling expertize, etc.) ○ Drawback: non-democratic principle
○ Advantages: democratic principle ○ Drawbacks:
■ Balance not ensured ■ The elected people may have problems working as a team
12/XTOTALX
+ Simplicity
+ Transfer of competences ensured +
Important for the shared task infrastructure
elections/nominations
13/XTOTALX
○ working in the area, ○ balance wrt. languages, continents, gender, CS/linguistics background ○ Proposal by the SIGLEX-MWE representative, validation by SIGMEX board
○ As for SIGLEX board (nominating officers, electronic vote, …)
14/XTOTALX
15
specially shared task participants
○ What worked well? ○ What could have been better?
16/XTOTALX
○ 3 language families (Romance, Slavic, Germanic) + others ○ More than 60k annotated VMWEs in all languages
components of a VMWE
○ Allowing discontinuities, overlap, and nesting
○ 6 in the closed track ○ 1 in the open track
17/XTOTALX
○ Work organization into languages, language groups, etc. ○ Dynamic guidelines with multilingual examples ○ Customizable annotation platform FLAT ○ Dedicated tools to verify coherence and silence ○ File formats and evaluation tools ○ Communication tools: mailing lists, git issues, websites
18/XTOTALX
○ Definition of predicative nouns ○ Meaning shift for IReflV ○ ...
19/XTOTALX
○ Extension of first edition with additional and better data ○ Keep focus on token-based identification of VMWEs ○ To be submitted to SemEval 2018
○ New task definition ○ Extension to new MWE categories ○ To be submitted to CoNLL 2019 (?)
20/XTOTALX
○ English, Asian languages: Japanese, Chinese, Korean, Hindi ○ Other languages?
○ Intensive use of OTH category in some languages ○ Creation of language-specific categories (e.g. compound verbs) ○ Reformulation and clarification of LVC tests (see Issues)
○ Double annotation and/or mandatory coherence check
21/XTOTALX
○ Joint parsing and MWE identification ○ MWE and named entity identification
○ Adjectival, adverbial, nominal, terms, similes
22/XTOTALX
Spread the word!
23/XTOTALX
24