a method for grouping synonyms
play

A Method for Grouping Synonyms Ingrid Falk 1 , Claire Gardent 2 , - PowerPoint PPT Presentation

A Method for Grouping Synonyms Ingrid Falk 1 , Claire Gardent 2 , Evelyne Jacquey 3 , Fabienne Venant 4 1 INRIA / Universit e Nancy 2 2 CNRS / INRIA Nancy Grand-Est, Nancy 3 CNRS / ATILF, Nancy 4 Universit e Nancy 2 / INRIA, Nancy Grand-Est,


  1. A Method for Grouping Synonyms Ingrid Falk 1 , Claire Gardent 2 , Evelyne Jacquey 3 , Fabienne Venant 4 1 INRIA / Universit´ e Nancy 2 2 CNRS / INRIA Nancy Grand-Est, Nancy 3 CNRS / ATILF, Nancy 4 Universit´ e Nancy 2 / INRIA, Nancy Grand-Est, Nancy eLexicography in the 21st century Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 1 / 31

  2. Introduction and motivation Outline 1 Introduction and motivation Example 2 Approach 3 Resources 4 Method Extracting indexes. Similarity of two indexes. Reference sample. 5 Results Evaluation measures Reflexive usage. 6 Conclusion 7 Outlook Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 2 / 31

  3. Introduction and motivation Outline Objective Build a dictionary which assigns to each meaning of a word (given by a definition) the group of synonyms of that word which correspond to this meaning. A method to merge synonym dictionaries and a large coverage general purpose dictionary. ◮ 5 synonym dictionaries from the synonym base of the ATILF and ◮ the TLFi (Tr´ esor de la Langue Fran¸ caise informatis´ e). Result : ◮ A large coverage synonym dictionary where groups of synonyms of a word are associated with definitions. ◮ A general purpose dictionary assigning a group of synonyms to each meaning (definition) of a word. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 3 / 31

  4. Introduction and motivation Example Desired results Example : achever (finish, accomplish) Synonym dictionaries: abattre, aboutir, accomplir, aiguiser, am´ eliorer, an´ eantir, assommer, boucler, cesser, clore, clˆ oturer, compl´ ementer, compl´ eter, conclure, conduire, consommer, continuer, couronner, estoquer, exp´ edier, ex´ ecuter, finir, parachever, parfaire, perfectionner, raser, ruiner, r´ ealiser, r´ eussir, se taire, terminer, tuer. TLFi definitions: 1. Mettre la derni` ere main pour perfectionner (Finalize to improve.) 2. Porter un coup mortel ` a un animal d´ ej` a atteint physiquement; donner le coup de grˆ ace. (Give a mortal blow to an animal already physically damaged;) 3. Mener ` a sa fin, compl´ eter l’action de. (Lead to an end, complete the action.) Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 4 / 31 . . .

  5. Introduction and motivation Example Desired results Synonym groupings, attached to the meaning(s) given by the TLFi definitions: Synonym dictionaries abattre, aboutir, accomplir, accomplir, aiguiser, am´ eliorer, an´ eantir, assommer, boucler, cesser, clore, clˆ oturer, compl´ ementer, compl´ eter, conclure, conduire, consommer, continuer, couronner, estoquer, exp´ edier, ex´ ecuter, finir, parachever, parfaire, perfectionner, raser, ruiner, r´ ealiser, r´ eussir, se taire, terminer, tuer. TLFi definitions: 1. Mettre la derni` ere main pour perfectionner. (Finalize to improve.) 2. Porter un coup mortel ` a un animal d´ ej` a atteint physiquement; donner le coup de grˆ ace. (Give a mortal blow to an animal already achieved physically;) 3. Mener ` a sa fin, compl´ eter l’action de. (Lead to an end, complete the action.) Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 5 / 31

  6. Approach Outline 1 Introduction and motivation Example 2 Approach 3 Resources 4 Method Extracting indexes. Similarity of two indexes. Reference sample. 5 Results Evaluation measures Reflexive usage. 6 Conclusion 7 Outlook Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 6 / 31

  7. Approach Approach Input Synonym dictionaries: Synonym base from the ATILF General purpose dictionary: TLFi TLFi � definitions ≈ meaning (sense) Output A synonym dictionary which associates to each sense (definition) of a word the corresponding synonym group. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 7 / 31

  8. Approach Related work ◮ DicoSyn [Manguin et Al., 2004] ◮ Wolf [Sagot and Fiˇ ser, 2008] Differences DicoSyn: no synonym groupings, no associated definitions. Wolf : synonym groupings are obtained by translation to existing BalkaNet synsets, links to WordNet synsets. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 8 / 31

  9. Resources Outline 1 Introduction and motivation Example 2 Approach 3 Resources 4 Method Extracting indexes. Similarity of two indexes. Reference sample. 5 Results Evaluation measures Reflexive usage. 6 Conclusion 7 Outlook Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 9 / 31

  10. Resources Resources used 5 of 7 synonym dictionaries from the ATILF. Syn. Dic. verbs Syn/verb Bailly 2600 1. Benac 2656 1.5 Du Chazaud 3808 5.25 Larousse 3835 4.7 Le Petit Robert 5027 6. total 5736 11. but: no part of speech information, no definitions. TLFi ◮ 54 280 entries, 92 997 lemmas, 271 166 definitions. ◮ digitized, available online ( http://atilf.atilf.fr/ ), XML format ◮ glosses have been lemmatised and POS-tagged. but: few synonyms, information is not systematic. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 10 / 31

  11. Method Outline 1 Introduction and motivation Example 2 Approach 3 Resources 4 Method Extracting indexes. Similarity of two indexes. Reference sample. 5 Results Evaluation measures Reflexive usage. 6 Conclusion 7 Outlook Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 11 / 31

  12. Method Basic procedure Given: ◮ A verb V , ◮ A set of definitions D 1 V . . . D n V associated to V by the TLFi, ◮ The set of synonyms Syn 1 V . . . Syn m V associated to V by the synonym base. For each synonym Syn k V : Which are the definitions D i V for wich Syn k V is synonymous to V ? Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 12 / 31

  13. Method Basic procedure, ctd. 1. Extract TLFi definitions, convert to index. Index = list of content words, lemmatised. 2. Associate indexes to each definition and each synonym. ◮ A synonym’s index � ∪ of indexes of each of its definitions. 3. Measure the similarity of two indexes. Which definition D i V of V is most similar to the definitions of Syn k V ? 4. Associate synonyms and definitions. Each synonym is associated to those definitions D i V of V which are most similar to the synonym’s index. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 13 / 31

  14. Method Extracting the index. TLFi definitions � index 1. We use XML tags to extract TLFi definitions . 2. Definitions without a gloss, synonym- or domain indicators are discarded. 3. Index = list of lemmatised content words contained in the gloss, the synonym- and the domain indicators. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 14 / 31

  15. Method Extracting indexes. Examples TLFi entry of projeter (to project) Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 15 / 31

  16. Method Extracting indexes. Example: the verb projeter Extracted definitions and their indexes: Definition : Jeter loin en avant avec force. (To throw far ahead and with strength.) Index : � jeter, loin, avant, force � Definition : CIN. AUDIOVISUEL . Passer dans un projecteur. (To show on a projector.) Index : � cin´ ema, audiovisuel, passer, projecteur � Definition : ´ Eclaircir. Synon. jeter quelque lumi` ere. (To lighten, To throw some light.) Index : � ´ eclaircir, jeter, lumi` ere � Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 16 / 31

  17. Method Similarity of two indexes. Measuring the similarity of two indexes. Experiments with 2 types of similarity measures: 1. Overlap of lemmas (or lemma sequences) between the two indexes. 2. First and second order vector similarity measures based on semantic space models. Total of 6 similarity measures. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 17 / 31

  18. Method Similarity of two indexes. Which similarity measure works best? Building a reference. ◮ Gold standard as reference: 27 verbs, their definitions and for each definition the associated synonyms. ◮ Build triples � Verb, Definition, Synonym � . ◮ To a triple � V , D V , Syn V � we associate the value 1 if Syn V is considered synonymous to V with the sense given by D V , the value 0 else. Example: � achever, Mettre la derni` ere main pour perfectionner , accomplir � � 1 ( � perfect, Finalize to improve, accomplish � � 1) ◮ Triple ↔ value associations done by system and annotators ◮ Result comparisons done on the basis of standard evaluation measures: precision, recall and F-measure. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) A Method for Grouping Synonyms eLex’09 18 / 31

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend