grouping synonyms by definitions
play

Grouping Synonyms by Definitions Ingrid Falk 1 , Claire Gardent 2 , - PowerPoint PPT Presentation

Grouping Synonyms by Definitions Ingrid Falk 1 , Claire Gardent 2 , Evelyne Jacquey 3 , Fabienne Venant 4 1 INRIA / Universit e Nancy 2 2 CNRS / INRIA Nancy Grand-Est, Nancy 3 CNRS / ATILF, Nancy 4 Universit e Nancy 2 / INRIA, Nancy Grand-Est,


  1. Grouping Synonyms by Definitions Ingrid Falk 1 , Claire Gardent 2 , Evelyne Jacquey 3 , Fabienne Venant 4 1 INRIA / Universit´ e Nancy 2 2 CNRS / INRIA Nancy Grand-Est, Nancy 3 CNRS / ATILF, Nancy 4 Universit´ e Nancy 2 / INRIA, Nancy Grand-Est, Nancy Recent Advances in Natural Language Processing Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 1 / 31

  2. Introduction and motivation Outline Introduction and motivation 1 Example Approach 2 Resources 3 Method 4 Extracting indexes. Similarity of two indexes. Reference sample. Results 5 Evaluation measures Reflexive usage. Conclusion 6 Outlook 7 Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 2 / 31

  3. Introduction and motivation Outline Objective Build a synonym dictionary which assigns to each meaning of a word the group of synonyms of that word which correspond to this meaning. A method to merge synonym dictionaries and a large coverage general purpose dictionary. ◮ 5 synonym dictionaries from the synonym base of the ATILF and ◮ the TLFi (Tr´ esor de la Langue Fran¸ caise informatis´ e). Result : ◮ A large coverage synonym dictionary with definitions. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 3 / 31

  4. Introduction and motivation Example Desired results Example : achever (finish, accomplish) Synonym dictionaries: abattre, aboutir, accomplir, aiguiser, am´ eliorer, an´ eantir, assommer, boucler, cesser, clore, clˆ oturer, compl´ ementer, compl´ eter, conclure, conduire, consommer, continuer, couronner, estoquer, exp´ edier, ex´ ecuter, finir, parachever, parfaire, perfectionner, raser, ruiner, r´ ealiser, r´ eussir, se taire, terminer, tuer. TLFi definitions: 1. Mettre la derni` ere main pour perfectionner (Finalize to improve.) 2. Porter un coup mortel ` a un animal d´ ej` a atteint physiquement; donner le coup de grˆ ace. (Give a mortal blow to an animal already physically damaged;) 3. Mener ` a sa fin, compl´ eter l’action de. (Lead to an end, complete the action.) Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 4 / 31 . . .

  5. Introduction and motivation Example Desired results Synonym groupings, attached to the meaning(s) given by the TLFi definitions: Synonym dictionaries abattre, aboutir, accomplir, accomplir, aiguiser, am´ eliorer, an´ eantir, assommer, boucler, cesser, clore, clˆ oturer, compl´ ementer, compl´ eter, conclure, conduire, consommer, continuer, couronner, estoquer, exp´ edier, ex´ ecuter, finir, parachever, parfaire, perfectionner, raser, ruiner, r´ ealiser, r´ eussir, se taire, terminer, tuer. TLFi definitions: 1. Mettre la derni` ere main pour perfectionner. (Finalize to improve.) 2. Porter un coup mortel ` a un animal d´ ej` a atteint physiquement; donner le coup de grˆ ace. (Give a mortal blow to an animal already achieved physically;) 3. Mener ` a sa fin, compl´ eter l’action de. (Lead to an end, complete the action.) Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 5 / 31

  6. Approach Outline Introduction and motivation 1 Example Approach 2 Resources 3 Method 4 Extracting indexes. Similarity of two indexes. Reference sample. Results 5 Evaluation measures Reflexive usage. Conclusion 6 Outlook 7 Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 6 / 31

  7. Approach Approach Input Synonym dictionaries: Synonym base from the ATILF General purpose dictionary: TLFi TLFi � definitions ≈ meaning (sense) Output A synonym dictionary which associates to each sense (definition) of a word the corresponding synonym group. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 7 / 31

  8. Approach Related work ◮ DicoSyn [Manguin et Al., 2004] ◮ Wolf [Sagot and Fiˇ ser, 2008] Differences DicoSyn: no synonym groupings, no associated definitions. Wolf : synonym groupings are obtained by translation to existing BalkaNet synsets, links to WordNet synsets. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 8 / 31

  9. Resources Outline Introduction and motivation 1 Example Approach 2 Resources 3 Method 4 Extracting indexes. Similarity of two indexes. Reference sample. Results 5 Evaluation measures Reflexive usage. Conclusion 6 Outlook 7 Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 9 / 31

  10. Resources Resources used 5 of 7 synonym dictionaries from the ATILF. Syn. Dic. verbs Syn/verb Bailly 2600 1. Benac 2656 1.5 Du Chazaud 3808 5.25 Larousse 3835 4.7 Le Petit Robert 5027 6. total 5736 11. but: no part of speech information, no definitions. TLFi ◮ 54 280 entries, 92 997 lemmas, 271 166 definitions. ◮ digitized, available online ( http://atilf.atilf.fr/ ), XML format ◮ glosses have been lemmatised and POS-tagged. but: few synonyms, information is not systematic. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 10 / 31

  11. Method Outline Introduction and motivation 1 Example Approach 2 Resources 3 Method 4 Extracting indexes. Similarity of two indexes. Reference sample. Results 5 Evaluation measures Reflexive usage. Conclusion 6 Outlook 7 Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 11 / 31

  12. Method Basic procedure Given: ◮ A verb V , ◮ A set of definitions D 1 V . . . D n V associated to V by the TLFi, ◮ The set of synonyms Syn 1 V . . . Syn m V associated to V by the synonym base. For each synonym Syn k V : Which are the definitions D i V for wich Syn k V is synonymous to V ? Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 12 / 31

  13. Method Basic procedure, ctd. 1. Extract TLFi definitions, convert to index. Index = list of content words, lemmatised. 2. Associate indexes to each definition and each synonym. 3. A synonym’s index � ∪ of indexes of each of its definitions. 4. Measure the similarity of two indexes. Which definition D i V of V is most similar to the definition Syn k V ? 5. Associate synonyms and definitions. Each synonym is associated to those definitions D i V of V which are most similar to the synonym’s index. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 13 / 31

  14. Method Extracting the index. TLFi definitions � index 1. We use XML tags to extract TLFi definitions . 2. Definitions without a gloss, synonym- or domain indicators are discarded. 3. Index = list of lemmatised content words contained in the gloss, the synonym- and the domain indicators. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 14 / 31

  15. Method Extracting indexes. Examples TLFi entry of projeter (to project) Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 15 / 31

  16. Method Extracting indexes. Example: the verb projeter Extracted definitions and their indexes: Definition : Jeter loin en avant avec force. (To throw far ahead and with strength.) Index : � jeter, loin, avant, force � Definition : CIN. AUDIOVISUEL . Passer dans un projecteur. (To show on a projector.) Index : � cin´ ema, audiovisuel, passer, projecteur � Definition : ´ Eclaircir. Synon. jeter quelque lumi` ere. (To lighten, To throw some light.) Index : � ´ eclaircir, jeter, lumi` ere � Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 16 / 31

  17. Method Similarity of two indexes. Measuring the similarity of two indexes. Experiments with 2 types of similarity measures: 1. Overlap of lemmas (or lemma sequences) between the two indexes. 2. First and second order vector similarity measures with and without TF.IDF cut-off. Total of 6 similarity measures. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 17 / 31

  18. Method Similarity of two indexes. Which similarity measure works best? Building a reference. ◮ Gold standard as reference: 27 verbs, their definitions and for each definition the associated synonyms. ◮ Build triples � Verb, Definition, Synonym � . ◮ To a triple � V , D V , Syn V � we associate the value 1 if Syn V is considered synonymous to V with the sense given by D V , the value 0 else. Example: � achever, Mettre la derni` ere main pour perfectionner , accomplir � � 1 ( � perfect, Finalize to improve, accomplish � � 1) ◮ Triple ↔ value associations done by system and annotators ◮ Result comparisons done on the basis of standard evaluation measures: precision, recall and F-measure. Falk et al. (CNRS, INRIA, Universit´ e Nancy 2) Grouping Synonyms by Definitions RANLP’09 18 / 31

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend