extraction of author s definitions using indexed
play

Extraction of Authors Definitions Using Indexed Reference - PowerPoint PPT Presentation

Background Framework Implementation Demo Evaluation Conclusion Extraction of Authors Definitions Using Indexed Reference Identification Marc Bertin, Iana Atanassova and Jean-Pierre Descl es Paris-Sorbonne University, LaLIC Laboratory


  1. Background Framework Implementation Demo Evaluation Conclusion Extraction of Author’s Definitions Using Indexed Reference Identification Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory 18 September 2009, RANLP 2009 Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  2. Background Framework Implementation Demo Evaluation Conclusion Outline 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  3. Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  4. Background Framework Implementation Demo Evaluation Conclusion Studies on definition in the LaLIC laboratory (E. Cartier, 2004; T. Hacene 2008; C. Teissedre 2008) Implementation: several tools for segmentation and semantic annotation: SegaTex: G. Mourad 2001, B. Djioua 2006; Excom annotation platform: B. Djioua and J.-P. Descles 2006, M. Alrahabi 2008. Work in the field of Bibliosemantics (M. Bertin 2006-2009): identification and annotation of relations between authors based on bibliographic links. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  5. Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  6. Background Framework Implementation Demo Evaluation Conclusion Our aim is to establish links between authors by using indexed references in the text, and then identify the definitions and relate them to the authors. The method that we propose is based on the indexed references which allow us, in the case when we identify a definition in the research scope determined by the segmentation, to link this definition to the author cited in the text. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  7. Background Framework Implementation Demo Evaluation Conclusion Two relations: 1 relation between the definiendum , what is to be defined, and the definiens , what defines it. 2 relation between the definition itself and the author. We can associate a definition to an author. In this case we can talk about signed definitions . The bibliographic links give us a starting context or scope for the research of definitions. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  8. Background Framework Implementation Demo Evaluation Conclusion Linguistic Study of the Definition We have used the semantic map proposed by T. Hacene (2008). In the implementation we have used a part of this semantic map according to our purpose. The linguistic study of our corpus has led us to a better understanding of the distinction between a definition and a definatory characteristic , which has been taken in consideration for the construction of our linguistic resources. We define a definatory characteristic as a sentence that gives only some essential properties of the defined object. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  9. Background Framework Implementation Demo Evaluation Conclusion Linguistic Study of the Definition We have distinguished three sub-categories of the definatory characteristics: 1 identification 2 determined categorization 3 pseudo-definition Two sub-categories of the definition: 1 general definitions 2 axiomatic definitions Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  10. Background Framework Implementation Demo Evaluation Conclusion (Taouise Hacene, 2008) Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  11. Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  12. Background Framework Implementation Demo Evaluation Conclusion Processing Overview Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  13. Background Framework Implementation Demo Evaluation Conclusion Segmentation Segmentation tools: SegaTex (G. Mourad, 2001; B. Djioua 2006), Excom-2 (M. Alrahabi, 2008) Segmentation into sentences, paragraphs, sections. Segmentation rules based on the punctuation and capitalisation. Different languages (French, English, Bulgarian, Arabic, ... ) Input: text files Output: DocBook format, UTF8 encoding Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  14. Background Framework Implementation Demo Evaluation Conclusion Segmentation Output Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  15. Background Framework Implementation Demo Evaluation Conclusion Processing Overview Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  16. Background Framework Implementation Demo Evaluation Conclusion Processing Overview Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  17. Background Framework Implementation Demo Evaluation Conclusion Indexed Reference Identification - 1 Norms: ISO-690, ISO 690-2, AFNOR NF Z 44-005, AFNOR NF Z 44-005-2 Examples: (Hoc, 1990a), (Thom, 1970), (Dingwall et al., 1995; Hartmann and G¨ orlich, 1995), [24], Pickett-Heaps et al. (1990), (like other authors e.g. Raven, 1983), (Cwuc and SPRAGUE 1989), (18, 53, 56) Finite state automata and identification of known names entities Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  18. Background Framework Implementation Demo Evaluation Conclusion Indexed Reference Identification - 2 Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  19. Background Framework Implementation Demo Evaluation Conclusion Annotation Automatic annotation through exploration of the context: The Contextual Exploration Method (Descl´ es, 1997, 2006) Based on linguistic resources, which are manually constructed Resources: surface linguistic markers (indicators and clues) and contextual exploration rules Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  20. Background Framework Implementation Demo Evaluation Conclusion Annotation Excom annotation system (B. Djioua, 2006; M. Alrahabi, 2008). Available online: www.excom.fr Input: segmented XML files Output: annotated XML files Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  21. Background Framework Implementation Demo Evaluation Conclusion Contextual Exploration Rule: Example Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  22. Background Framework Implementation Demo Evaluation Conclusion Annotated sentence: Example Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  23. Background Framework Implementation Demo Evaluation Conclusion What can we do with the annotations? Information retrieval of definitions. Identify the definitions of a given notion. Sometimes the same notion has several different definitions, esp. in humanitarian sciences. For a given keyword, identify the domains in which it is used. Find the definitions related to an author. Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  24. Background Framework Implementation Demo Evaluation Conclusion System Overview: Interface and Exploitation Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  25. Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

  26. Background Framework Implementation Demo Evaluation Conclusion 1 Background 2 Framework 3 Implementation 4 Demo 5 Evaluation 6 Conclusion Marc Bertin, Iana Atanassova and Jean-Pierre Descl´ es Paris-Sorbonne University, LaLIC Laboratory RANLP 2009

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend