the treatment of word formation in the lila knowledge base
play

The Treatment of Word Formation in the LiLa Knowledge Base Eleonora - PowerPoint PPT Presentation

The Treatment of Word Formation in the LiLa Knowledge Base Eleonora Litta , Marco Passarotti and Francesco Mambrini DeriMo 2019 | FAL, Prague | 19-20 September 2019 Research question State of affairs 1 We have built and collected (for Latin


  1. The Treatment of Word Formation in the LiLa Knowledge Base Eleonora Litta , Marco Passarotti and Francesco Mambrini DeriMo 2019 | ÚFAL, Prague | 19-20 September 2019

  2. Research question State of affairs 1 We have built and collected (for Latin and other languages): ◮ Textual Resources ◮ Lexical Resources ◮ NLP Tools Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  3. Research question State of affairs 1 We have built and collected (for Latin and other languages): ◮ Textual Resources ◮ Lexical Resources ◮ NLP Tools Scattered and unconnected Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  4. Research need Making sense 2 To make sense of this quantity of empirical data: Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  5. Research need Making sense 2 To make sense of this quantity of empirical data: ◮ to extract maximum benefit from our research investments Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  6. Research need Making sense 2 To make sense of this quantity of empirical data: ◮ to extract maximum benefit from our research investments ◮ to impact and improve the life of Classicists through exploitable computational resources and tools Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  7. Research need Making sense 2 To make sense of this quantity of empirical data: ◮ to extract maximum benefit from our research investments ◮ to impact and improve the life of Classicists through exploitable computational resources and tools From Information to Knowledge Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  8. LiLa Knowledge Base Approach: Linked Data paradigm 3 2018-2023 A collection of interoperable linguistics resources (and NLP tools) described with the same vocabulary for knowledge description Interlinking as a Form of Interaction Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  9. LiLa Knowledge Base Conceptual and structural interoperability 4 LiLa is based on an ontology made of: Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  10. LiLa Knowledge Base Conceptual and structural interoperability 4 LiLa is based on an ontology made of: ◮ Individuals : instances of objects (one specific token, lemma etc.) Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  11. LiLa Knowledge Base Conceptual and structural interoperability 4 LiLa is based on an ontology made of: ◮ Individuals : instances of objects (one specific token, lemma etc.) ◮ Classes : types of objects/concepts (token, lemma, PoS etc.) Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  12. LiLa Knowledge Base Conceptual and structural interoperability 4 LiLa is based on an ontology made of: ◮ Individuals : instances of objects (one specific token, lemma etc.) ◮ Classes : types of objects/concepts (token, lemma, PoS etc.) ◮ Data properties : attributes that objects can/must have (morphological features for lemmas/tokens) Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  13. LiLa Knowledge Base Conceptual and structural interoperability 4 LiLa is based on an ontology made of: ◮ Individuals : instances of objects (one specific token, lemma etc.) ◮ Classes : types of objects/concepts (token, lemma, PoS etc.) ◮ Data properties : attributes that objects can/must have (morphological features for lemmas/tokens) ◮ Object properties : ways in which classes and individuals can be related to one another: RDF triples. Labels from a restricted vocabulary of knowledge description: hasLemma , hasPoS Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  14. LiLa Knowledge Base Conceptual and structural interoperability 4 LiLa is based on an ontology made of: ◮ Individuals : instances of objects (one specific token, lemma etc.) ◮ Classes : types of objects/concepts (token, lemma, PoS etc.) ◮ Data properties : attributes that objects can/must have (morphological features for lemmas/tokens) ◮ Object properties : ways in which classes and individuals can be related to one another: RDF triples. Labels from a restricted vocabulary of knowledge description: hasLemma , hasPoS Each component of the ontology is uniquely identified through a URI. Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  15. LiLa Knowledge Base Lexically-based architecture and (meta)data sources 5 NLP_Tools Form/Lemma Lexical_Ress Token Morpho_Feats Textual_Ress Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  16. Word Formation Latin recap 6 WFL: Word formation-based lexical resource for Classical Latin Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  17. Word Formation Latin recap 6 WFL: Word formation-based lexical resource for Classical Latin ◮ WFRs are modelled as directed one-to-many input-output relations between lemmas (based on I&A model of grammatical description) Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  18. Word Formation Latin recap 6 WFL: Word formation-based lexical resource for Classical Latin ◮ WFRs are modelled as directed one-to-many input-output relations between lemmas (based on I&A model of grammatical description) ◮ Morphotactic approach: each WF process is treated individually as the application of one single rule in a certain order Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  19. WFL Online https://wfl.marginalia.it 7 ◮ Relationships between lemmas of the same “word formation family” are represented as the edges in a directed graph with a hierarchical tree-like structure Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  20. WFL Online https://wfl.marginalia.it 7 ◮ Relationships between lemmas of the same “word formation family” are represented as the edges in a directed graph with a hierarchical tree-like structure ◮ A node is a lemma, and an edge is the WFR used to derive the output lemma from the input one, together with any affix Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  21. WFL I&A Problems 8 But: directed graphs are not completely satisfactory in representing the full range of relationships included within a word formation family. Main problems: ◮ Directionality ◮ Non-linear derivations Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  22. Paradigmatic approach to WF: Requirements 9 Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  23. Paradigmatic approach to WF: Requirements 9 ◮ No directionality: necessary to accommodate those lemmas for which the derivational process is not of the simplex (or simpler) > complex type Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  24. Paradigmatic approach to WF: Requirements 9 ◮ No directionality: necessary to accommodate those lemmas for which the derivational process is not of the simplex (or simpler) > complex type ◮ The CELL has a central role in the paradigm (predictability and regularity) Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  25. Paradigmatic approach to WF: Requirements 9 ◮ No directionality: necessary to accommodate those lemmas for which the derivational process is not of the simplex (or simpler) > complex type ◮ The CELL has a central role in the paradigm (predictability and regularity) ◮ Each cell must be described in both its morphological characteristics and its semantic features, due to the underlying role of semantics in accounting for derivational processes Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  26. Word Formation in LiLa 10 Different approach to Word Formation: Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  27. Word Formation in LiLa 10 Different approach to Word Formation: ◮ Structure: declarative rather than procedural Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

  28. Word Formation in LiLa 10 Different approach to Word Formation: ◮ Structure: declarative rather than procedural ◮ No directionality Eleonora Litta, Marco Passarotti and Francesco Mambrini | The Treatment of Word Formation in the LiLa Knowledge Base

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend