Electronic Tools and Resources for Multi-Word Unit detection and - - PowerPoint PPT Presentation

electronic tools and resources for multi word unit
SMART_READER_LITE
LIVE PREVIEW

Electronic Tools and Resources for Multi-Word Unit detection and - - PowerPoint PPT Presentation

Electronic Tools and Resources for Multi-Word Unit detection and research in Serbian Jelena Mitrovic, University of Belgrade Serbian is one of the under-resourced languages when it comes to NLP many resources and tools are still being


slide-1
SLIDE 1

Electronic Tools and Resources for Multi-Word Unit detection and research in Serbian

Jelena Mitrovic, University of Belgrade

  • Serbian is one of the under-resourced languages when it comes to NLP –

many resources and tools are still being developed

  • Electronic MWUs dictionary – morphological dictionary with complex

prepositions, conjunctions, interjections, complex adjectives e.g. mrtav pijan ‘dead drunk’ and complex nouns e.g. nemasno mleko u prahu ’fat free powdered milk’

  • Serbian WordNet – percentage of MWUs approximately 32.5%
slide-2
SLIDE 2
  • Ontology of Rhetorical Figures for Serbian – unambiguous formal

description of 98 rhetorical figures in Serbian

  • Human annotation of rhetorical figures is not precise enough due to

their large number and similarities that exist – that is why an ontology is very helpful

  • Many rhetorical figures are MWUs
slide-3
SLIDE 3