Semantic Analysis for NLP-based Applications Johannes Leveling - PowerPoint PPT Presentation

Semantic Analysis for NLP-based Applications Johannes Leveling former affiliation: Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen) 58084 Hagen, Germany Johannes Leveling Semantic Analysis for NLP-based Applications 1 / 44

Outline Introduction The MultiNet Paradigm Applications based on Semantic NLP NLI-Z39.50 IRSAW DeLite GIRSA-WP Conclusions Johannes Leveling Semantic Analysis for NLP-based Applications 2 / 44

Background and General Strategy ◮ Deep semantic natural language processing → Knowledge and meaning representation MultiNet (concept-oriented) (Hel06) ◮ Supported by large semantically oriented computational lexicon ◮ Important requirements for meaning representation: ◮ Homogeneity: representation of lexical knowledge, general background knowledge (world knowledge), dialogue context, and meaning of sentences and texts with the same means ◮ Universality: independent of domain or language ◮ Cognitive adequate: concept -centered ◮ Interoperability: applicable to theoretic research of automatic NLP and in modules of applied AI systems Johannes Leveling Semantic Analysis for NLP-based Applications 3 / 44

MultiNet: Meaning Representation of Text MultiNet (Multilayered Extended Semantic Networks) characteristics: ◮ concepts: lexicalized and non-lexicalized, e.g. “c134”, “New_York.0”, “play.1.1”, “play.1.2”, “play.2.1” ◮ semantic relations/functions, e.g. AGT (agent), OBJ (neutral object), DUR (duration), ORNT (orientation), * IN (location-generating function) ◮ layer features, e.g. FACT (facticity of a concept), REFER (determination of reference), QUANT (quantificational content) ◮ semantic sorts, e.g. d (discrete object), ta (temporal abstractum) Johannes Leveling Semantic Analysis for NLP-based Applications 4 / 44

MultiNet: Selected Semantic Relations Relation Description association ASSOC attachment of object to object ATTCH change of sorts (property → abstract object) CHPA experiencer EXP an informational process or object MCONT neutral object OBJ predicative concept specifying a plurality PRED property relationship PROP meronymy PARS carrier of a state SCAR state specifier SSPE conceptual subordination for objects SUB conceptual subordination for situations SUBS synonymy SYNO temporal restriction for a situation TEMP ⋆ ALTN 1 an introduction of alternatives Johannes Leveling Semantic Analysis for NLP-based Applications 5 / 44

MultiNet: Tools and Resources ◮ WOCADI (Word Class Controlled Disambiguating Parser): Syntactic-semantic parser (Har03) ◮ HaGenLex (Hagen German Lexicon): Large semantic computational lexicon (HHO03) ◮ LiaPlus (Lexicon in action): Workbench for the computer lexicographer (Oss04) Johannes Leveling Semantic Analysis for NLP-based Applications 6 / 44

WOCADI: Semantic Analysis ◮ WOCADI parser produces semantic network representation from (German) texts, including ◮ resolution of anaphoric references (e.g. Peter = he ), ◮ analysis of idioms, support verb constructions (e.g. kick the bucket = lose one’s life = die ), ◮ structural and semantic decomposition of compound nouns and adjectives (e.g. swimming pool vs. Schwimmbecken ), ◮ identification of metonymy (lexicon support via meaning facets), ◮ analysis of deictic expressions (e.g. temporal: yesterday ) ◮ Applied to large corpora, e.g. CLEF-NEWS newspaper corpus (275,000 articles) and German Wikipedia (2006: 500,000 articles, 12 million sentences; 2009: 20 million sentences) ◮ Coverage: full semantic network for 54% of sentences, partial semantic network (chunks) for 34% Johannes Leveling Semantic Analysis for NLP-based Applications 7 / 44

� � � � � � WOCADI: Example Parse Result (German) In which year did Charles de Gaulle die? / In welchem Jahr starb Charles de Gaulle? ❝✸✶ ❞ ❞❡ ●❛✉❧❧❡ ❙❯❇ ▼❡♥s❝❤ ❝✸✸ ♥❛ ❝✻ ❞♥ st❛r❜  r❡❛❧  ❙❯❇ ◆❛❝❤♥❛♠❡ ❢❛❝t ❙❯❇❙ st❡r❜❡♥ ❆❋❋ ❆❚❚❘ ❣❡♥❡r s♣ � ❣❡♥❡r s♣ � ❚❊▼P ♣❛st✳✵ q✉❛♥t ♦♥❡   q✉❛♥t ♦♥❡  ❞❡t  r❡❢❡r ✶ [ ❣❡♥❡r s♣ ]   ❝❛r❞ ✶ ❝❛r❞ ✵   ❡t②♣❡ ✵ ❡t②♣❡ ✈❛r✐❛ ❝♦♥ ❚❊▼P ❆❚❚❘ ❱❆▲ ❝✺❄✇❤✲q✉❡st✐♦♥ ♠❡ ∨ ♦❛ ∨ t❛ ❏❛❤r ❝✸✷ ♥❛ ❙❯❇ ❏❛❤r ❙❯❇ ❱♦r♥❛♠❡ r❡❛❧  ❢❛❝t  ❉❡❴●❛✉❧❧❡✳✵ ❢❡ ❣❡♥❡r s♣ � ❣❡♥❡r s♣ � q✉❛♥t ♦♥❡ q✉❛♥t ♦♥❡   ❞❡t ✶ r❡❢❡r ❝❛r❞   ✶ ✵ ❝❛r❞ ❡t②♣❡ ✵ ❡t②♣❡ ❱❆▲ ❈❤❛r❧❡s✳✵ ❢❡ Johannes Leveling Semantic Analysis for NLP-based Applications 8 / 44

WOCADI: Example Parse Result (German, simplified) du.1.1 streß.1.1 psychisch.1.1 PROP SUB SUBS dokument.1.1 problem.1.1 PRED * A c3 c7 c6 L T N 1 prüfling.1.1 PRED EXP PRED c10 OBJ MCONT ATTCH *ALTN1 c2 c1 c5 c8 kandidat.1.1 SUBS E SCAR P S S c9 PRED SUB SUBS finden.1.1 c4 berichten.2.2 ASSOC prüfungskandidat.1.1prüfung.1.1 Finde Dokumente, die über psychische Probleme oder Stress von Prüfungskandidaten oder Prüflingen berichten. (GIRT topic 116) Johannes Leveling Semantic Analysis for NLP-based Applications 9 / 44

WOCADI: Example Parse Result (English, simplified) you stress mental PROP SUB SUBS document problem PRED * A c3 c7 c6 L T N 1 examinee PRED EXP PRED c10 OBJ MCONT ATTCH *ALTN1 c2 c1 c5 c8 candidate SUBS E SCAR P S S c9 PRED SUB SUBS find c4 report ASSOC exam “Find documents reporting on mental problems or stress of exam candidates or examinees.” (GIRT topic 116) Johannes Leveling Semantic Analysis for NLP-based Applications 10 / 44

HaGenLex: The Computational Lexicon ◮ HaGenLex is a semantically oriented (German) lexical resource ◮ Consists of multiple lexicons: ◮ full morpho-syntactic and semantic information (30,000 entries), ◮ additional flat lexicon (50,000 entries), ◮ name lexicons (350,000 entries in 50 classes) ◮ compound lexicon (about 500,000 entries; structure and semantics), Johannes Leveling Semantic Analysis for NLP-based Applications 11 / 44

HaGenLex: Sample Concepts ◮ essen.1.1 (eat): (Der Student) (ißt) (eine Schokolade). (The student) (eats) (a bar of chocolate). ◮ essen.1.2 (eat [one’s fill]): (Der Student) (ißt) sich (satt). (The student) (eats) his (fill). ◮ essen.2.1 (food): Das Kind hat kein Essen bekommen. The child did not get any food. ◮ essen.2.2 (diner): Das Essen am Abend dauerte 2 Stunden. The diner in the evening lasted 2 hours. ◮ fressen.1.1 (eat): (Der Hund) (frißt) (einen Knochen). (The dog) (eats) (a bone). ◮ fressen.1.2 (be crazy about sth.): (Die Großmutter) (frißt) (einen Narren) (an den Blumen). (Grandmother) (is crazy about) (flowers). Johannes Leveling Semantic Analysis for NLP-based Applications 12 / 44

HaGenLex: Excerpt from Entry essen.1.1 (eat) n-sign   � � ”essen” base morph   infl-para i129g   � v-syn �     main v-type  syn  haben perf-aux   v-control nocontr     � �   sem   sem entity nonment-action       ”essen.1.1” c-id             rel � agt �           � �  np-syn          cat np     syn       agr � case nom �       sel            � � ��  semsel sem   � �   semsel sem     entity human-object     select rel � aff �            � �    np-syn         cat np      syn     agr � case acc �              sel        � � ��  sem         semsel sem entity � sort co � Johannes Leveling Semantic Analysis for NLP-based Applications 13 / 44

Semantic Analysis for NLP-based Applications Johannes Leveling - PowerPoint PPT Presentation

Semantic Analysis for NLP-based Applications Johannes Leveling former affiliation: Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversitt in Hagen) 58084 Hagen, Germany Johannes Leveling Semantic

Semantic Analysis and Semantic Roles Ling 571 Deep Processing Techniques for NLP February 10,

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Application: Semantic Role Labeling CS 6956: Deep Learning for NLP Overview What is semantic

Adapting NLP Tools and Frame-Semantic Resources for the Semantic Analysis of Ritual Descriptions

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

Semantic Analysis Wilhelm/Seidl/Hack: Compiler Design Syntactic and Semantic Analysis,

Semantic Analysis/Checking Symbol tables Semantic analysis: the final part of analysis half of

Semantic Analysis CMSC 35100 Natural Language Processing May 8, 2003 Roadmap Semantic

Semantic Roles & Semantic Role Labeling Ling571 Deep Processing Techniques for NLP February

Semantic Role Labeling Deep Processing Techniques for NLP Ling571 February 27, 2017 Semantic

Introduction to NLP and NLG Introduction to NLP Rules or Statistics?? Lexical Analysis,

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Part-of-Speech Annotation Challenges in Marathi Gajanan Rane, Nilesh Joshi, Geetanjali Rane,

Engineering Analysis ENG 3420 Fall 2009 Dan C. Marinescu Office: HEC 439 B Office hours: Tu-Th

An Introduction to TeX and LaTeX: Lecture 1: The Basics Tuesday 11 October 2016 Background

EECS 4441 Human-Computer Interaction Topic #7: More on Formatting and Writing I. Scott MacKenzie

AP Stylebook Get to know your new best friend. What is AP style? AP style is a standardized

Course Content Structural Programming and Data Structures Introduction Vectors

GCSE 2020 Aims of this evening To know the format of the exams that students will be sitting

Writing P o etry Session 2 Sharing Homework (w/ partners) 3 Broad types of poetry Figurative

Semantic Analysis for NLP-based Applications Johannes Leveling - PowerPoint PPT Presentation

Semantic Analysis for NLP-based Applications Johannes Leveling former affiliation: Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversitt in Hagen) 58084 Hagen, Germany Johannes Leveling Semantic

Semantic Analysis and Semantic Roles Ling 571 Deep Processing Techniques for NLP February 10,

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Application: Semantic Role Labeling CS 6956: Deep Learning for NLP Overview What is semantic

Adapting NLP Tools and Frame-Semantic Resources for the Semantic Analysis of Ritual Descriptions

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

Semantic Analysis Wilhelm/Seidl/Hack: Compiler Design Syntactic and Semantic Analysis,

Semantic Analysis/Checking Symbol tables Semantic analysis: the final part of analysis half of

Semantic Analysis CMSC 35100 Natural Language Processing May 8, 2003 Roadmap Semantic

Semantic Roles &amp; Semantic Role Labeling Ling571 Deep Processing Techniques for NLP February

Semantic Role Labeling Deep Processing Techniques for NLP Ling571 February 27, 2017 Semantic

Introduction to NLP and NLG Introduction to NLP Rules or Statistics?? Lexical Analysis,

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Part-of-Speech Annotation Challenges in Marathi Gajanan Rane, Nilesh Joshi, Geetanjali Rane,

Engineering Analysis ENG 3420 Fall 2009 Dan C. Marinescu Office: HEC 439 B Office hours: Tu-Th

An Introduction to TeX and LaTeX: Lecture 1: The Basics Tuesday 11 October 2016 Background

EECS 4441 Human-Computer Interaction Topic #7: More on Formatting and Writing I. Scott MacKenzie

AP Stylebook Get to know your new best friend. What is AP style? AP style is a standardized

Course Content Structural Programming and Data Structures Introduction Vectors

GCSE 2020 Aims of this evening To know the format of the exams that students will be sitting

Writing P o etry Session 2 Sharing Homework (w/ partners) 3 Broad types of poetry Figurative

Semantic Roles & Semantic Role Labeling Ling571 Deep Processing Techniques for NLP February