Identifying, Finding and Encoding Semantic Relations Christiane - PDF document

Identifying, Finding and Encoding Semantic Relations Christiane Fellbaum Princeton University Questions • What kind of semantic knowledge does the NLP community need? • How to represent semantic knowledge? • How to expand on our present knowledge? 1

Assumptions Need a repository for word form-meaning pairs (lexicon) that serves as a standard for word sense representation, applications and evaluation Assumptions • Lexicon has a structure • Entities, events, properties are labeled (more or less) systematically • Structure and lexicalization patterns can be captured with semantic and lexical relations • Relations reflect (dis)similarities among labeled concepts in a fairly systematic way • Semantic similarity as reflected by relations is useful for WSD 2

WordNet--the Plus side • Broad coverage • Multilingual • Freely accessible • Continually enriched both by Princeton and the user community Limitations of WordNet • Too sparse: too few relations, too few links • Few syntagmatic (cross-POS) links • Links are not weighted • Many arcs are not directed ( dollar->green , ? green->dollar ) • Sense inventory is too fine-grained for current automatic WSD • Polysemy would be less of a problem if WordNet’s internal connectivity was greater 3

Sources of WordNet-style relations • Classical relations (Aristotle) • Lexical-semantic analysis of entities, events (causation, entailment, troponymy,...) • Finding examples via lexico-syntactic patterns (Cruse 1986; Hearst 1993) • Lexico-syntactic patterns presuppose specific relations WordNet connections based on human judgment • (Robust) word association norms • Human judgments of associations among WordNet concepts show many connections not currently encoded (WordNetPlus, Boyd- Graber et al. 2006) • Can’t all be easily classified or labeled! 4

Moving away from preconceived relations • Reconsider: structure of the lexicon • Which concepts are distinguished and labeled with words? • Discover systematic differences among concepts/words that can be encoded as relations Focus: Rigidity • Important meta-property for distinguishing concepts in ontology • Rigidity distinguishes Types vs. Roles e.g., DOLCE ontology (Guarino and Welty 2002), Generative Lexicon (Pustejovsky 1995) 5

Rigidity rigid entities: dog, orchid, man, shirts ,.. vs. non-rigid : pet, houseplant, teacher, laundry ,... Adjectives stage-level vs. individual-level (Carlson 1977) tall, intelligent, female ,... vs. married, tired, surprised , ... time-dependent: John is no longer tired/married/*tall/*intelligent 6

Rigid and non-rigid terms may be related via shared hypernym plant orchid violet *houseplant Rigid and non-rigid terms are compatible and not mutually exclusive: This is an orchid and a houseplant (type, role co-hyponyms) cf. * This is an orchid and a violet (type co-hyponyms) 7

Non-rigid properties are defeasible: This orchid is not a houseplant (type) *This orchid is not a plant (role) Type and role nouns noun are semantically similar when sharing a superordinate A given entity can be labeled with both kinds of nouns Useful for co-reference resolution Temporal relations 8

Encoding Role and type nouns can be systematically distinguished and encoded linked to shared superordinates with para(llel) relations (Cruse 1986): plant orchid houseplant Relations in the verb lexicon Lexicalization patterns show systematic, productive encoding of hyponymy (troponymy), causation Verb classes: Manner verbs Change-of-state verbs Can be distinguished via syntactic criteria 9

Another relation Analogous to Type-Role distinction Distinguish hyponyms (troponyms) from “purpose” verbs examples: exercise, control, greet, help, punish don’t encode manner or change-of state not productive (?) “purpose” verbs move exercise run running is necessarily a kind of moving (hyponym) running is not necessarily a kind of exercising 10

Co-Hyponyms defeasible/non-defeasible Run but not {exercise/*move} Wave but not {greet/*gesture} Scrub but not {clean/*rub} the table Fair amount of verb hyponyms in WN are defeasible But no systematic encoding, distinction Relation is not always captured by co-hyponymy • Find verbs expressing purpose • encode them in WordNet with “parallel” relation, following Cruse’s suggestion for Types and Role • Problem: such concepts are not systematically encoded 11

Finding examples via lexico- syntactic patterns V-ing is (not) V-ing to V is (not) to V V-ing is a way of V-ing Patterns are also valid for regular hyponyms but few such pairs are extracted (for pragmatic reasons?) Web examples spraying the action with a little WD-40 is not cleaning shake hands , using the right hand, and explain that his is a way of greeting one another tipping, leaving a gratuity, is a way of thanking people for their service 12

Boyd-Graber, J., Fellbaum, C., Osherson, D. and Schapire, R. (2006). Adding dense, weighted connections to WordNet. Proceedings of the 3rd Global WordNet Association, Jeju, Korea. Carlson, G. (1977). Reference to Kinds in English . University of Massachusetts, Amherst Dissertation. Cruse, Alan (1986). Lexical Semantics . Cambridge: Cambride University Press. Fellbaum, Christiane (Ed., 1998). WordNet . Cambridge, MA: MIT Press. Fellbaum, Christiane (2002a). Parallel hierachies in the verb lexicon. in: K. Simov, (Ed.), Proceedings of the Ontolex02 Workshop, LREC, Las Palmas, Spain. . Fellbaum, Christiane (2002b). On the semantics of troponymy. In: Green, R. et al. (Eds.) Relations. Dordrecht: Kluwer. Fellbaum, C. (2003). Distinguishing Verb Types in a Lexical Ontology. Proceedings of the Conference on the Generative Lexicon, Geneva, Switzerland Guarino, N. and Welty, C. (2002), Evaluating Ontological Decisions with OntoClean . Communications of the ACM 45:2, 61-65. Hearst, M. (1992). Automatic acquisition of hyponyms from large text corpora . COLING 92, 539-545. Pustejovsky, J. (1995). The Generative Lexicon . Cambridge, MA: MIT Press. 13

Identifying, Finding and Encoding Semantic Relations Christiane - PDF document

Identifying, Finding and Encoding Semantic Relations Christiane Fellbaum Princeton University Questions What kind of semantic knowledge does the NLP community need? How to represent semantic knowledge? How to expand on our

61A Extra Lecture 4 Announcements Encoding Strings Representing Strings: UTF-8 Encoding 4

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Learning of Semantic Relations between Statistical Techniques Ontology Concepts using

Language and Computers Relation to language Encoding written language Prologue: Encoding

Language and Computers Relation to language Encoding written Prologue: Encoding Language

Deep Encode: Machine Learning for Per-Title Encoding Daniel Silhavy| IBC20| Per-Title Encoding

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Module 13 Introduction to Semantic Technology, Ontologies and the Semantic Web Module 13 Outline

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

Finding Hidden Supernovae with Finding Hidden Supernovae with Finding Hidden Supernovae with

Semantic Analysis and Semantic Roles Ling 571 Deep Processing Techniques for NLP February 10,

Semantic Analysis Wilhelm/Seidl/Hack: Compiler Design Syntactic and Semantic Analysis,

RDF, RDFS and OWL: Graph Data Models for the Semantic Web Semantic Web: The Idea Semantic

Formalizing Subsumption Relations in Ontologies using Type Classes in Coq Richard Dapoigny 1

Infrastructure Matias Frosterus, Mirja Anttila, Mikko Lappalainen, Susanna Nykyri, Tuomas

From knowledge of humans performing everyday activities to the service robots Lisset Salinas

Machine Learning for Big Data (CSE 547 / STAT 548)

Volunteerism This session is about opportunities for volunteer activities that can help to

Minimal subshifts, Sch utzenberger groups and profinite semigroups Dominique Perrin Groups

Designing and Using an Audio-Visual Description Core Ontology Antoine Isaac & Raphal

GCT535- Sound Technology for Multimedia Music and Audio Alignment Graduate School of Culture

Sambuz

Useful Links

Newsletter

Mail Us

Identifying, Finding and Encoding Semantic Relations Christiane - PDF document

Identifying, Finding and Encoding Semantic Relations Christiane Fellbaum Princeton University Questions What kind of semantic knowledge does the NLP community need? How to represent semantic knowledge? How to expand on our

61A Extra Lecture 4 Announcements Encoding Strings Representing Strings: UTF-8 Encoding 4

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Learning of Semantic Relations between Statistical Techniques Ontology Concepts using

Language and Computers Relation to language Encoding written language Prologue: Encoding

Language and Computers Relation to language Encoding written Prologue: Encoding Language

Deep Encode: Machine Learning for Per-Title Encoding Daniel Silhavy| IBC20| Per-Title Encoding

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Module 13 Introduction to Semantic Technology, Ontologies and the Semantic Web Module 13 Outline

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

Finding Hidden Supernovae with Finding Hidden Supernovae with Finding Hidden Supernovae with

Semantic Analysis and Semantic Roles Ling 571 Deep Processing Techniques for NLP February 10,

Semantic Analysis Wilhelm/Seidl/Hack: Compiler Design Syntactic and Semantic Analysis,

RDF, RDFS and OWL: Graph Data Models for the Semantic Web Semantic Web: The Idea Semantic

Formalizing Subsumption Relations in Ontologies using Type Classes in Coq Richard Dapoigny 1

Infrastructure Matias Frosterus, Mirja Anttila, Mikko Lappalainen, Susanna Nykyri, Tuomas

From knowledge of humans performing everyday activities to the service robots Lisset Salinas

Machine Learning for Big Data (CSE 547 / STAT 548)

Volunteerism This session is about opportunities for volunteer activities that can help to

Minimal subshifts, Sch utzenberger groups and profinite semigroups Dominique Perrin Groups

Designing and Using an Audio-Visual Description Core Ontology Antoine Isaac &amp; Raphal

GCT535- Sound Technology for Multimedia Music and Audio Alignment Graduate School of Culture

Sambuz

Useful Links

Newsletter

Mail Us

Designing and Using an Audio-Visual Description Core Ontology Antoine Isaac & Raphal