montague grammar
play

Montague Grammar Stefan Thater Blockseminar Underspecification - PowerPoint PPT Presentation

Montague Grammar Stefan Thater Blockseminar Underspecification 10.04.2006 Overview Introduction Type Theory A Montague-Style Grammar Scope Ambiguities Summary Introduction The basic assumption underlying Montague


  1. Montague Grammar Stefan Thater Blockseminar “Underspecification” 10.04.2006

  2. Overview • Introduction • Type Theory • A Montague-Style Grammar • Scope Ambiguities • Summary

  3. Introduction • The basic assumption underlying Montague Grammar is that the meaning of a sentence is given by its truth conditions. - “Peter reads a book” is true iff Peter reads a book • Truth conditions can be represented by logical formulae - “Peter reads a book” → ∃ x(book(x) ∧ read(p*, x)) • Indirect interpretation: - natural language → logic → models

  4. Compositionality • An important principle underlying Montague Grammar is the so called “principle of compositionality” The meaning of a complex expression is a function of the meanings of its parts, and the syntactic rules by which they are combined (Partee & al, 1993)

  5. Compositionality John reads a book John reads a book [[ John reads a book ]] = reads a book C 1 ([[ John]], [[reads a book]] ) = C 1 ([[ John]], C 2 ([[reads]] , [[a book]] ) = a book C 1 ([[ John]], C 2 ([[reads]] , C 3 ([[a]], [[book]]))

  6. Representing Meaning • First order logic is in general not an adequate formalism to model the meaning of natural language expressions. • Expressiveness - “John is an intelligent student” ⇒ intelligent(j*) ∧ stud(j*) - “John is a good student” ⇒ good(j*) ∧ stud(j*) ?? - “John is a former student” ⇒ former(j*) ∧ stud(j*) ??? • Representations of noun phrases, verb phrases, … - “is intelligent” ⇒ intelligent( ∙ ) ? - “every student” ⇒ ∀ x(student(x) ⇒ ⋅ ) ???

  7. Type Theory • First order logic provides only n-ary first order relations, which is insufficient to model natural language semantics. • Type theory is more expressive and flexible – it provides higher-order relations and functions of different kinds. • Some type theoretical expressions - “John is a good student” ⇒ good(student)(j*) - “is intelligent” ⇒ intelligent - “every student” ⇒ λ P ∀ x(student(x) ⇒ P(x))

  8. Types • A set of basic types, for instance {e, t} - e is the type of individual terms (“entity”) - t is the type of formulas (“truth value”) • The set T of types is the smallest set such that - if σ is a basic type, then σ is a type - if σ , τ are types, then ‹ σ , τ › is a type • The type ‹ σ , τ › is the type of functions that map arguments of type σ to values of type τ .

  9. Some Example Types • One-place predicate constant: sleep, walk, student, … - ‹e, t› • Two-place relation: read, write, … - ‹e, ‹e,t›› • Attributive adjective: good, intelligent, former, … - ‹‹e,t›, ‹e,t››

  10. Vocabulary • Pairwise disjoint, possibly empty sets of non-logical constants: - Con τ , for every type τ • Infinite and pairwise disjoint sets of variables: - Var τ , for every type τ • Logical constants: - ∀ , ∃ , ∧ , ¬, …, λ

  11. Syntax • For every type τ , we define the set of meaningful expressions ME τ as follows: - Con τ ⊆ ME τ and Var τ ⊆ ME τ , for every type τ - If α ∈ ME ‹ σ , τ › and β ∈ ME σ , then α ( β ) ∈ ME τ . - If A, B ∈ ME t , then so are ¬A, (A ∧ B), (A ⇒ B), … - If A ∈ ME t , then so are ∀ xA and ∃ xA, where x is a variable of arbitrary type. - If α , β are well-formed expressions of the same type, then α = β ∈ ME t . - If α ∈ ME τ and x ∈ Var σ , then λ x α ∈ ME ‹ σ , τ › .

  12. Some Examples • “John works.” j* ∈ ME e work ∈ ME ‹e, t› work(j*) • “Every student works.” every ∈ ME ‹‹e, t›, ‹‹e, t›, t› student ∈ ME ‹e, t› every(student) ∈ ME ‹‹e, t›, ‹‹e, t›, t› work ∈ ME ‹e, t› every(student)(work) ∈ ME t

  13. Semantics • Let U be a non-empty set of entities. For every type τ , the domain of possible denotations D τ is given by - D e = U - D t = {0,1} - D ‹ σ , τ › = the set of functions from D σ to D τ • A model structure is a structure M = (U M , V M ) - U M is a non-empty set of individuals - V M is a function that assigns every non-logical constant of type τ an element of D τ . • Variable assignment g: Var τ → D τ

  14. Semantics • Let M be a model structure and g a variable assignment - [[ α ]] M,g = V M ( α ), if α is a constant - [[ α ]] M,g = g( α ), if α is a variable - [[ α ( β )]] M,g = [[ α ]] M,g ([[ β ]] M,g ) - [[¬ φ ]] M,g = 1 iff [[ φ ]] M,g = 0 - [[ φ ∧ ψ ]] M,g = 1 iff [[ φ ]] M,g = 1 and [[ ψ ]] M,g = 1, etc. - [[ ∃ v φ ]] M,g = 1 iff there is a ∈ D τ such that [[ φ ]] M,g[v/a] = 1 - [[ ∀ v φ ]] M,g = 1 iff for all a ∈ D τ , [[ φ ]] M,g[v/a] = 1 - [[ α = β ]] M,g = 1iff [[ α ]] M,g = [[ β ]] M,g

  15. � Semantics of λ -Expressions • Let M be a model structure and g a variable assignment - If α ∈ ME τ and v ∈ Var σ , then [[ λ v α ]] M,g is that function f from D σ to D τ such that for any a ∈ D σ , f(a) = [[ α ]] M,g[v/a| • “Syntactic shortcut:” β -reduction - ( λ x φ )( ψ ) ≡ φ [ ψ /x] if all free variables in ψ are free for x in φ - A variable y is free for x in φ if no free occurence of x in ψ is in the scope of a ∃ y, ∀ y, λ y

  16. Noun Phrases • “John works” → work(j*) • “A student works.” → ∃ x(student(x) ∧ work(x)) • “Every student works.” → ∀ x(student(x) ⇒ work(x)) • “John and Mary work.” → work(j*) ∧ work(m*)

  17. Noun Phrases • Using λ -abstraction, noun phrases can be given a uniform interpretation as “generalized quantifiers” - “John” → λ P .P(j*) - “A student” → λ P ∃ x(student(x) ∧ P(x)) - “Every student” → λ P ∀ x(student(x) ⇒ P(x)) - “John and Mary” → λ P .P(j*) ∧ P(m*)

  18. Noun Phrases • “John works” λ P .P(j*) ∈ ME ‹‹e, t›, t› work ∈ ME ‹e, t› ( λ P .P(j*))(work) ∈ ME t work(j*) ∈ ME t • “Every student works.” λ P ∀ x(student(x) ⇒ P(x)) ∈ ME ‹‹e, t›, t› work ∈ ME ‹e, t› ( λ P ∀ x(student(x) ⇒ P(x)))(work) ∈ ME t ∀ x(student(x) ⇒ work(x)) ∈ ME t

  19. Determiners • Determiners like “a,” “every,” “no” denote higher order functions taking (denotations of) common nouns and return a higher order relation. - “every” → λ P λ Q ∀ x(P(x) ⇒ Q(x)) - “some” → λ P λ Q ∃ x(P(x) ∧ Q(x)) - “no” → λ P λ Q¬ ∃ x(P(x) ∧ Q(x)) • “Every student” λ P λ Q ∀ x(P(x) ⇒ Q(x)) student ( λ P λ Q ∀ x(P(x) ⇒ Q(x)))(student) λ Q ∀ x(student(x) ⇒ Q(x))

  20. A Montague-Style Grammar for a Fragment of English

  21. Syntactic Component • Montague Grammar is based upon (a particular version of) categorial grammar. • The set of categories is the smallest set such that - S, IV, CN are categories - If A, B are categories, then A/B is a category • Some categories - IV/T [= TV] transitive verbs - S/IV [= T] terms (= noun phrases) - T/CN determiners

  22. Lexicon • For each category A, we assume a possibly empty set B A of basic expressions of category A. • For instance - B T = { John, Mary, he 0 , he 1 , … } - B CN = { student, man, woman, … } - B IV = { sleep, work, … } - B IV/T = { read, … } - B T/CN = { a, every, no, the, … }

  23. Syntactic Rules (Simplified) • General rule schema: - B A ⊆ P A - If α ∈ P A and δ ∈ P B/A , then δα ∈ P B • “Every student works” every student works, S every student, S/IV works, IV every, (S/IV)/CN student, CN

  24. Translation into Type Theory • A translation of natural language into type theory is a homomorphism that assigns each α ∈ P A an α ’ ∈ ME f(A) • f maps categories to types as follows - f(S) = t - f(CN) = f(IV) = ‹e, t› - f(A/B) = ‹f(B), f(A)›

  25. Translation: Lexical Categories • “John” → λ P .P(j*) • “every” → λ P λ Q ∀ x(P(x) ⇒ Q(x)) • “a” → λ P λ Q ∃ x(P(x) ∧ Q(x)) • “student” → student • “book” → book • “works” → work • …

  26. Translation: Phrasal Categories • Syntactic rule: - If α ∈ P A and δ ∈ P B/A , then δα ∈ P B B � � '( � ') • Corresponding translation rule: B/A � � ' A � � ' - If α → α ’, δ → δ ’, then δα → δ ’( α ’)

  27. “Every student works” every student works, S every student, S/IV works, IV every, (S/IV)/CN student, CN • “every” → λ P λ Q ∀ x(P(x) ⇒ Q(x)) • “student” → student • “every student” → λ P λ Q ∀ x(P(x) ⇒ Q(x))(student) = λ Q ∀ x(student(x) ⇒ Q(x)) • “every student works” → λ Q ∀ x(student(x) ⇒ Q(x))(work) = ∀ x(student(x) ⇒ work(x))

  28. Transitive Verbs • Transitive verbs have category IV/T (= IV/(S/IV)), the corresponding type is ‹‹‹e, t›, t›, ‹e, t›› • On the other hand, transitive verbs like “read,” “present,” … denote a two-place first order relation (type ‹e, ‹e, t››) - “John reads a book” → ∃ y(book(y) ∧ read(y)(j*)) • “read” → λ Q λ x.Q( λ y.read*(y)(x)) - read* ∈ ME ‹e, ‹e, t››

  29. “Every student reads a book” every student reads a book, S every student, T read a book, IV every, T/CN student, CN reads, IV/T a book, T a, T/CN book, CN

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend