Statistical Parsing
Grammars and grammar formalisms Çağrı Çöltekin
University of Tübingen Seminar für Sprachwissenschaft
October 27, 2016
Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
This course is about …
NP NP JJ statistical NN constituency CC and NN dependency NN parsing PP IN
- f
NP NN natural NN languages nmod amod case nmod conj cc amod
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 1 / 31 Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
Why do we need syntactic parsing?
- Often, syntactic analysis is an intermediate step helping
(semantic) interpretation of sentences hence it is useful for applications like question answering, information extraction
- (Statistical) parsers are also used as language models for
applications like speech recognition and machine translation
- It can be used for grammar checking, and can be a useful tool
for linguistic research
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 2 / 31 Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
Ingredients of a parser
- A grammar
- An algorithm for parsing
- A method for ambiguity resolution
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 3 / 31 Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
Grammars
The term grammar is used for,
- a description of the whole system/structure of a
language—as in a ‘grammar (book) of English’
- a grammar formalism, that are often developed as theory
- f language—as in HPSG, LFG, CCG
- A formal (fjnite) specifjcation of a language as a possibly
infjnite set of strings (not necessarily a natural language)
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 4 / 31 Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
Plan of the lecture
- Constituency grammars
- Dependency grammars
- Brief notes on some major grammar formalisms
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 5 / 31 Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
Constituency grammars
- Constituency grammars are
probably the most studied grammars both in linguistics, and computer science
- The main idea is that a group of
words form natural groups, or ‘constituents’, like no noun phrases
- r word phrases
- phrase structure grammars or
context-free grammars are often used as synonyms S NP John VP V saw NP Marry
Note: many grammar formalisms use constituency grammars in some way, we will not focus on a particular grammar formalism here.
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 6 / 31 Recap Introduction Constituency grammars Dependency grammars Grammar formalisms Finale
What is a constituency
Linguists ofger a number of tests for constituency, such as
- They can answer questions:
Q: ‘What did John do? →A: ‘saw Marry’
but, presumably, no question with answer ‘John saw’
- Substitution with a pronoun forms:
Q: ‘John [read the book] last week? →A: ‘John [did that] last week.’
- Fronting, topicalization:
‘John likes [reading books]’ →‘[Reading books], John likes’
- Coordination:
John [saw Marry] and [said ‘hi’]
- …
Note, however, these tests are leaky, e.g., ‘[John saw] and [Peter greated] Marry’ (see Müller 2016, for more examples).
Ç. Çöltekin, SfS / University of Tübingen October 27, 2016 7 / 31