Grammars, graphs and automata
Mark Johnson
Brown University
ESSLLI 2005 slides available from http:/ /cog.brown.edu/˜mj
1
High-level overview
- Probability distributions and graphical models
- (Probabilistic) finite state machines and context-free grammars
– computation (dynamic programming) – estimation
- Log-linear models
– stochastic unification-based grammars – reranking parsing
- Weighted CFGs and proper PCFGs
2
Topics
- Graphical models and Bayes networks
- (Hidden) Markov models
- (Probabilistic) context-free grammars and finite-state machines
- Computation with and estimation of PCFGs
- Lexicalized and bi-lexicalized PCFGs
- Non-local dependencies and log-linear models
- Features in reranking parsing
- Stochastic unification-based grammar
- Weighted CFGs and proper PCFGs
3
What is computational linguistics?
Computational linguistics studies the computational processes involved in language production, comprehension and acquisition.
- assumption that language is inherently computational
- scientific side:
– modeling human performance (computational psycholinguistics) – understanding how it can be done at all
- technological applications:
– speech recognition – information extraction (who did what to whom) and question answering – machine translation (translation by computer)
4