Natural Language Processing: How do humans process language? - - PowerPoint PPT Presentation

natural language processing how do humans process language
SMART_READER_LITE
LIVE PREVIEW

Natural Language Processing: How do humans process language? - - PowerPoint PPT Presentation

Natural Language Processing: How do humans process language? SCIENCE PASSION TECHNOLOGY Natural Language Processing: How do humans process language? Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 Philipp Gabler


slide-1
SLIDE 1

Natural Language Processing: How do humans process language?

SCIENCE PASSION TECHNOLOGY

Natural Language Processing: How do humans process language?

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 1

slide-2
SLIDE 2

Natural Language Processing: How do humans process language?

Outline

1 Motivation 2 Models of human language 3 Practical Connections to NLP

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 2

slide-3
SLIDE 3

Motivation

What does NLP have to do with humans, at all?

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 3

slide-4
SLIDE 4

Motivation

Fundamental questions of linguistics What do you know when you know a language? What do you know when you understand an uterance?

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 4

slide-5
SLIDE 5

Motivation

Linguistics & NLP Too much theory is bad? But why? “Every time I fire a linguist, the performance of the speech processing system goes up.” (Frederick Jelinek) Does it mean we should refrain from linguistic inspiration? (NLP already does that. Ask a linguist.)

  • Cf. the good, bad, and ugly parts of artificial neural networks

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 5

slide-6
SLIDE 6

Motivation

Levels of Abstraction Linguists and Engineers tend to have different focus Computational: what is explained? Description of linguistic performance vs. explanation of linguistic competence Algorithmic: how is it done? Cognitive realism, computational complexity/efficiency Implementational: how is is realized? Neurological plausibility

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 6

slide-7
SLIDE 7

Motivation

What this lecture is about A very short introduction to: Grammar theory What is language built of? Cognitive linguistics How does language work in the mind?

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 7

slide-8
SLIDE 8

Motivation

Insights from linguistics Get a beter understanding of what should work in language processing Afer all, it’s natural language processing Comparison gives confidence: NLU system behaviour vs. L1 acquisition Observation of similar effects/errors, e.g., garden path sentences Human performance is the ultimate (utopic?) benchmark! We’re not inventing something new...

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 8

slide-9
SLIDE 9

Motivation

Insights for linguistics We don’t yet know how human language really works Very conflicting hypotheses, most of which work only on a computational level New ideas: Shallow processing Distributed, implicit, usage-based knowledge Computational construction grammar Computational semantics (λ calculus)

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 9

slide-10
SLIDE 10

Motivation

Some words of caution Be warned! This is will be an extremely rough, simplified, and incomplete overview It is biased in favour of Cognitive Linguistics (and a bit against Generative Grammar) Linguistic theory is not rigorously formal “Theory” = “proposed descriptive model”, not “axiomatic system” Be aware of writen-language bias!

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 10

slide-11
SLIDE 11

Models of human language

Some examples from different areas of linguistics and cognitive science

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 11

slide-12
SLIDE 12

Models of human language

Cognitive & Linguistic Development Cognitive abilities develop in similar ways Typical progress: Statistical learning (expectation & surprise) Inductive learning (categorization & abstraction) Social learning (imitation, intention, theory of mind) Sensomotory system has an important influence in learning! Critical periods vs. extreme robustness

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 12

slide-13
SLIDE 13

Models of human language

Language acquisition Language learning tends to follow a U-shaped progress Phases: Simplification: How do you do dese...work/tortillas/in English Overgeneralization: Yesterday I didn’t painting; it noises Restructuring How do you...make this/like it; how...do cut it

  • Cf. exploration vs. exploitation in reinforcement learning

Computational and associative learning

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 13

slide-14
SLIDE 14

Models of human language

Creolization processes

Figure: Hotel room signs in Tok Pisin (Papua New Guinea)

https://commons.wikimedia.org/wiki/File: Tok-Pisin_New-Guinea-Pidgin_Pidgin-English_Melanesian-Pidgin_Papua-New-Guinea-Hotel-Room-Door-Sign_(DSC_3096).jpg Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 14

slide-15
SLIDE 15

Models of human language

Linguistic nativism Is langage1 special? Is language based on common cognitive machanisms? Categorization, association, memory, hierarchy... Or is there a specialized, innate language mechanism? Mental grammar, language acquisition device, Universal Grammar

1This is not a typo, but French.

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 15

slide-16
SLIDE 16

Models of human language

Universal Grammar Generative Grammar = trees + transformations Grammatical construal in terms of rules from deep structure to surface structure Exlaining all languages in terms of principles and parameters Solution to fast, one-shot L1 acquisition

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 16

slide-17
SLIDE 17

Models of human language

Triangles in the brain? CP ¯ C IP ¯ I VP ¯ V VP ¯ V V waited V have I should NP ¯ N N children Det The C

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 17

slide-18
SLIDE 18

Models of human language

Triangles in the brain? CP ¯ C IP ¯ I VP ¯ V VP ¯ V V waited V t I V have NP ¯ N N children Det The C

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 18

slide-19
SLIDE 19

Models of human language

Limits of Universal Grammar Criticism of this kind of analysis Explicitely not empirical (at least by Chomsky) Against “behaviourism”, focus on competence Tends to categorize everything in terms of recursive symbolic structures Good for English – what about Chinese? Pirahã? Conversational English? Computationally complex, cognitively... difficult to explain

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 19

slide-20
SLIDE 20

Models of human language

Pushing the Boundaries of Generative Grammar Language processing is basically an inverse problem: Colorless green ideas sleep furiously The Sally hugged him the Thomas Time flies like an arrow The apartment that the maid who the service had sent over was decorated Keine Kopfverletzung ist zu harmlos um sie nicht zu ignorieren

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 20

slide-21
SLIDE 21

Models of human language

Representations of meaning Language is conveying mental state through symbols Grammar is only an “artifact” to structure the transportation of mental state Or: only an instrument for performative uterance Semantics from a cognitive perspective: meaning is... perspectivic (relative to uterance context) dynamic (system changes with environment) encyclopedic (association with experiences & culture) determined by usage (a system derived from concrete experience)

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 21

slide-22
SLIDE 22

Models of human language

Aspects of Cognitive Grammar Some cognitive approaches to semantics and grammar How is meaning represented? Prototypes, radial networks, schemata, ... Metaphor How is meaning expressed through form? Construction grammar, grammatical construal, usage-based grammar... Information structure

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 22

slide-23
SLIDE 23

Models of human language

Information Structure (aka Information Packaging) Conveying more information beyond denotation Intonation can focus different parts of an uterance John only introduced Bill to Sue John only introduced Bill to Sue John only introduced Bill to Sue John only introduced Bill to Sue Differences in meaning independent of linguistic form!

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 23

slide-24
SLIDE 24

Models of human language

Information Structure (aka Information Packaging) Constructions that relate meaning in conversation2 Different pragmatic practices are associated with: As for John, he lost his wallet What happened was that John lost his wallet What John did was lose his wallet It was John who lost his wallet What John lost was his wallet

2See Martin Hilpert’s lectures: https://www.youtube.com/watch?v=PJecXZp_SYw

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 24

slide-25
SLIDE 25

Models of human language

Construction Grammar Constructions everywhere Constructions are paterns whose form or meaning is not strictly predictable from their components: He has whiffled my borogroves completely vorpal again *The knife chopped the carrots into the salad Embedded items are coerced: There was cat all over the road She smiled herself an upgrade

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 25

slide-26
SLIDE 26

Models of human language

Metaphors Not just arbitrary idioms and poetry! We understand things in terms of metaphor, and use it all the time3 Abstract term = container An argument has a hole, has less substance, does not have content To find something in an argument Argument = journey The content of the argument proceeds, path to the core of the argument, the direction has no substance

3See Metaphors we live by by John Lakoff

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 26

slide-27
SLIDE 27

Models of human language

Further sources of insight Acquiring meta-linguistic intuition Observe your own & others using language If you’re interested: go to the linguistics department; e.g., Sprache und Kognition Grammatiktheorie & Sprachtypologie Sprachen der Welt Also useful: more languages (for grammar, not talking)

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 27

slide-28
SLIDE 28

Applications

What does theory have to do with NLP, at all?

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 28

slide-29
SLIDE 29

Practical Connections to NLP

Overall assessment We have now already seen some ideas that agree: Statistical learning (“usage based”) Associative learning (“context based”) Shallow processing (no creation of deep structures) Now: some works of theory transfer from linguistics to NLP

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 29

slide-30
SLIDE 30

Practical Connections to NLP

Information structure + Generative Grammar Modeling Information Structure In A Cross-Linguistic Perspective4 Formalized version HPSG + Information structure Improve machine translation across multiple languages Information structure facilitates fluency in contiguous speech

4doi: 10.5281/ZENODO.818365

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 30

slide-31
SLIDE 31

Practical Connections to NLP

Construction Grammar + Semantics Computational construction grammar for visual question answering5 Based on computational construction grammar Mapping questions onto their executable semantic representations Constructions succintly capture form-meaning pairs in a domain

5doi: 10.1515/lingvan-2018-0070

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 31

slide-32
SLIDE 32

Practical Connections to NLP

Generative Grammar + Parsing Head-Driven Statistical Models for Natural Language Parsing6 Actual parsing using a Generative Grammar formalism Probabilistic context-free grammars to lexicalized grammars Parse tree represented as sequence of decisions corresponding to a head-centered, top-down derivation of the tree UG isn’t dead yet

6doi: 10.1162/089120103322753356

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 32

slide-33
SLIDE 33

Practical Connections to NLP

Fluid Construction Grammar + Agent-Based Modelling Linguistic Assessment Criteria for Explaining Language Change7 ... A Case Study on Syncretism in German Definite Articles. Evolution of the German definite article paradigm Agent-based simulation of communicative interactions (“language games”), implemented with Fluid Construction Grammar CxG can provide explanations for variation & change

7doi: 10.1163/22105832-13030106

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 33

slide-34
SLIDE 34

Practical Connections to NLP

Metaphor + Corpora Psychologically Motivated Text Mining8 Corpus-based learning of paterns of metaphorical framing Detection of the structure of metaphorical associations through clustering Metaphors are useful and detectable empirically

8arXiv: 1609.09019

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 34

slide-35
SLIDE 35

Practical Connections to NLP

More Cognitive Linguistics Cognitive approach to natural language processing Several essays, mostly on semantics in NLP Word association, disambiguation, frequency estimation, stylistic analysis...

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 35

slide-36
SLIDE 36

Practical Connections to NLP

More Metaphor Processing Metaphor: A Computational Perspective9 Introduction & special topics on metaphor in AI, NLP, and corpus linguistics “Researchers can build figurative-language processing systems that are practical and efficient and cognitively plausible”

9doi: 10.2200/S00694ED1V01Y201601HLT031

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 36

slide-37
SLIDE 37

Thank You!

Implementations are waiting for you. Next: ???

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 37

slide-38
SLIDE 38

References

1.

  • B. Sharp, F. Sedes, and W. Lubaszewski, Eds., Cognitive approach to natural language processing. London: ISTE

Press; Elsevier, 2017. 2.

  • J. Nevens, P. Van Eecke, and K. Beuls, Computational construction grammar for visual question answering,

Linguistics Vanguard, vol. 5, no. 1, 2019 3.

  • M. Collins, Head-Driven Statistical Models for Natural Language Parsing, Computational Linguistics, vol. 29, no.

4, pp. 589–637, 2003 4.

  • R. van Trijp, Linguistic Assessment Criteria for Explaining Language Change: A Case Study on Syncretism in

German Definite Articles, Language Dynamics and Change, vol. 3, no. 1, pp. 105–132, 2013 5.

  • S. Song, Modeling Information Structure In A Cross-Linguistic Perspective. Berlin: Language Science Press,

2017. 6.

  • E. Shutova and P. Lichtenstein, Psychologically Motivated Text Mining, arXiv:1609.09019 [cs], 2016

7.

  • T. Veale, E. Shutova, and B. Beigman Klebanov, Metaphor: A Computational Perspective. Morgan Claypool,

2016.

Philipp Gabler <pgabler@student.tugraz.at> 2020-05-07 38