cognitive complexity of linguistic patterns
play

Cognitive Complexity of Linguistic Patterns Artificial Grammar - PDF document

AGL Workshop 1 Cognitive Complexity of Linguistic Patterns Artificial Grammar Learning Workshop Max Planck Institute for Psycholinguistics 2324 November 2010 Slide 1 James Rogers Dept. of Computer Science Earlham College


  1. AGL Workshop 1 Cognitive Complexity of Linguistic Patterns Artificial Grammar Learning Workshop Max Planck Institute for Psycholinguistics 23–24 November 2010 Slide 1 James Rogers Dept. of Computer Science Earlham College jrogers@cs.earlham.edu http://cs.earlham.edu/~jrogers/slides/agl.ho.pdf This work completed, in part, while in residence at the Radcliffe Institute for Advanced Study Joint work with: • Geoffrey K. Pullum School of Philosophy, Psychology and Language Sciences University of Edinburgh • Marc D. Hauser Depts. of Psychology, Organismic & Evolutionary Biology and Biological Anthropology Slide 2 Harvard University • Jeffery Heinz Dept. of Linguistics and Cognitive Science University of Delaware • Gil Bailey, Matt Edlefsen, Magaret Fero, Molly Visscher, David Wellcome, Aaron Weeden and Sean Wibel Dept. of Computer Science Earlham College

  2. AGL Workshop 2 Cognitive Complexity of Simple Patterns Sequences of ‘ A ’s and ‘ B ’s which end in ‘ B ’: S 0 − → AS 0 , S 0 − → BS 0 , S 0 − → B B A B ( A + B ) ∗ B A Slide 3 Sequences of ‘ A ’s and ‘ B ’s which contain an odd number of ‘ B ’s: S 0 − → AS 0 , S 0 − → BS 1 , S 0 − → B, S 1 − → AS 1 , S 1 − → BS 0 , S 1 − → A A A B ( A ∗ BA ∗ BA ∗ ) ∗ A ∗ BA ∗ B Finite State Automata and Regular Grammars A A B A B B A A A B A A State N Y Slide 4 A A B S 0 S 1 B S 0 − → AS 0 , S 0 − → BS 1 , S 0 − → B, S 1 − → AS 1 , S 1 − → BS 0 , S 1 − → A

  3. AGL Workshop 3 Some More Simple Patterns Sequences of ‘ A ’s and ‘ B ’s which contain at least one ‘ B ’: S 0 − → AS 0 , S 0 − → BS 1 , S 0 − → B, S 1 − → AS 1 , S 1 − → BS 1 , S 1 − → A, S 1 − → B A, B A B A ∗ B ( A + B ) ∗ Slide 5 Sequences of ‘ A ’s and ‘ B ’s which contain exactly one ‘ B ’: S 0 − → AS 0 , S 0 − → BS 1 , S 0 − → B, S 1 − → AS 1 , S 1 − → A A, B A A B B A ∗ BA ∗ Cognitive Complexity from First Principles What kinds of distinctions does a cognitive mechanism need to be sensitive to (attend to) in order to classify an event with respect to a pattern? Slide 6 Reasoning about patterns • What objects/entities/things are we reasoning about? • What relationships between them are we reasoning with?

  4. AGL Workshop 4 Some Assumptions about (Proto-)Linguistic Behaviors • Perceive/process/generate linear sequence of (sub)events Slide 7 • Can model as strings—linear sequence of abstract symbols – Positions—Discrete linear order (initial segment of N ). – Labeled with alphabet of events Partitioned into subsets, each the set of positions at which a particular event occurs. Dual characterizations of complexity classes Computational classes • Characterized by abstract computational mechanisms • Equivalence between mechanisms • Means to determine structural properties of stringsets Slide 8 Descriptive classes • Characterized by the nature of information about the properties of strings that determine membership • Independent of mechanisms for recognition • Subsume wide range of types of patterns

  5. AGL Workshop 5 Local and Piecewise Hierarchies Reg MSO SF FO LTT Slide 9 LT PT Propositional SL SP Restricted Prop. Fin Local (+1) Piecewise ( < ) Stringset inference experiments { A, B } ∗ AABBB ∅ ∅ ABABAB AABBBA A m B n 2 | ( m + n ) A m B n A n B n AAABBB | w | A = | w | B I F Slide 10 A n B n n ≤ 3 AABBBB I = { A n B n | n ≥ 1 } F = { AAABBB } D =?

  6. AGL Workshop 6 Formal Issues for AGL Experiments Design • Identifying relevant classes of patterns • Finding minimal pairs of stringsets • Finding sets of stimuli that distinguish those stringsets Slide 11 Interpretation • Identifying the class of patterns subject has generalized to • Inferring the properties of the cognitive mechanism involved – properties common to all mechanisms capable of identifying that class of patterns Inferences from AGL experiments Subject successfully generalizes a pattern in a given complexity class: • Mechanism is sensitive to features characteristic of class. • Does not imply that subject can generalize every pattern in that class. – Other processing factors may interfere. Slide 12 Subject consistently fails to generalize patterns in a given class: • Suggests mechanism is not sensitive to features characteristic of class. • Inability to generalize may be due to interfering factors. – Complexity of patterns properly in class may exceed other limitations of processing.

  7. AGL Workshop 7 Assumptions • Inferred set is not arbitrary Slide 13 • Principle determining membership is structural • Inference exhibits some sort of minimality Yawelmani Yokuts (Kissberth’73) ⋆ CCC Σ ∗ CCC Σ ∗ V Σ ∗ C C C Slide 14 V C CC CCC V V CCV CV V CV CCV CV CV CCV CCV V V CV CCV CV V CV CCC ⋆ V CV CV CCV CCV V V CV

  8. AGL Workshop 8 Adjacency—Substrings Definition 1 ( k -Factor) v is a factor of w if w = uvx for some u, v ∈ Σ ∗ . v is a k -factor of w if it is a factor of w and | v | = k .  { v ∈ Σ k | ( ∃ u, x ∈ Σ ∗ )[ w = uvx ] } if | w | ≥ k, F k ( w ) def  = { w } otherwise .  Slide 15 F k ( w ) is the set of length k substrings (contiguous) of w (or just w itself if length of w < k ). AB ABAB F 2 ( ABABAB ) = { AB, BA } F 7 ( ABABAB ) = { ABABAB } Strictly Local Stringsets—SL Strictly k -Local Definitions G ⊆ F k ( { ⋊ } · Σ ∗ · { ⋉ } ) = G def w | ⇐ ⇒ F k ( ⋊ · w · ⋉ ) ⊆ G L ( G ) def = { w | w | = G} A stringset L is Strictly k -Local iff membership depends Slide 16 solely on the k -factors that are permitted. G ( AB ) n = { ⋊ A, AB, BA, B ⋉ } * AB ABAB ABBAB ⋊ ⋉ ⋊ ⋉ Membership in an SL k stringset depends only on the individual k -factors which actually occur in the string.

  9. AGL Workshop 9 Scanners a b a b a b a b a a b a b a b a b a · · · · · · b k k D Q G : · · · ∈ Slide 17 φ · · · a a · · · b b · · · a b · · · k Recognizing an SL k stringset requires only remembering the k most recently encountered symbols. Character of Strictly k -Local Sets Theorem (Suffix Substitution Closure): A stringset L is strictly k -local iff whenever there is a string x of length k − 1 and strings w , y , v , and z , such that k − 1 Slide 18 ���� w · x · y ∈ L · · ∈ L v x z then it will also be the case that · · ∈ L w x z

  10. AGL Workshop 10 Examples of Suffix Substitution The dog · likes · the biscuit ∈ L Alice · likes · Bob ∈ L The dog · likes · Bob ∈ L Slide 19 But: The dog · likes · the biscuit ∈ L Bob, Alice · likes · ∈ L ε ⋆ The dog · likes · �∈ L ε SL Hierarchy Definition 2 ( SL ) A stringset is Strictly k -Local if it is definable with an SL k definition. A stringset is Strictly Local (in SL) if it is SL k for some k . Slide 20 Theorem 1 (SL-Hierarchy) SL 2 � SL 3 � · · · � SL i � SL i +1 � · · · � SL Every Finite stringset is SL k for some k : Fin ⊆ SL. There is no k for which SL k includes all Finite languages.

  11. AGL Workshop 11 Alawa 0 ⋊ σ σ σσ ⋉ ´ σ ´ σ ⋊ ´ σ σ ⋉ 3 σ ⋆ ⋊ σ σ ⋉ Slide 21 1 ´ σ G Alawa = { ⋊ σσ, ⋊ σ ´ σ, ⋊ ´ σσ, σσσ, σσ ´ σ, σ ´ σσ, σ 4 ⋊ ´ σ ⋉ , ´ σσ ⋉ } σ 2 Some syllable must get primary stress k − 1 0 σ � �� � ⋊ σ 1 σ 0 · · · σ 0 σ 2 ⋉ ´ k − 1 Slide 22 � �� � ´ ⋊ ´ σ 0 · · · σ 0 σ σ 2 σ 1 ⋉ k − 1 σ , ´ σ � �� � σ 0 · · · σ 0 ⋆ ⋊ σ 1 σ 1 ⋉ 1

  12. AGL Workshop 12 Cognitive interpretation of SL • Any cognitive mechanism that can distinguish member strings from non-members of an SL k stringset must be sensitive, at least, to the length k blocks of events that occur in the presentation of the string. Slide 23 • If the strings are presented as sequences of events in time, then this corresponds to being sensitive, at each point in the string, to the immediately prior sequence of k − 1 events. • Any cognitive mechanism that is sensitive only to the length k blocks of events in the presentation of a string will be able to recognize only SL k stringsets. Strictly Local Stress Patterns Heinz’s Stress Pattern Database (ca. 2007)—109 patterns 9 are SL 2 Abun West, Afrikans, . . . Cambodian,. . . Maranungku 44 are SL 3 Alawa, Arabic (Bani-Hassan),. . . Slide 24 24 are SL 4 Arabic (Cairene),. . . 3 are SL 5 Asheninca, Bhojpuri, Hindi (Fairbanks) 1 is SL 6 Icua Tupi 28 are not SL Amele, Bhojpuri (Shukla Tiwari), Ara- bic Classical, Hindi (Keldar), Yidin,. . . 72% are SL, all k ≤ 6. 49% are SL 3 .

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend