Individual differences in phoneme categorization
Effie Kapnoula, Bob McMurray, Eunjong Kong, Matthew Winn, & Jan Edwards
19th Mid-Continental Phonetics & Phonology Conference
Individual differences in phoneme categorization Effie Kapnoula, - - PowerPoint PPT Presentation
Individual differences in phoneme categorization Effie Kapnoula, Bob McMurray, Eunjong Kong, Matthew Winn, & Jan Edwards 19th Mid-Continental Phonetics & Phonology Conference The problem of lack of invariance There is no one-to-one
Effie Kapnoula, Bob McMurray, Eunjong Kong, Matthew Winn, & Jan Edwards
19th Mid-Continental Phonetics & Phonology Conference
frequencies) and the perceived phoneme
Hillenbrand, Getty, Clark & Wheeler, 1995
frequencies) and the perceived phoneme
frequencies) and the perceived phoneme
bar
par
1 2 3 4 5 6 7 par bar
frequencies) and the perceived phoneme
+Simple solution +Fast commitment
bar
par
1 2 3 4 5 6 7 par bar
Werker & Tees, 1987; Joanisse et al, 2000; López-Zamora et al, 2010
transitions) and the perceived phoneme
+Simple solution +Fast commitment
bar
par
1 2 3 4 5 6 7 par bar
bar
par
1 2 3 4 5 6 7 par bar
transitions) and the perceived phoneme
+Simple solution +Fast commitment
+Flexibility +Late commitment +Keep useful within-category information
bar
par
1 2 3 4 5 6 7 par bar
bar
par
1 2 3 4 5 6 7 par bar
5 10 15 20 25 30 35 40 0.02 0.03 0.04 0.05 0.06 0.07 0.08
VOT (ms)
Category Boundary
Response = Response = Looks to Looks to Competitor Fixations
McMurray, Tanenhaus & Aslin (2002)
Werker & Tees, 1987; Joanisse et al, 2000; López-Zamora et al, 2010
Kong, E. J., & Edwards, J. (2011)
Kong, E. J., & Edwards, J. (2011)
Listeners are capable of gradient categorization of phonemes The VAS task allows for this gradiency to be expressed in participants’ responses
Listeners are capable of gradient categorization of phonemes The VAS task allows for this gradiency to be expressed in participants’ responses
Listeners are capable of gradient categorization of phonemes The VAS task allows for this gradiency to be expressed in participants’ responses
Establish a way of quantifying gradiency via the VAS task
Listeners are capable of gradient categorization of phonemes The VAS task allows for this gradiency to be expressed in participants’ responses
Establish a way of quantifying gradiency via the VAS task 1. Investigate possible sources of gradiency (e.g. executive function) 2. Link gradiency to multiple cue use 3. Examine whether gradiency is good or bad for speech perception
labial alveolar Real words bull-pull den-ten Nonwords buv-puv dev-tev CVs buh-puh deh-teh
F0 steps 5 4 3 2 1 1 2 3 4 5 6 7 VOT steps
labial alveolar Real words bull-pull den-ten Nonwords buv-puv dev-tev CVs buh-puh deh-teh bull pull pull bull
Trail making task (cognitive flexibility) N-Back task (working memory) Flanker task (inhibition)
non-speech cognitive processes
Trail making task (cognitive flexibility) N-Back task (working memory) Flanker task (inhibition) AZ-bio (sentences in babbling noise - 1:1 STN ratio)
non-speech cognitive processes
Trail making task (cognitive flexibility) N-Back task (working memory) Flanker task (inhibition) AZ-bio (sentences in babbling noise - 1:1 STN ratio)
non-speech cognitive processes
10 20 30 40 0 10 20 30 40 50 60 70 80 90100 10 20 30 40 0 10 20 30 40 50 60 70 80 90100 10 20 30 40 0 10 20 30 40 50 60 70 80 90100 10 20 30 40 0 10 20 30 40 50 60 70 80 90100
Sub 8 Sub 9 Sub 7 Sub 68
F0 steps 5 4 3 2 1 1 2 3 4 5 6 7 VOT steps F0 steps 5 4 3 2 1 1 2 3 4 5 6 7 VOT steps VOT steps F0 steps θ s VOT steps F0 steps θ s
F0 steps 5 4 3 2 1 1 2 3 4 5 6 7 VOT steps F0 steps 5 4 3 2 1 1 2 3 4 5 6 7 VOT steps VOT steps F0 steps θ s VOT steps F0 steps θ s
Steep s slope Shallow s slope
gradient categorical
0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step 1 2 F0 = 90 hZ F0 = 125 hZ
20 40 60 80 100 1 2 3 4 5 6 7 b VAS response p VOT step NP NW RW 0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step Real words 0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step
Nonwords
0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step
CVs 90hZ 125hZ
20 40 60 80 100 1 2 3 4 5 6 7 b VAS response p VOT step alv lab
0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step Real words 0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step
Nonwords
0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step
CVs 90hZ 125hZ
20 40 60 80 100 1 2 3 4 5 6 7 b VAS response p VOT step NP NW RW 20 40 60 80 100 1 2 3 4 5 6 7 b VAS response p VOT step alv lab
F<1 F<1
F(1,250) = 27.8, p < 0.001
0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 b 2AFC response p VOT step
Labials
0.2 0.4 0.6 0.8 1 1 2 3 4 5 6 7 VOT step
Alveolars
1 5
90hZ 125hZ
1.
Do individual differences in gradiency derive from differences in general cognitive function?
1.
Do individual differences in gradiency derive from differences in general cognitive function?
EF measures did not account for a statistically significant amount of variance in VAS slope, F(3,108)=1.75, p=.162, or F0 use, F<0
gradient categorical gradient categorical gradient categorical
1.
Do individual differences in gradiency derive from differences in general cognitive function?
EF measures did not account for a statistically significant amount of variance in VAS slope, F(3,108)=1.75, p=.162, or F0 use, F<0 Speech perception processes may be played out on a different level of processing than higher cognitive processes, such as working memory
1.
Do individual differences in gradiency derive from differences in general cognitive function?
2.
Are individual differences in gradiency linked to multiple cue use?
1.
Do individual differences in gradiency derive from differences in general cognitive function?
2.
Are individual differences in gradiency linked to multiple cue use?
Positive relationship: Better encoding of fine-grained detail (more gradiency) enables access to multiple cues Negative relationship: Listeners who use more cues have more accurate, sharper boundaries
β=-0.305, t=-3.4, p < 0.01
gradient categorical
1.
Do individual differences in gradiency derive from differences in general cognitive function?
2.
Are individual differences in gradiency linked to multiple cue use?
Positive relationship: Better encoding of fine-grained detail (more gradiency) enables access to multiple cues Negative relationship: Listeners who use more cues have more accurate, sharper boundaries
1.
Do individual differences in gradiency derive from differences in general cognitive function?
2.
Are individual differences in gradiency linked to multiple cue use?
3.
In what way are these differences important for speech perception?
r = .164, p=.068
gradient categorical
r = .243, p=.007 r = .164, p=.068
gradient categorical
Gradiency Speech-in- noise Gradiency Speech-in- noise Working Memory 1) 2)
R2 = 0.019
β=-0.14, t=-1.48, p = .143
categorical gradient
Gradiency Speech-in- noise Gradiency Speech-in- noise Working Memory 1) 2)
1.
Do individual differences in gradiency derive from differences in general cognitive function?
2.
Are individual differences in gradiency linked to multiple cue use?
3.
In what way are these differences important for speech perception?
More gradient listeners tend to better perceive speech in noise
1.
Do individual differences in gradiency derive from differences in general cognitive function?
Probably not. Maybe speech perception operates on a different level than higher cognitive processes.
1.
Do individual differences in gradiency derive from differences in general cognitive function?
Probably not. Maybe speech perception operates on a different level than higher cognitive processes. 2.
Are individual differences in gradiency linked to multiple cue use?
Yes, more gradient listeners tend to rely more on the secondary cue (F0). Better encoding of fine-grained detail (more gradiency) enables access to multiple cues. And/or more gradient listeners commit later to a category.
1.
Do individual differences in gradiency derive from differences in general cognitive function?
Probably not. Maybe speech perception operates on a different level than higher cognitive processes. 2.
Are individual differences in gradiency linked to multiple cue use?
Yes, more gradient listeners tend to rely more on the secondary cue (F0). Better encoding of fine-grained detail (more gradiency) enables access to multiple cues. And/or more gradient listeners commit later to a category. 3.
In what way are these differences important for speech perception?
More gradient listeners do a bit better (marginally) in perceiving speech in noise. Gradiency is not all that bad - maybe good for some things.
1.
Gradiency indicates more accurate, true-to-the-signal perception.
1.
Gradiency indicates more accurate, true-to-the-signal perception.
2.
Some listeners are more gradient than others in categorizing phonemes.
1.
Gradiency indicates more accurate, true-to-the-signal perception.
2.
Some listeners are more gradient than others in categorizing phonemes.
3.
This gradiency may be a good thing.