Nearey & Smits: Perception of VCCV
1
4aSC43 Patterns in the perception of VC(C)V Nearey and Smits - - PowerPoint PPT Presentation
4aSC43 Patterns in the perception of VC(C)V Nearey and Smits Nearey & Smits: Perception of VCCV 1 Patterns in the perception of VC(C)V strings Terrance M. Nearey University of Alberta Roel Smits Max Planck Institute for
Nearey & Smits: Perception of VCCV
1
Terrance M. Nearey
University of Alberta
Roel Smits
Max Planck Institute for Psycholinguistics
(Work supported by SSHRC & MPI )
Nearey & Smits: Perception of VCCV
3
synthetic VC(C)V where C = [+stop]
– Repp ran 3 sub experiments 1a, 1b and 1c – Our experiment had smaller total gap duration range than Repp, but larger than any single sub experiment – Our experiment had more spectral patterns than Repp
Nearey & Smits: Perception of VCCV
4
have been quite successful
– e.g. CV or /hVC/ syllables [3,4]
– Variable length strings in one stimulus set – VCV: aba, ada; VCCV : abda, abda, abba, adda – Apparently very complex perceptual results [1, 2]
– Can we extend the simple architecture of our models to handle variable string-length case?
Nearey & Smits: Perception of VCCV
5
100 200 300 400 500 600 1000 2000 3000 4000 5000 Stimulus # 78 of 144. Most extreme [abda] place cues
Nearey & Smits: Perception of VCCV
6
– F2 (F3) VC offset: 1060 (2180) to 1450 (2539) Hz in 6 steps – F2 (F3) CV onset: 1099 (2262) to 1635 (2500) Hz in 6 steps]
– Gap Duration: 80, 120, 190 and 300 ms; – Total 144 = 6 x 6 x 4 stimuli
– Each responded to10 repetitions of each of 144 stimuli – 13 native speakers of Canadian English – Response button layout:
[b] [bb] [bd] [d] [dd] [db]
Nearey & Smits: Perception of VCCV
7
– Singletons /aba/ or /ada/ – Clusters (heterorganic) /abda/ or /adba/ – Geminates /ab#ba/ or /ad#da/
– Closing place class (place of first or only stop)
– Opening place class (place of second or only stop)
Nearey & Smits: Perception of VCCV
8
– Short duration gaps favor singletons – Intermediate gaps favor heterorganic clusters – Very long gaps favor geminates – But single-place singleton and geminates always less likely when transition patterns clash see below.
Nearey & Smits: Perception of VCCV
9
– Low F2 F3 offset favors labial closers /b bb bd/ – High F2 F3 offset favors dental closers /d dd db/
– Low F2 F3 offset favors labial openers /b bb db/ – High F2 F3 offset favors dental closers /d dd bd/
Nearey & Smits: Perception of VCCV
10
Definition: Clash of transition patterns
– Closing VC_ transitions near the low /_b/ end but opening _CV near the high /_d/ end – or vice versa
– even fairly small clash tends favor clusters [bd], [db]
– singletons favored unless clash is quite large – singleton responses dominated by opening _CV cues
Nearey & Smits: Perception of VCCV
11
– Proactive (left to right): preceding VC1_ stimulus affects judgment of following _C2V – Retroactive (right to left); following _C2V stimulus affects judgment of preceding VC1_
– Assimilation (more one place responses: /d, dd, b, bb/) – Dissimilation (more two place: /db, bd/)
Nearey & Smits: Perception of VCCV
12
– Retroactive effects larger than proactive – Strong assimilation prevalent for shorter gap durations – Some dissimilation present for longer gap duration – Longest gap duration shows little effect
– Trends (using Repp’s measure) are similar [see notes].
Nearey & Smits: Perception of VCCV
13
– Label them ‘CC’
CC1 = b, CC2 = d, CC3 = bb, CC4 = dd, CC5 = bd, CC6 = db/ * /d/ and /b/ are ‘degenerate’ diphones
1) Xclose – VC (syl. 1) closure transition step 2) Xopen – CV (syl. 2) opening transition step 3) Xdur – gap duration
[note: duration coded as sqrt(ms)]
Nearey & Smits: Perception of VCCV
14
1) RMS = 5.96%
2) Percent modal agreement PMA = 93.75%
most votes corresponds to predicted response category with highest predicted score.
Nearey & Smits: Perception of VCCV
15
separately
– Basis for simple repeated measures comparisons
weight patterns for each cue
integrated
Nearey & Smits: Perception of VCCV
16
Baseline model CC * Xclose d dd db b bb bd
0.5 1
Baseline Model: CC * Xclose C
f f i c i e n t ResponseCategory
Matlab Boxplots. “Belt” is median, notch width robust 95% confidence estimate.
Nearey & Smits: Perception of VCCV
17
transition patterns] show very strong clustering
– Labial closers /b, bb, bd/ show low values – Dental closers /d, dd, db/ show high high values [Statistics in appendix panels]
negligible effect on how Xclose tunes likelihood
matters.
Nearey & Smits: Perception of VCCV
18
Baseline model CC * Xopen d dd db b bb bd
0.5 1 1.5
Baseline Model: CC * Xopen C
f f i c i e n t ResponseCategory
Matlab Boxplots. “Belt” is median, notch width robust 95% confidence estimate.
Nearey & Smits: Perception of VCCV
19
transition patterns] show very strong clustering
– Labial openers /b, bb, db/ show low values – Dental openers /d, dd, bd/ show high high values [Statistics in appendix panels]
minimal effect on how Xopen tunes likelihood of
Nearey & Smits: Perception of VCCV
20
Baseline model CC * Xdur d dd db b bb bd
0.2 0.4 0.6
Baseline Model: CC * Xdur C
f f i c i e n t ResponseCategory
Nearey & Smits: Perception of VCCV
21
strong clustering
– Singletons /b, d/ show low coefficients for Xdur – Heterorganic clusters /db, bd/ show moderately high coefficients – Geminates /bb, dd/ show highest coefficients
within duration class
[See appendix panels for statistics]
Nearey & Smits: Perception of VCCV
22
extremely simple tuning of response by ‘local’ stimulus properties
model
– Extension of methods of Nearey 1990
Nearey & Smits: Perception of VCCV
23
Factoring characteristics of CC in baseline model
– Pdur: tripartite category of phonological duration classes
– Pclose:
– Popen:
Nearey & Smits: Perception of VCCV
24
for different stimuli
– (e.g.) replace CC*Xdur with Pdur * Xdur – Reduces coefficients from 6 to 3, (and df from 5 to 2)
Duration Class, then this should give similar fit with fewer coefficients
Nearey & Smits: Perception of VCCV
25
9 20 Model df 93.75 5.94 2514.1 711 Factored 93.75 5.96 2403.1 700 Baseline pma rms G2 Resid. df Model
Better G2 fit of larger baseline model is not reliable according to bootstrap model comparison (train on 5 Ss, test on 13, repeat random splits 200 x)
Nearey & Smits: Perception of VCCV
26
functions of the sets of coefficients of categories j and k and stimulus properties X.
and dissimilation effects come from
space, they ‘fall out’ of the baseline model
Nearey & Smits: Perception of VCCV
27
Observed assimilation indices
(indices calculated per Repp 83, data from current experiment)
5 10 15 20 25 30 35 80 ms 120 ms 190 ms 300 ms Retroactive Proactive
Nearey & Smits: Perception of VCCV
28
5 10 15 20 25 30 80 ms 120 ms 190 ms 300 ms Retroactive Proactive
Nearey & Smits: Perception of VCCV
29
– Need to break phoneme into parts – Not all parts show up in all contexts
– Closing (implosion) : [ >b] – Closure (hold): [$] – Opening (explosion): [B<] – Complete pattern [ >b $ B< ]
Nearey & Smits: Perception of VCCV
30
[see notes pages for more]
[a]—[<d]—[$]—[D>]—[a]
[a]—[<b]—[$]—[$]—[D>]—[a]
[a]—[<b]—[$]—[#]—[$]—[B>]—[a]
Nearey & Smits: Perception of VCCV
31
shows some may work reliably better
Smits
– Plotted proportion of one versus two consonants against
– Simple measure of formant offset/onset clash Xdiscon=abs(ClosingF2F3StepNo-OpeningF2F3StepNo)
consonant responses more likely
Nearey & Smits: Perception of VCCV
32
to baseline model [see notes]
class ( Pdur * Xdiscon) is nearly as good
– Improvement appears to be primarily due to prediction
discontinuity is large [Puzzling fact is that direction of discontinuity doesn’t matter – Further research contemplated
Nearey & Smits: Perception of VCCV
33
works well
accounted for
better
– May be more difficult to interpret
Nearey & Smits: Perception of VCCV
34
[1] Repp, B. H. (1983). Bidirectional contrast effects in the perception of VC- CV sequences. Perception and Psychophysics, 33(2), 147-155. [2] Repp, B. (1978). Perceptual integration and differentiation of spectral cues for intervocalic stop consonants. Perception and Psychophysics, 24(5), 471-485. [3] Nearey, T. (1997). Speech perception as pattern recognition. J. Acoust.
[4] Smits, R. (2001). Evidence for hierarchical categorization of coarticulated phonemes, J. Exp. Psych.: HPP. 27, 111-135 [5] Davison, A. C., & Hinkley, D. V. (1997). Bootstrap methods and their application. Cambridge: Cambridge University Press. [6] Shao, J. (1996). Bootstrap model selection. J. Amer. Statist. Assoc., 96(434), 655-665. ==========================================================
Nearey & Smits: Perception of VCCV
35
– Consider only stimuli with most extreme /d/- like _C2V pattern (step 6 of opening F2 F3)
– Consider only stimuli with most extreme /b/- like _C2V pattern (step 1 of opening F2 F3)
– Repp’s patterns change for three sub experiments 1a, 1b and 1c, covering different gap duration ranges.
– Repp’s original formulation of assimilation appears to have typos, switching -/ba/ and -/da/ on p 149.
Nearey & Smits: Perception of VCCV
36
Comparison of empirical assimilation indices of Repp 83 experiment 1b and current experiment
80 ms 120 ms 160 ms 190 ms 300 ms Retro-R Proactive Retro-ns Pro-ns
Nearey & Smits: Perception of VCCV
37
93.75 5.94 2514.1 711 Factored 96.53 5.69 2007.1 707 Factored+ Xdiscon 95.83 5.44 1918 695 Baseline+ Xdiscon 93.75 5.96 2403.1 700 Baseline pma rms G2 Error df Model
Nearey & Smits: Perception of VCCV
38
1 Factored + Xdiscon 2 Baseline + Xdiscon; 3 Factored ; 4 Baseline
1 Baseline + Xdiscon; 2 Factored + Xdiscon; 3 Factored; 4 Baseline
– Train on 5 samples; Test on 13 samples; Repeat 200 times (see [5,6]) – Random samples of entire subject data sets with replacement (13 subjects total)
Nearey & Smits: Perception of VCCV
39
Expansion of shorthand notation for baseline model
– Expansion: bj for j = 1 to 6 (ranging over 6 categories /d, dd, db, b, bb,bd/)
– Let X1i X2i, X3i respectively represent its F2-F3 offset (closing transition) step, F3F3 onset (opening transition) and gap duration for the i-th stimulus
– Expansion: a1j X1i for j = 1 to 6; Restriction S(a1j ) = 0,
– Expansion: a2j X2i for j = 1 to 6; Restriction S(a2j ) = 0,
– Expansion: a3j X3i for j = 1 to 6; Restriction S(a3j ) = 0, Thus each set of terms involves 6 coefficients with 5 df each
Nearey & Smits: Perception of VCCV
40
Statistics on Baseline models I
comparisons via paired difference t-tests; Fischer LSD and Sidak criteria for multiple comparisons (MC) applied. – Fisher, declare significance at pt <= .05 test-wise criterion, where pt is nominal alpha level for two-tailed t-test with 12 df (based 3 subjects). – Sidak , declare significance ps = 1-(1-pt)k where k = 15 pairwise comparisons of 6 coefficients shown in boxplots
Xopen and Xdur. Significant differences between classes, non-significant within classes.
shows /b/ < /bd/ and /b/ < /bb/
Nearey & Smits: Perception of VCCV
41
Phoneme parts in context
– 1) $ B< in env. {#, C}___
– 2) >b $ in env ___{#,C}
transitions
transitions
Nearey & Smits: Perception of VCCV
42
Grammar for network construction
1) S ‡ {VCV, VCCV, VC#CV} 2) V ‡ [a] 3) C ‡ {/b/, /d/} 4) # ‡ [#] [>b] [$] [B<] / V_V 5) /b/ ‡ [>b] ($) / __(#)C [$] [B<] / C(#)__ [>d] [$] [D<] / V_V 6) /d/ ‡ [>d] ($) / __(#)C [$] [D<] / C(#)__ Note: [x] indicates x is terminal symbol and element in network