LINGUISTIC TIMBRE: FORMANTS YU / LAMONT FEBRUARY 27 & MARCH 1, - - PowerPoint PPT Presentation

linguistic timbre formants
SMART_READER_LITE
LIVE PREVIEW

LINGUISTIC TIMBRE: FORMANTS YU / LAMONT FEBRUARY 27 & MARCH 1, - - PowerPoint PPT Presentation

LINGUIST 197M, SPRING 2018. CLASS 6.1-6.2 LINGUISTIC TIMBRE: FORMANTS YU / LAMONT FEBRUARY 27 & MARCH 1, 2018 LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2 2 PARSING WORDS FROM CONTINUOUS SPEECH LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2 3


slide-1
SLIDE 1

LINGUISTIC TIMBRE: FORMANTS

YU / LAMONT FEBRUARY 27 & MARCH 1, 2018

LINGUIST 197M, SPRING 2018. CLASS 6.1-6.2

slide-2
SLIDE 2

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

PARSING WORDS FROM CONTINUOUS SPEECH

2

slide-3
SLIDE 3

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

VOWEL MERGERS IN SPECTROGRAMS

3

slide-4
SLIDE 4

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

RATE OF PHONEMES PER SECOND

4

slide-5
SLIDE 5

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SPEECH INFORMATION RATE ARTICLE

5

http://citeseerx.ist.psu.edu/viewdoc/download? doi=10.1.1.433.3226&rep=rep1&type=pdf

slide-6
SLIDE 6

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

BRIEF ARTICLE SUMMARY: BLOG POST

6

http://rosettaproject.org/blog/02012/mar/1/language-speed-vs-density/

slide-7
SLIDE 7

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

HOW MANY PHONEMES A LANGUAGE USES


7

Phoible: http://phoible.org/ UPSID: http://web.phonetik.uni-frankfurt.de/upsid_info.html LAPSyD: http://www.lapsyd.ddl.ish-lyon.cnrs.fr/lapsyd/index.php

slide-8
SLIDE 8

8

THE VOCAL TRACT AND TIMBRE: VOWELS

slide-9
SLIDE 9

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

MRI VIDEOS OF VOWEL PRODUCTIONS!

9

http://sail.usc.edu/span/rtmri_ipa/pk_2015.html

back open unrounded vowel front close unrounded vowel

slide-10
SLIDE 10

LINGUIST 197M, SPRING 2018. CLASS 5.2

REAL-TIME OSCILLOSCOPE

10

http://academo.org/demos/virtual-oscilloscope/

slide-11
SLIDE 11

LINGUIST 197M, SPRING 2018. CLASS 5.2

REAL-TIME SPECTRUM ANALYZER II

11

http://musiclab.chromeexperiments.com/Spectrogram

slide-12
SLIDE 12

LINGUIST 197M, SPRING 2018. CLASS 5.2

REAL-TIME SPECTRUM ANALYZER

12

http://academo.org/demos/spectrum-analyzer/

slide-13
SLIDE 13

13

WAVEFORMS, SPECTRA, AND SPECTROGRAMS

slide-14
SLIDE 14

LINGUIST 197M, SPRING 2018. CLASS 5.1

SYNTHESIZE A VOCALIC SOUND…

14

slide-15
SLIDE 15

LINGUIST 197M, SPRING 2018. CLASS 5.1

REVIEW: FIND PERIOD FROM THE WAVEFORM

15

What are the x- and y-axes in the waveform? What’s that blue line?

slide-16
SLIDE 16

LINGUIST 197M, SPRING 2018. CLASS 5.1

REVIEW: TAKE SPECTRAL SLICE (SPECTRUM)

16

Make sure you select a window of at least 5 cycles of the waveform to take the spectral slice over!

slide-17
SLIDE 17

LINGUIST 197M, SPRING 2018. CLASS 5.1

REVIEW: UNDERSTANDING THE SPECTRUM

17

Make sure to zoom in on the spectrum! What are the axes? How far apart are two adjacent spectral peaks and why?

slide-18
SLIDE 18

LINGUIST 197M, SPRING 2018. CLASS 5.1

NEW: THE SPECTROGRAM!

18

slide-19
SLIDE 19

LINGUIST 197M, SPRING 2018. CLASS 5.1

NEW: THE SPECTROGRAM!

19

slide-20
SLIDE 20

20

FROM THE SPECTRUM TO THE SPECTROGRAM

slide-21
SLIDE 21 0.0 0.5 1.0 1.5 2.0
  • 1.0
0.5 t f = 1 Hz 0.0 0.5 1.0 1.5 2.0
  • 1.0
0.5 t f = 2 Hz 0.0 0.5 1.0 1.5 2.0
  • 1.0
0.5 t f = 3 Hz 0.0 0.5 1.0 1.5 2.0
  • 1.0
0.5 t f = 4 Hz

f = 1 Hz f = 2 Hz f = 3 Hz f = 4 Hz

slide-22
SLIDE 22

LINGUIST 197M, SPRING 2018. CLASS 5.1

SPECTRUM OF COMPLEX WAVEFORM

22

0.0 0.5 1.0 1.5 2.0 5 10 15 20 t wave.complex

1 2 3 4 Amplitude (dB) 1 Frequency (Hz) Amplitude (dB) Time (s)

slide-23
SLIDE 23

LINGUIST 197M, SPRING 2018. CLASS 5.1

SPECTRUM OF COMPLEX WAVEFORM

23

0.0 0.5 1.0 1.5 2.0 5 10 15 20 t wave.complex

1 2 3 4 Amplitude (dB) 1 Frequency (Hz) Amplitude (dB) Time (s)

Waveform: x-axis is time Spectrum: x-axis is frequency

slide-24
SLIDE 24

LINGUIST 197M, SPRING 2018. CLASS 5.1

FROM SPECTRUM TO SPECTROGRAM: STEP 1

24

1 2 3 4 Amplitude (dB) 4 3 2 1 Freq (Hz) 1 2 3 4 Amplitude (dB) 4 3 2 1 Freq (Hz) Rotate clockwise

slide-25
SLIDE 25

LINGUIST 197M, SPRING 2018. CLASS 5.1

FROM SPECTRUM TO SPECTROGRAM: STEP 2

25

1 2 3 4 Frequency (Hz) 1 2 3 4 Amplitude (dB) 1 2 3 4 Amplitude (dB) 4 3 2 1 Freq (Hz) Flip vertically

slide-26
SLIDE 26

LINGUIST 197M, SPRING 2018. CLASS 5.1

FROM SPECTRUM TO SPECTROGRAM: STEP 3

26

1 2 3 4 Frequency (Hz) 1 2 3 4 Amplitude (dB) 1 2 3 4 Encode amplitude levels (the height/ length of the blue spikes) in grayscale* *or with heat map

slide-27
SLIDE 27

LINGUIST 197M, SPRING 2018. CLASS 5.1

FROM SPECTRUM TO SPECTROGRAM: STEP 4

27

1 2 3 4 Frequency (Hz) 1 2 3 4 Frequency (Hz) 1 2 3 4 Amplitude (dB) Encode amplitudes in grayscale… …at time t1

slide-28
SLIDE 28

LINGUIST 197M, SPRING 2018. CLASS 5.1

FROM SPECTRUM TO SPECTROGRAM: STEP 5

28

1 2 3 4 Frequency (Hz) Line up spectral slices taken over time… t1 t2 t3 t4 t5 t6

slide-29
SLIDE 29

LINGUIST 197M, SPRING 2018. CLASS 5.1

FROM SPECTRUM TO SPECTROGRAM: FINIS

29

…and you have a spectrogram

Time (s) 0.8772 1.128 5000 Frequency (Hz)

slide-30
SLIDE 30

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

WIDE VS. NARROW-BAND SPECTROGRAMS

30

Wide: high temporal resolution, low frequency resolution Narrow: low temporal resolution, high frequency resolution

WIDE NARROW

slide-31
SLIDE 31

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

NARROW-BAND SPECTROGRAM

31

Narrow: low temporal resolution, high frequency resolution

slide-32
SLIDE 32

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

WIDE-BAND SPECTROGRAM

32

Wide: high temporal resolution, low frequency resolution

slide-33
SLIDE 33

33

SOURCE-FILTER THEORY

slide-34
SLIDE 34

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SOURCE-FILTER THEORY

34

▸ Input = (Voice) source ▸ Result: harmonics of voice source ▸ Filter = Vocal tract ▸ Result: harmonic amplitudes get modulated ▸ Output = Speech ▸ Result: combined effects of source and filter

slide-35
SLIDE 35

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

VOCAL TRACT

35

Ladefoged and Johnson (2010), p. 5

From glottis to lips!

slide-36
SLIDE 36

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SOURCE-FILTER THEORY: A DIAGRAM

36

Vocal tract

http://www.phon.ucl.ac.uk/courses/plin/plin2108/week5.php

slide-37
SLIDE 37

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

DIFFERENT SPEECH SOUNDS HAVE DIFFERENT VOCAL TRACT CONFIGURATIONS

37

http://www.phon.ucl.ac.uk/courses/plin/plin2108/week5.php

Same source Different vocal tract configurations

slide-38
SLIDE 38

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SOURCE-FILTER THEORY: A SCHEMATIC

38

source filter

  • utput

Fucci and Lass

waveform spectrum

slide-39
SLIDE 39

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SOURCE-FILTER THEORY: A SCHEMATIC

39

source filter

  • utput

Fucci and Lass

waveform spectrum

slide-40
SLIDE 40

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

OUTPUT SPECTRUM

40

  • Output = Speech
  • Result: combined effects of source and filter

1st harmonic is f0 Frequency of nth harmonic = n * f0

40

slide-41
SLIDE 41

41

THE SOURCE: THE VIBRATING LARYNX

slide-42
SLIDE 42

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SOURCE-FILTER THEORY: A SCHEMATIC

42

source filter

  • utput

Fucci and Lass

waveform spectrum

slide-43
SLIDE 43

AMPLITUDE OF HARMONICS IN SOURCE

▸ Can get by recording at the larynx or inverse filtering of

speech

▸ Amplitudes of source harmonics generally decrease as

frequency goes up (about 3 dB fall per octave)

▸ Rate of decrease depends on phonation quality (creaky,

breathy, etc.)

43

slide-44
SLIDE 44

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

ELECTROGLOTTOGRAPHY: RECORDING AT THE LARYNX

44

http://www.linguistics.ucla.edu/faciliti/facilities/physiology/EGG.htm

slide-45
SLIDE 45

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

ELECTROGLOTTOGRAPHY: RECORDING AT THE LARYNX

45

http://sail.usc.edu/~lgoldste/General_Phonetics/Source_Filter_Demo/index.html

Glottal waveform Vocal tract shape for [i]

Filtered waveform: [i]

Glottal spectrum

Filtered spectrum: [i]

Vocal tract transfer function

slide-46
SLIDE 46

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

ELECTROGLOTTOGRAPHY: RECORDING AT THE LARYNX

46

http://sail.usc.edu/~lgoldste/General_Phonetics/Source_Filter_Demo/index.html

Mic

slide-47
SLIDE 47

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

INVERSE FILTERED SPECTRUM EXAMPLE

47

slide-48
SLIDE 48

48

THE FILTER: THE VOCAL TRACT

slide-49
SLIDE 49

THE VOCAL TRACT AS A FILTER

▸ Configuration of vocal tract acts on amplitude of harmonics

from voice source

▸ No new harmonics are added nor or their frequencies

changed!

▸ Some harmonics get stronger, some get weaker ▸ Particular vocal tract configuration has particular resonance

frequencies (formants); if these are close to the frequencies

  • f some harmonics, those harmonics get strengthened

49

slide-50
SLIDE 50

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

SYMPATHETIC VIBRATION: TUNING FORKS

50 https://youtu.be/zWKiWaiM3Pw

Other fun resonance videos at: http://blog.prosig.com/2011/09/20/5-videos-that-explain- resonance/

slide-51
SLIDE 51

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

EXAMPLE: RESONANCE

51

B resonates to A if B’s vibrations make A vibrate too. Effect: B has more energy than it did The closer B’s natural frequency is to A’s the stronger the vibrations of B

51

slide-52
SLIDE 52

RESONANCES FOR DIFFERENT VOCAL TRACT CONFIGURATIONS

▸ Resonances depend on size and shape of airway ▸ Can be approximated as multitube models, with connected

Helmholtz resonators

▸ Helmholtz resonances are the formants ▸ See article by Sandberg on The Acoustics of the Singing Voice

for further reading

52

http://faculty.washington.edu/losterho/Sundberg.pdf

slide-53
SLIDE 53

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

MRI VIDEOS OF VOWEL PRODUCTIONS!

53

http://sail.usc.edu/span/rtmri_ipa/pk_2015.html

back open unrounded vowel front close unrounded vowel

slide-54
SLIDE 54

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

HELMHOLTZ RESONATORS AND FORMANTS

54

http://www.exploratorium.edu/exhibits/vocal_vowels/vocal_vowels.html

duck call sound source: vibrating reed

slide-55
SLIDE 55

55

SOME POINTS OF CONFUSION

slide-56
SLIDE 56

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

POINT OF CONFUSION: VOCAL FOLD VS. VOCAL TRACT LENGTH

56

  • Vocal tract length has a systematic effect on

formant frequencies (vocal tract resonances)

  • As vocal tract length increases, formant

frequencies go down: inverse relation Who has higher formant frequencies? A baby or an adult?

slide-57
SLIDE 57

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

POINT OF CONFUSION: VOCAL FOLD VS. VOCAL TRACT LENGTH

57

  • Vocal fold length has a systematic effect on

fundamental frequency (property of voice source)

  • Natural vocal fold length accounts for some

portion of individual differences in fundamental frequencies, i.e. differences in f0 between individuals

http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/influence.html

slide-58
SLIDE 58

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

POINT OF CONFUSION: VOCAL FOLD VS. VOCAL TRACT LENGTH

58

http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/influence.html

slide-59
SLIDE 59

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

POINT OF CONFUSION: VOCAL FOLD VS. VOCAL TRACT LENGTH

59

http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/influence.html

On average, men have 60% longer effective vocal fold length than women. What does this tell you about f0 on average, compared for men vs. women?

59

slide-60
SLIDE 60

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

POINT OF CONFUSION: FORMANTS VS. F0

60

Formants: vocal tract resonances that “filter” harmonic amplitudes from the voice source: we indirectly see what the formants are from their effects on the harmonics in speech Harmonics boosted around F1, F2, and F3

formants F1, F2, F3

slide-61
SLIDE 61

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

POINT OF CONFUSION: FORMANTS VS. F0

61

Fundamental frequency: a property of the voice source, the rate of vocal fold vibration, the lowest harmonic (the “first” harmonic), also the spacing between harmonics f0 not affected by vocal tract configuration*: spacing between harmonics unaffected by vocal tract filtering

*to a first approximation

61

slide-62
SLIDE 62

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

EXERCISE: INDEPENDENCE OF F0, FORMANTS

62

▸ Demonstrate that you can keep f0 constant, while

changing formants

▸ Demonstrate that you can keep formants constant, while

changing f0

slide-63
SLIDE 63

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

HANDY FORMANT CHART!

63

http://www.facebook.com/groups/ling5

https://blogs.umass.edu/ihauser/

Ivy Hauser

slide-64
SLIDE 64

64

TUBE RESONANCE IN THE VOCAL TRACT

slide-65
SLIDE 65

LINGUIST 197M, SPRING 2018. CLASS 5.2

TUBE RESONANCE: OPEN AT ONE END

65

http://philschatz.com/physics-book/contents/m42296.html#import-auto-id1379919

Open at one end Open at both ends

slide-66
SLIDE 66

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

NATURAL RESONANCES FOR SCHWA

66

http://rstb.royalsocietypublishing.org/content/363/1493/965

slide-67
SLIDE 67

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

ESTIMATING YOUR VOCAL TRACT LENGTH!

67

http://sail.usc.edu/~lgoldste/General_Phonetics/Source_Filter/SFb.html#VTL

slide-68
SLIDE 68

LINGUIST 197M, SPRING 2018. CLASS 6.1, 6.2

ESTIMATING YOUR VOCAL TRACT LENGTH!

68

▸ Record yourself producing a schwa-type vowel /ə/, and while continuing to

phonate, slowly raise the jaw a bit to a higher vowel, then lower again to

  • schwa. Now glide smoothly to an /ɛ/-type vowel (as in "head'), and back to
  • schwa. Save this recorded file as schwa_YOURINITIALS.wav, e.g.,

schwa_KY.wav

▸ Create a textgrid. Examine the spectrogram of your recording, and select a

moment in time for labeling where the formants appear to be fairly equally spaced in frequency. Measure the values of F1-F3 as in Part I and record their values in the textgrid. Calculate the F2-F1 and F3-F2 at this point. Take the average of these as the inter-formant distance. Save this TextGrid as schwa_YOURINITIALS.TextGrid, e.g., schwa_KY.TextGrid.

▸ Upload both files to this folder: https://drive.google.com/open?

id=1icPQn8vZ214lV70i8BJI_-YeAvucSYUG (you may need to be signed into your UMass account to access the folder)

http://sail.usc.edu/~lgoldste/General_Phonetics/Week10/Formant_Analysis/index.html