Fundamental frequency (F0) Tn s c bn v n tp Three ways to find F0 ( - - PowerPoint PPT Presentation

fundamental frequency f0 t n s c b n v n t p
SMART_READER_LITE
LIVE PREVIEW

Fundamental frequency (F0) Tn s c bn v n tp Three ways to find F0 ( - - PowerPoint PPT Presentation

Fundamental frequency (F0) Tn s c bn v n tp Three ways to find F0 ( tn s c bn, cao ): From waveform; C s m v hc v ng m hc From (wide-band) spectrogram; From spectrum. Lecture


slide-1
SLIDE 1

Tần số cơ bản và ôn tập

Cơ sở âm vị học và ngữ âm học Lecture 16

Fundamental frequency (F0)

Three ways to find F0 (tần số cơ bản, độ cao):

  • From waveform;
  • From (wide-band) spectrogram;
  • From spectrum.

Fundamental frequency (F0)

Three ways to find F0 (tần số cơ bản, độ cao):

  • From waveform;
  • From (wide-band) spectrogram;
  • From spectrum.

Fundamental frequency (F0)

Three ways to find F0 (tần số cơ bản, độ cao):

  • From waveform;
  • From (wide-band) spectrogram;
  • From spectrum.
slide-2
SLIDE 2

F0 from waveform

  • F0 = 1000 / duration of a single period (in ms)

Intensity

  • 0.2

0.2 Time 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22 0.24 0.26 0.28 0.3 0.32

  • Chu kỳ lặp lại bao nhiều lần trên một giây?

F0 from waveform

  • F0 = 1000 / duration of a single period (in ms)

Intensity

  • 0.2

0.2 Time 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22 0.24 0.26 0.28 0.3 0.32

  • Chu kỳ lặp lại bao nhiều lần trên một giây?

F0 from wide-band spectrogram

  • F0 = 1000 / duration of interval between striations (in ms)

500 1000 1500 2000 2500 3000 3500 4000 4500 5000 0.05 0.1 0.15 0.2 0.25 0.3 Frequency (Hz) Time (s)

  • Một đường = một mạch dây thanh

F0 from wide-band spectrogram

  • F0 = 1000 / duration of interval between striations (in ms)

500 1000 1500 2000 2500 3000 3500 4000 4500 5000 0.05 0.1 0.15 0.2 0.25 0.3 Frequency (Hz) Time (s)

  • Một đường = một mạch dây thanh
slide-3
SLIDE 3

F0 from spectrum

  • Locate first harmonic (tần số cộng hưởng thứ nhất=F0)...

Frequency (Hz) 5000 Sound pressure level (dB/Hz) 20 40 500 1000 1500 2000 2500 3000 3500 4000 4500 5000

F0 from spectrum

  • ...or H10 (H10/10 = F0)
  • Harmonics (cộng hưởng) are always multiples of F0

(bội số của F0)

Frequency (Hz) 5000 Sound pressure level (dB/Hz) 20 40 500 1000 1500 2000 2500 3000 3500 4000 4500 5000

F0 from spectrum

  • ...or H10 (H10/10 = F0)
  • Harmonics (cộng hưởng) are always multiples of F0

(bội số của F0)

Frequency (Hz) 5000 Sound pressure level (dB/Hz) 20 40 500 1000 1500 2000 2500 3000 3500 4000 4500 5000

How do languages use F0?

  • At the lexical level, as tone (thanh điệu)
  • At the utterance level, as prosody (điệu tính) and

intonation (ngữ điệu)

  • All three uses can occur simultaneously and

(semi-)independently.

slide-4
SLIDE 4

How do languages use F0?

  • At the lexical level, as tone (thanh điệu)
  • At the utterance level, as prosody (điệu tính) and

intonation (ngữ điệu)

  • All three uses can occur simultaneously and

(semi-)independently.

How do languages use F0?

  • At the lexical level, as tone (thanh điệu)
  • At the utterance level, as prosody (điệu tính) and

intonation (ngữ điệu)

  • All three uses can occur simultaneously and

(semi-)independently.

Lexical tone (Vietnamese) Lexical tone (Matbat)

slide-5
SLIDE 5

Notes on F0

  • Most F0-extraction software (like Praat) essentially uses

the last method

  • But... software can make mistakes, e.g. octave jump

(thinking H2 is H1)

  • F0 trace is unreliable when speech is unvoiced/partially

devoiced or creaky (dấu nặng, dấu ngã...in Praat?)

Notes on F0

  • Most F0-extraction software (like Praat) essentially uses

the last method

  • But... software can make mistakes, e.g. octave jump

(thinking H2 is H1)

  • F0 trace is unreliable when speech is unvoiced/partially

devoiced or creaky (dấu nặng, dấu ngã...in Praat?)

Notes on F0

  • Most F0-extraction software (like Praat) essentially uses

the last method

  • But... software can make mistakes, e.g. octave jump

(thinking H2 is H1)

  • F0 trace is unreliable when speech is unvoiced/partially

devoiced or creaky (dấu nặng, dấu ngã...in Praat?)

Prosodic structure

  • In English, pitch accents (phrasal stress, trọng âm ngữ

đọan) are (partly) motivated by information structure (cấu trúc thông tin)

Time (s) 1.32816 Pitch (Hz) 350

HARRY’s going to Hawaii

50 100 150 200 250 300 350 50 100 150 200 250 300 350 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 Time (s) 1.23816 Pitch (Hz) 350

Harry’s going to HAWAII

50 100 150 200 250 300 350 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2
slide-6
SLIDE 6

Prosodic structure

  • In tone languages, such effects are (often) achieved

through pitch range expansion

From Remijsen, 2007

Revision

  • What acoustic properties of speech of the visual

representations we have covered show?

– Waveform – Spectrum – Spectrogram

  • What does each axis (trục) represent?
  • Which distinctive features of speech sounds can/can’t

we measure in each representation?

– Why are certain representations better/worse for different classes of speech sounds?

Revision

  • What acoustic properties of speech of the visual

representations we have covered show?

– Waveform – Spectrum – Spectrogram

  • What does each axis (trục) represent?
  • Which distinctive features of speech sounds can/can’t

we measure in each representation?

– Why are certain representations better/worse for different classes of speech sounds?

Revision

  • What acoustic properties of speech of the visual

representations we have covered show?

– Waveform – Spectrum – Spectrogram

  • What does each axis (trục) represent?
  • Which distinctive features of speech sounds can/can’t

we measure in each representation?

– Why are certain representations better/worse for different classes of speech sounds?

slide-7
SLIDE 7

A formant is not the same as a harmonic!

Formant = harmonic

  • Formants are properties of the vocal tract (đường dẫn

âm) - they are independent of pitch

  • Harmonics are multiples of F0 (tâôn số cơ bản) - if F0

changes, the harmonics change too

Formant = harmonic

  • Formants are properties of the vocal tract (đường dẫn

âm) - they are independent of pitch

  • Harmonics are multiples of F0 (tâôn số cơ bản) - if F0

changes, the harmonics change too

Spectrogram reading

  • Be able to identify broad classes of sounds

– nguyên âm, bán nguyên âm, âm xát, âm tắc...

  • Vô thanh? Hữu thanh?

– clues: striations (dải) and voicing bar

slide-8
SLIDE 8

Spectrogram reading

  • Be able to identify broad classes of sounds

– nguyên âm, bán nguyên âm, âm xát, âm tắc...

  • Vô thanh? Hữu thanh?

– clues: striations (dải) and voicing bar

Spectrogram reading: stops

http://www.cns.nyu.edu/ david/courses/perception/lecturenotes/speech/speech.html

Spectrogram reading: vowels

200 300 400 500 600 700 800 900 1000 1000 1500 2000 2500 3000 F2 (Hertz) i ! " æ # $ % u & &

  • Given F1 and F2, can you guess which vowel?
  • Given a vowel, can you say if F1/F2 are high or low?

Spectrogram reading: vowels

200 300 400 500 600 700 800 900 1000 1000 1500 2000 2500 3000 F2 (Hertz) i ! " æ # $ % u & &

  • Given F1 and F2, can you guess which vowel?
  • Given a vowel, can you say if F1/F2 are high or low?
slide-9
SLIDE 9

First half: general

  • 1. The spectrum shows:

a) amplitude over time b) amplitude over intensity c) frequency over time d) amplitude over frequency

  • 2. If a complex wave has three component waves with

fundamental frequencies (F0) of 60 Hz, 90 Hz, and 120 Hz, what is the fundamental frequency of the complex wave?

First half: general

  • 1. The spectrum shows:

a) amplitude over time b) amplitude over intensity c) frequency over time d) amplitude over frequency

  • 2. If a complex wave has three component waves with

fundamental frequencies (F0) of 60 Hz, 90 Hz, and 120 Hz, what is the fundamental frequency of the complex wave?

First half: general

  • 3. True or false: the vowel [a] is always pronounced with

the same pitch (F0).

  • 4. Which statement is correct?

a) In a waveform printout of an utterance, you can easily tell whether a vowel is an /o/ or /e/. b) In a waveform printout of an utterance, you can easily tell whether a sound is an /o/ or /f/. c) Both a) and b) are correct. d) Neither a) nor b) are correct.

First half: general

  • 3. True or false: the vowel [a] is always pronounced with

the same pitch (F0).

  • 4. Which statement is correct?

a) In a waveform printout of an utterance, you can easily tell whether a vowel is an /o/ or /e/. b) In a waveform printout of an utterance, you can easily tell whether a sound is an /o/ or /f/. c) Both a) and b) are correct. d) Neither a) nor b) are correct.

slide-10
SLIDE 10

Second half: Spectrogram reading

  • 1. Which segment is located in region 1?

a) [r] b) [l] c) [N] d) [p]

  • 2. Which segment is located in region 2?

a) [v] b) [k] c) [i] d) [d]

Second half: Spectrogram reading

  • 1. Which segment is located in region 1?

a) [r] b) [l] c) [N] d) [p]

  • 2. Which segment is located in region 2?

a) [v] b) [k] c) [i] d) [d]

Âm vị học

  • Trình bầy giống như kỳ thi giữa nhưng cũng có mốt vấn

đề về biểu diễn tầng sâu....