Foundations of Language Science and Technology Acoustic Phonetics 2: - - PowerPoint PPT Presentation

foundations of language science and technology acoustic
SMART_READER_LITE
LIVE PREVIEW

Foundations of Language Science and Technology Acoustic Phonetics 2: - - PowerPoint PPT Presentation

Foundations of Language Science and Technology Acoustic Phonetics 2: Speech signals and waveforms Jan 21, 2015 Bernd Mbius FR 4.7, Phonetics Saarland University Acoustic communication Prerequisites for acoustically based speech


slide-1
SLIDE 1

Acoustic Phonetics 2: Speech signals and waveforms

Jan 21, 2015

Bernd Möbius

FR 4.7, Phonetics Saarland University

Foundations of Language Science and Technology

slide-2
SLIDE 2

Acoustic communication

Prerequisites for acoustically based speech communication sound production sound perception sound propagating medium Basic acoustic properties of speech sounds frequencies within the range of human auditory perception (20 – 20,000 Hz) amplitude: displacement of an oscillation perceived loudness duration: perceptible minimum duration; duration of units of speech (timbre)

slide-3
SLIDE 3

Speech sounds and speech signals

Basic types of speech signals quasi-periodic signals: sonority vowels sonorants (approximants, glides, nasals, liquids) stochastic signals: frication noise fricatives plosive aspirations transient signals – impulse plosive releases mixed excitation – voiced frication noise voiced fricatives

slide-4
SLIDE 4

Speech sounds and speech signals

"Heute ist schönes Frühlingswetter."

slide-5
SLIDE 5

Speech sounds and speech signals: vowels

"Heute ist schönes Frühlingswetter."

slide-6
SLIDE 6

Speech sounds and speech signals: sonorants

"Heute ist schönes Frühlingswetter."

slide-7
SLIDE 7

Speech sounds and speech signals: fricatives

"Heute is schönes Frühlingswetter."

slide-8
SLIDE 8

Speech sounds and speech signals: plosives

"Heute is(t) schönes Frühlingswetter."

slide-9
SLIDE 9

Speech sounds...: voiced fricatives

"Heute ist schönes Frühlingswetter."

voiced?

slide-10
SLIDE 10

Simple waveforms

slide-11
SLIDE 11

Simple waveforms

Simple periodic oscillation: pure sine wave cyclically recurring, simple oscillation pattern, determined by fundamental period T0 amplitude A phase Fundamental frequency [Hz]: 1 / fundamental period [s] F0 = 1 / T0

slide-12
SLIDE 12

Simple waveforms

Phase relation two sine waves of same frequency and amplitude, but temporally displaced maxima, minima, and zero crossings phase shift (here: angle 90º)

slide-13
SLIDE 13

Simple waveforms

Frequency differences two sine waves of same amplitude and phase, but different frequency (here: 1 vs. 2 Hz)

slide-14
SLIDE 14

Complex waveforms

Complex periodic signals cyclically recurring oscillation patterns composed of at least two sine waves fundamental frequency = 1 / complex fundamental period Form of resulting complex wave depends on frequency, amplitude and phase relations between component waves

slide-15
SLIDE 15

Complex waveforms

Complex waveform: 2 components two sine waves (100 Hz, 1000 Hz) with same phase and different amplitude (left) complex wave (right) resulting from addition of the two components F0 = 100 Hz

slide-16
SLIDE 16

Complex waveforms

Complex waveform (red): 5 components five sine waves (100, 200, 300, 400, 500 Hz) with same phase

  • nly 3 lowest frequency components displayed
slide-17
SLIDE 17

Complex waveforms

Complex waveform (red): 5 components five sine waves (100, 200, 300, 400, 500 Hz) with phase shifts

  • nly 3 lowest frequency components displayed
slide-18
SLIDE 18

Power spectrum

Power spectrum (amplitude over frequencies) of the complex waveform composed of five components (see above)

slide-19
SLIDE 19

Fourier analysis

Fourier analysis: power spectrum of 5 component wave (see above) Fourier's theorem every complex wave can be analytically decomposed into a series of sine waves, each with a specific set of frequency, amplitude and phase values

slide-20
SLIDE 20

Fourier analysis and power spectrum

Differences between result of Fourier analysis and idealized power spectrum (see above): broader peaks additional peaks Reasons for these differences: Fourier analysis assumes infinitely long signal, whereas analysis is performed over 2 fundamental periods (quasi-periodic signal) analog vs. digital signal representation

slide-21
SLIDE 21

Discrete Fourier Transform

Discrete Fourier analysis (Discrete Fourier Transform, DFT) digital Fourier analysis of complex signals, yielding a spectrum of sine wave components transformation of data from time domain into frequency data resolution parameters sampling rate (e.g. 16000 Hz) window size (length; e.g. 512 samples) granularity of computed spectrum ca. 31 Hz (16000/512=31.25), with linear interpolation trading relation (uncertainty principle) good frequency resolution poor time resolution good time resolution poor frequency resolution

slide-22
SLIDE 22

Spectrogram

Analysis window size/length: short temporal window : good time resolution long temporal window: good frequency resolution Types of spectrograms: narrow band spectrogram (e.g. 50 Hz): good frequency resolution wide band spectrogram (e.g. 300 Hz): good temporal resolution

slide-23
SLIDE 23

From spectrum to spectrogram

Power spectrum: snapshot taken at a specific instant of time in the speech signal Spectrogram: time as 3rd dimension (beside frequency and amplitude) x-axis: time [s] y-axis: frequency [Hz] "z-axis": amplitude [dB] (gray-scale or color coding) Let's go use Praat for further interactive demos... (exercise session on Friday!)

slide-24
SLIDE 24

Thanks!