Speech Generation and Perception 1 Speech Generation and - - PowerPoint PPT Presentation

speech generation
SMART_READER_LITE
LIVE PREVIEW

Speech Generation and Perception 1 Speech Generation and - - PowerPoint PPT Presentation

Speech Generation and Perception 1 Speech Generation and Perception : The study of the anatomy of the organs of speech is required as a background for articulatory and acoustic phonetics. An understanding of hearing and perception is


slide-1
SLIDE 1

1

Speech Generation and Perception

slide-2
SLIDE 2

2

Speech Generation and Perception :

 The study of the anatomy of the organs of

speech is required as a background for articulatory and acoustic phonetics.

 An understanding of hearing and

perception is needed in the field of both speech synthesis and speech enhancement and is useful in the field of automatic speech recognition.

slide-3
SLIDE 3

3

Schematic diagram of the human speech production :

slide-4
SLIDE 4

4

Organs of Speech :

 Lungs and trachea :

 source of air during speech.  The vocal organs work by using compressed air; this

is supplied by the lungs and delivered to the system by way of the trachea.

 These organs also control the loudness of the

resulting speech.

 The trachea and lungs together constitute the

pulmonary tract.

slide-5
SLIDE 5

5

Organs of Speech :

 The Larynx :

This is a complicated system of cartilages and

muscle containing and controlling the vocal

  • cords. Principle parts are :

 Cricoid cartilage  Thyroid cartilage  Arytenoid cartilage  Vocal cords

The place where the vocal folds come

together is called the glottis.

slide-6
SLIDE 6

6

Glottal closure during voicing

folds folds Vocal folds During breathing

slide-7
SLIDE 7

7

Organs of Speech :

 The Vocal Tract :

 Laryngeal pharynx

 beneath epiglottis

 Oral pharynx

 behind tongue, between epiglottis and velum

 Nasal pharynx

 Above velum, rear end of nasal cavity

 Oral cavity

 Forward of the velum and bounded by lips, tongue and palate

 Nasal cavity

 Above the palate and extending from the pharynx to the

nostrils

slide-8
SLIDE 8

8

Vocal Tract

slide-9
SLIDE 9

9

Vocal Tract Model

slide-10
SLIDE 10

10

A General Discrete-Time Model For Speech Production

slide-11
SLIDE 11

11

Magnitude Spectrum Of One Pulse Of The Volume Velocity At The Glottis

slide-12
SLIDE 12

12

Position Of The Vocal Cords And Cartilages (a) For Phonation (b) For Whispering

slide-13
SLIDE 13

13

slide-14
SLIDE 14

14

Speech Production :

 The operation of the system is divided into

two functions :

Excitation Modulation

Excitation (glottis) Modulation (vocal tract) Radiate speech

slide-15
SLIDE 15

15

Vocal Vowels

AH EE EH OH OO Duck Call

slide-16
SLIDE 16

16

Speech Production :

 Excitation :is done in several ways

Phonation (making of a voiced sound)

 This is the oscillation of the vocal cords  The arytenoid cartilages close and stretch the

vocal cords

 When air forced through the vocal, they vibrate  The opening and closing of the cords breaks the

airstream up into pulses

slide-17
SLIDE 17

17

Speech Production :

 The repetition rate of the pulses is termed pitch.  At low levels of air pressure oscillation may

become irregular, this irregularities are known as “vocal fry”.

 Speech sounds accompanied by phonation are

called voiced; others, unvoiced or mute.

 Whispering (speak softly)

 The vocal cord are drawn together, but with small

triangular opening between arytenoid cartilages

slide-18
SLIDE 18

18

Speech Production :

Frication

 Frication can occur with or without phonation

Compression

 If the release is abrupt and clean, the sound is a

stop or plosive

 If gradual and turbulent, the sound can pass into

the related fricative and is termed an affricative

slide-19
SLIDE 19

19

Speech Production :

 Vibration

 If air is forced through a closure other than the vocal cords,

vibrations may be set up

 Modulation

 This is what we do to impose information on the

glottal output

 Articulatory phonetics: how the organs of speech are

positioned to produce any given speech sound

 Acoustic phonetics: what the measurable acoustical

correlates of any given speech sound are and how acoustical features in general correspond to phonetic and articulatory

  • nes
slide-20
SLIDE 20

20

Hearing and perception :

 Hearing is a process which sound is

received and convert into nerve impulse

 Perception is the post-processing within

the brain by which the sounds heard are interpreted and given meaning

slide-21
SLIDE 21

21

The structure of peripheral auditory system :

slide-22
SLIDE 22

22

Sectional View Of The Human Ear

slide-23
SLIDE 23

23

Hearing :

 The ear is divided into three parts:

The outer ear:

 Consist of the pinna (visible, convolved cartilage)

 Its convolved shape is provide some directional cues

 The external canal (external auditory meatus)

 Uniform tube, 2.7 cm long by 0.7 cm across through  It has a number of resonant frequencies at 3 kHz

 The eardrum (tympanic membrane)

 Is a stiff, conical structure at the end of the meatus  It vibrate in response to the sound

slide-24
SLIDE 24

24

Hearing :

The middle ear

 Is an air-filled cavity  Separated from the outer ear by the tympanic

membrane

 Connected to the inner ear by the oval and round

window

 Connected to the outside world by way of the

eustachian tube

slide-25
SLIDE 25

25

Hearing :

 eustachian tube permit equalization of air pressure

between the middle air and the surrounding atmosphere

 the middle ear contain three tiny bone (ossicles)

 Malleus (hammer)  Incus (anvil)  Stapes (stirrup)

 The function of the ossicles

 Impedance transformation  Amplitude limiting

slide-26
SLIDE 26

26

Hearing :

The inner ear

 vestibular apparatus

 Used for balance and sensing orientation

 The round and oval window  Cochlea

 Is a snail-shape passage  communication with the middle ear via the round and

  • val window

 It consist the transducers which convert acoustical

vibration to verve impulses

slide-27
SLIDE 27

27

The Cochlea as It Would Appear If Unwound

slide-28
SLIDE 28

28

Cross Section Of One Turn Of The Cochlea

slide-29
SLIDE 29

29

Position Of Maximum Amplitude Along Basilar Membrance As A Function Of Applied Frequency

slide-30
SLIDE 30

30

Frequency Response Of a Point On The Basilar Membrance