Pattern Recognition
Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory
Pattern Recognition Part 1: Introduction and Motivation Gerhard - - PowerPoint PPT Presentation
Pattern Recognition Part 1: Introduction and Motivation Gerhard Schmidt Christian-Albrechts-Universitt zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory
Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 2
❑ Speech and audio signal paths in a car ❑ Contents of the lecture ❑ Boundary conditions of the lecture (exercises, exam, etc.) ❑ Notation used in the lecture ❑ Literature ❑ Example of medical, speech, and audio signal processing
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 3
Into the car Out of the car Within the car
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 4
Signal processing in the „receiving path“
Signal processing for enhancing the communication quality and the sound impression
Signal processing in the „sending path“ Speech dialog system and phone Music and audio sources
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 5
❑ Preprocessing for improving the „noise robustness“
❑ Single-channel noise suppression ❑ Beamforming
❑ Pattern recognition (using speech and speaker recognition as an example)
❑ Basics of speech production ❑ Feature extraction ❑ Codebook generation ❑ Generation of Gaussian mixture models (GMMs) ❑ Hidden Markov models (HMMs)
❑ Enhancing the playback of audio signals
❑ Extending the bandwidth of speech signals (as application of codebooks) ❑ Loudspeaker equalization
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 6
❑ ECTS points
❑ 4 credit points
❑ Oral examination
❑ about 20 minutes per student ❑ After the term
❑ Talks (part of the exercise)
❑ About 10 minutes talk plus 5 minutes discussion ❑ Topics are available from now on
❑ Lecture slides
❑ Printed at the beginning of each lecture ❑ In the internet via dss.tf.uni-kiel.de
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 7
Scalars:
❑ Signals: ❑ Impulse responses (time-variant): ❑ Example for a (real) convolution:
Vectors:
❑ Signal vectors: ❑ Impulse response vectors (time-variant) : ❑ Example for a real convolution:
Matrices:
Coefficient index Boldface and uppercase Boldface and lowercase
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 8
Random variables and processes:
❑ Notation: ❑ Probability density function: ❑ Stationary random processes: ❑ Expected values of stationary random processes:
No differences between deterministic signals and random processes – different writing styles:
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 9
Auto and cross correlation for real, stationary random processes:
❑ Auto-correlation function: ❑ Cross-correlation function: ❑ (Auto) power spectral density: ❑ (Cross) power spectral density:
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 10
Stationary white noise:
❑ Auto-correlation function: ❑ Auto power spectral density:
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 11
❑ E. Hänsler: Statistische Signale: Grundlagen und Anwendungen, Springer, 2001 (in German) ❑ A. Papoulis: Probability, Random Variables, and Stochastic Processes, McGraw-Hill, 1965
Statistical signal theory:
❑ E. Hänsler, G. Schmidt: Acoustic Echo and Noise Control, Wiley, 2004 ❑ S. Haykin: Adaptive Filter Theory, Prentice Hall, 2002 ❑ A. Sayed: Fundamentals of Adaptive Filtering, Wiley, 2004
Noise suppression, beamforming, adaptive filters:
❑ E. Hänsler, G. Schmidt: Topics in Acoustic Echo and Noise Control, Springer, 2006 ❑ B. Iser, et al.: Bandwidth Extension of Speech Signals, Springer, 2008 ❑ E. Hänsler, G. Schmidt: Speech and Audio Processing in Adverse Environments, Springer, 2008 ❑ J. Benesty, et al.: Speech Enhancement, Springer, 2005
Application examples for speech processing:
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 12
Speech processing:
❑ L. R. Rabiner, R. W. Schafer: Digital Processing of Speech Signals, Prentice Hall, 1978 ❑ P. Vary, U. Heute, W. Hess: Digitale Sprachsignalverarbeitung, Teubner, 1998 (in German) ❑ P. Vary, R. Martin: Digital Speech Transmission, Wiley, 2006 ❑ L. R. Rabiner, R. W. Schafer: Introduction to Digital Speech Processing, Now, 2008 ❑ B. Pfister, T. Kaufman: Sprachverarbeitung, Springer, 2008 (in German)
Audio processing:
❑ U. Zölzer: DAFX – Digital Audio Effects, Wiley, 2002 ❑ E. Larsen, R. M. Aarts: Audio Bandwidth Extension, Wiley, 2004 ❑ M. Talbot-Shmith: Audio Engineer‘s Reference Book, Focal Press, 1998
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 13
Hands-free telephony:
❑ Echo cancellation as well as noise and residual echo suppression ❑ Double talk and barge-in (interrupting a speech dialog system)
Medical signal processing:
❑ Brain computer interfaces
Speech recognition:
❑ Applications for a mobile phone
Audio signal processing:
❑ Loudspeaker equalization ❑ Demo of KiRAT (Kiel Real-time Audio Toolkit)
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 14
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 15
( ) y n
+
Noise and residual echo suppression Echo cancellation
Hands-free telephony – a basic system:
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 16
Transmission to the communication partner (channel delay: about 180 ms) Remote communication partner Received signal („Hearing channel“ of the remote communication partner)
Initial filter convergence:
Adaptation at the beginning of the call Without Wiener filter With Wiener filter
Enclosure dislocations: Stereo signals (16 kHz):
Left: Received signal ... Right: Sent signal ... ... of the remote communication partner
Double talk:
Both partners speak simultaneously
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 17
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 18
Electro- encephalography (EEG) Magneto- encephalography (MEG) Electro- cardiography (ECG) Magneto- cardiography (MCG)
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 19
❑ Helping medical doctors to distinguish better between deseases ❑ „Conventional“ measures ❑ Establishment of so-called early biomarkers ❑ To localize areas of interest in the heart or in the brain ❑ Networks that cause epilepctic seizures, etc. ❑ Unwanted „exciation channels“in the heart ❑ Brain-computer interfaces ❑ Control of electronic devices for handicapped people
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 20
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 21
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 22
❑ Raymond Brückner (SVOX) ❑ Andreas Löw (SVOX) ❑ Patrick Langer (SVOX)
Link to video
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 23
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 24
Digital Signal Processing and System Theory | Pattern Recognition | Introduction Slide 25
❑ Speech and audio signal paths in a car ❑ Contents of the lecture ❑ Boundary conditions of the lecture (exercises, exam, etc.) ❑ Notation used in the lecture ❑ Literature ❑ Example of medical, speech, and audio signal processing
Next week:
❑ Noise suppression