Pattern Recognition Part 5: Codebook Training Gerhard Schmidt - PowerPoint PPT Presentation

Pattern Recognition Part 5: Codebook Training Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory

Codebook Training • Contents ❑ Motivation ❑ Application examples ❑ Cost function for the training of a codebook ❑ LBG- and k-means algorithm ❑ Basic schemes ❑ Extensions ❑ Combination with additional mapping schemes Slide 2 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Introduction and Motivation – Part 1 Feature vectors and corresponding codebook – feature room 1: ❑ Codebook with 4 entries Feature vectors Codebook entries ❑ 10.000 feature vectors are quantized Feature 2 Feature 1 Slide 3 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Introduction and Motivation – Part 2 Feature vectors and corresponding codebook – feature room 2: ❑ Codebook with 4 entries Feature vectors Codebook entries ❑ 10.000 feature vectors are quantized Feature 2 Feature 1 Slide 4 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Codebooks Codebook definition: Codebook matrix mit Codebook vector ❑ The codebook vectors should be chosen such that they represent a large number of so-called feature vectors with a small average distance. ❑ For feature vectors and a distance measure it should follow for the average distance : Slide 5 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Literature Codebook training: ❑ L. R. Rabiner, R. W. Schafer: Digital Processing of Speech Signals , Prentice Hall, 1978 ❑ C. Bishop: Pattern Recognition and Machine Learning , Springer, 2006 ❑ B. Pfister, T. Kaufman: Sprachverarbeitung , Springer, 2008 (in German) Current version of „ Sprachsignalverarbeitung “ is free via the university library … Slide 6 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 1 Basic structure of a codebook search: Matrix of codebook vectors „Distance“ of the current feature vector Code- to the „best“ Feature vector book codebook entry Distortion-reducing Feature extraction Vector preprocessing (MFCCs, cepstral Index (address) of the quantisation (beamforming, echo coeffictients) „best“ codebook entry cancellation, and noise reduction) Slide 7 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 2 Speaker recognition - structure: Codebook for the first speaker Feature vector Index of the Accumulation „most of the distances probable“ over time speaker Distortion- Feature reducing extraction Best distances preprocessing Codebook for the last speaker Slide 8 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 3 Speaker recognition – basic principle: ❑ The current spectral envelope of the signal is compared to the entries of several codebooks and the distance to the „ best match “ is computed for each codebook. ❑ The codebooks belong to a speaker that is known in advance and have been trained to his data. ❑ The smallest distances of each codebook are accumulated . ❑ The smallest accumulated distance determines on which speaker it will be decided. ❑ Models of the speakers that are known in advance compete against one ore more „ universal “ models . A new speaker can be recognized if the „universal“ codebook is better than the best individual one. In this case, a new codebook for the new speaker can be initialized. ❑ An update of the „winner - codebook“ is usually done. Slide 9 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 4 Speaker recognition – basic principle: Codebook „Best“ entry in the for the first codebook first speaker current spectral envelope dB Codebook for the Frequency second speaker „Best“ entry in the second codebook Slide 10 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 5 Signal reconstruction – structure: Matrix of codebook vectors Estimation of the undistorted spectral envelope Code- book Feature vector Undistorted spectral envelope Feature extraction Distortion-reducing Index (address) of the preprocessing „best“ codebook entry Detection of „good“ and „bad“ SNR conditions Slide 11 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 6 Signal reconstruction – basic principle: ❑ First, the input signal is „conventionally“ noise-reduced . In this processing step it is also determined in which frequency ranges the conventional method does not “open” (attenuation is less than the maximum attenuation). ❑ Based on the conventionally improved signal, the spectral envelope is estimated by, e.g., thelogarithmic melband- signal powers. ❑ In a codebook it is now searched for that envelope that has the smallest distance to the input signal’s envelope within the „ allowed “ frequency range . ❑ Because the „allowed“ frequency range is not known a priori , both, the features that are used and the cost function, have to be chosen appropriately . ❑ Finally, the extracted spectral envelope of the input signal and the best codebook envelope are combined such that the codebook envelope is chosen in such areas where the conventional noise reduction fails and the original input envelope is chosen for the remaining frequency range. Slide 12 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 7 Signal reconstruction – basic principle: Envelope of the „Best“ entry of input signal the codebook Current Combined spectral spectral envelope envelope dB dB Frequency Frequency Best codebook envelope Frequency range in which a conventional noise reduction is not working satisfactory Slide 13 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 8 Bandwidth extension – structure: Combination of the envelope of the broadband codebook and Double matrix with the telephone-bandlimited pairs of codebook vectors envelope Code- Code- buch 1 buch 2 Spectral broadband- envelope Distortion-reducing Feature preprocessing extraction Index (address) of the „best“ codebook entry Envelope of the signal that is limited to telephone bandwidth Slide 14 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 9 Bandwidth extension – basic principle: ❑ For bandwidth extension , two codebooks are trained in parallel : one for the envelpes of the telephone-bandlimited signal and a second one for the envelopes of the broadband signal. ❑ During the training it is important that the input vectors (so-called double vectors) are available synchronously , such that identical feature data can be used for the clustering of the narrowband and broadband data. ❑ The search is done in the narrowband-codebook. The suitable broadband-codevector is chosen and combined with the extracted narrowband-entry in such a way, that the original envelope is chosen in the telephone band and the broadband codebook entry for the remaining frequency range. Slide 15 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 10 Bandwidth extension – basic principle: „Best“ entry of the codebook Codebook with Envelope in the pairs of narrow- telephone band and broadband dB envelopes Frequency Best broadband codebook envelope Resulting broadband dB envelope Input envelope Frequency Slide 16 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Codebook Training • Application Examples – Part 11 More application examples ❑ Speech coding : The aim is basically to code the spectral envelope with as little bits as possible. Vector quantization is a good choice here. ❑ Classification of speech sound (sibilant sounds, vowels): Several codebook entries are trained for each speech sound. Afterwards, a good classification can be achieved. This can be used, e.g., to select between different preprocessing methods for reducing distortions or to parametrize a universal preprocessing method properly. ❑ (Noise-) environment recognition : Depending on the environment, different hard- and software components should be used (switching between cardioid and omnidirectional microphones, activation or deactivation of dereverberation, etc.). To realize such a classification, codebooks for different environments can be trained and compared during operation. Slide 17 Digital Signal Processing and System Theory | Pattern Recognition | Codebook Training

Pattern Recognition Part 5: Codebook Training Gerhard Schmidt - PowerPoint PPT Presentation

Pattern Recognition Part 5: Codebook Training Gerhard Schmidt Christian-Albrechts-Universitt zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory Codebook Training

Part 5 pattern recognition pattern recognition track pattern recognition: associate hits

Feature Selection Pattern Recognition: The Early Days Pattern Recognition: The Early Days Only

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION Pattern Recogniton Pattern: Any

CS 7616 Pattern Recognition Introduction Aaron Bobick School of Interactive Computing

Pattern Recognition CSE 802 Michigan State University Spring 2017 Lecture 1, January 9, 2017

Applications of Pattern Recognition in Computational Biology Pattern Recognition Course

Pattern Recognition: An Overview Prof. Richard Zanibbi Pattern Recognition (One) Definition

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

CS 7616 Pattern Recognition Bayesian Decision Theory Aaron Bobick School of Interactive Computing

Pattern Recognition 2018 Support Vector Machines Ad Feelders Universiteit Utrecht Ad Feelders

An NFR Pattern Approach to Dealing An NFR Pattern Approach to Dealing An NFR Pattern Approach to

Scope Constrained Frequent Pattern Mining: Constrained Frequent Pattern Mining: A A

A common pattern: map Another common pattern: filter Pattern: take a list and produce a new list,

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Pattern Recognition Theory Lecture 12 : Correlation Filters Pattern Matching a How to match

Bag-of-Visual-Words 16-385 Computer Vision (Kris Kitani) Carnegie Mellon University What object

Lecture 2 Matthieu Bloch 1 Channel coding problem As illustrated in Figure 1, the problem of

Sing It, Say It, Share It LESSON 7 Your Response to the Lesson What was most interesting in the

Core object model EO / EFL++ Carsten Haitzler Samsung Electronics Principal Engineer

Information Theory Lecture 4 Discrete channels, codes and capacity: CT7 Channels:

Refactoring Legacy Code By: Adam Culp Twitter: @ adamculp https://joind.in/ 11658 1 Refactoring

Attacks only get better: The case of OCB2 Tetsu Iwata Nagoya University Real World Crypto 2020,

Compiler Construction Chapter 1: Introduction Slides modified from Louden Book and Dr. Scherger