Some Notes on the Psychoacoustics and Signal Processing of RASTA-PLP Analysis of Speech
- Introduction
- Reflection Masking Model
- RASTA-PLP Processing
- RASTA applied to RIR filtered speech
Some Notes on the Psychoacoustics and Signal Processing of RASTA-PLP - - PowerPoint PPT Presentation
Wire Communication Laboratory - University of Patras Some Notes on the Psychoacoustics and Signal Processing of RASTA-PLP Analysis of Speech Jrg Buchholz Introduction Reflection Masking Model RASTA-PLP Processing RASTA
masker suppressor suppressor test signals test signals post (forward) masking pre (backward) masking simultaneous masking time frequency
TMM Simultaneous Masking Directivity Module s (t)
i
s (t)
i-1
s (t)
i+1
Module Two-Tone Suppression Module (BP-Filterbank) Transformation / Resythesis Feature Vectors / Audible Signal TMM TMM TMM
Speech compressing static NL (log) CB-integration (mel-scale) FFT expanding static NL (Exp) linear BP-filtering equal loudness curve IFFT / IDFT power law of hearing cepstral recursion solving set of linear equations cepstral coefficients of RASTA-PLP model RASTA
time / ms 10 50 40 60 20 30 frequency frequency amplitude 1 fa/2 fa/2 power spectrum mel-scale power spectrum short time analysis i+3 i+4 i+1 i+2 i+5 i time trajectory k FFT CB integration
10
2
10
3
10
4
20 Magnitude in dB 10
2
10
3
10
4
20 40 frequency / Hz Magnitude in dB
FT
log
y y
FT
50 100 150 200 250 300 350 400 0.1 0.2 0.3 Time / ms Amplitude 10
10
10 10
1
10
2
10 Modulation Frequency / Hz Magnitude in dB
− − − −
4 1 3 4 1
500 1000 1500 2000 0.5 1 500 1000 1500 2000
0.5 1 normalis ed amplitude 500 1000 1500 2000 0.5 1 time / ms
Time Trajectory (1 kHz band) BP-filtered Time Trajectory Negative values set to zero
500 1000 1500 2000 0.5 1 500 1000 1500 2000
0.5 1 normalis ed amplitude 500 1000 1500 2000 0.5 1 time / ms
Time Trajectory (1 kHz band) BP-filtered Time Trajectory Negative values set to zero
x(n) n x(n) n X(n , )
2 2k
ω X(n , )
1 1k
ω n2 ω1 A
n1 ω2
0 dB 0 dB
+18 dB