SLIDE 1
Cepstral analysis in speech processing
From the speech production model, we have:
s[n] = (p[n] * g[n] + u[n]) * v[n] * r[n]
where * denotes convolution and
- p[n] => periodic impulse train
- u[n] => random white noise
- g[n] => glottal filter impulse response
- v[n] => vocal tract impulse response
- r[n] => lip radiation system impulse response
Consider voiced speech (u[n] = 0):
s[n] = p[n] * g[n] * v[n] * r[n] = p[n] * h[n], where h[n] = g[n] * v[n] * r[n]
=> S(z) = P(z)H(z), where H(z) = G(z)V(z)R(z)
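A minimal numerical sketch of the step from convolution in time to a product of transforms, evaluated on the unit circle with a zero-padded DFT. The pitch period, pulse count, and the single decaying resonance standing in for h[n] are illustrative assumptions, not values from the lecture.

```python
import numpy as np

# Hypothetical parameters, chosen only for illustration.
pitch_period = 80      # samples between excitation pulses
num_pulses = 8

# p[n]: periodic impulse train (finite length here).
p = np.zeros((num_pulses - 1) * pitch_period + 1)
p[::pitch_period] = 1.0

# h[n]: stand-in for g[n] * v[n] * r[n]; one decaying resonance (an assumption).
n = np.arange(64)
h = (0.9 ** n) * np.cos(0.3 * np.pi * n)

# Voiced speech: s[n] = p[n] * h[n] (linear convolution).
s = np.convolve(p, h)

# Zero-pad to at least len(s) so the DFT product matches linear convolution.
nfft = 1024
S = np.fft.fft(s, nfft)
P = np.fft.fft(p, nfft)
H = np.fft.fft(h, nfft)

# S(z) = P(z)H(z), sampled at nfft points on the unit circle.
product_matches = np.allclose(S, P * H)
```

If nfft were shorter than len(s), the DFT product would correspond to circular rather than linear convolution and the check would no longer hold.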
The convolved components p[n] and h[n] are additive in the complex cepstrum:
- H(z) gives a complex cepstrum which is non-zero for both positive and negative time and which decays rapidly for large |n|
- P(z) gives a complex cepstrum consisting of decaying impulses at multiples of the pitch period
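The additivity and the impulses at multiples of the pitch period can be checked numerically. The sketch below is a simplification under stated assumptions: both p[n] (a decaying impulse train with hypothetical decay alpha) and h[n] (a truncated geometric sequence with hypothetical decay beta) are minimum phase, so the principal-value complex log is enough. A realistic h[n] is mixed phase, which is why its cepstrum extends to negative time, and would need phase unwrapping instead.

```python
import numpy as np

def complex_cepstrum_principal(x, nfft):
    """Complex cepstrum using the principal-value log.

    Only valid while the phase of X stays inside (-pi, pi),
    which holds for the minimum-phase examples below (assumption).
    """
    X = np.fft.fft(x, nfft)
    return np.fft.ifft(np.log(X)).real

# Hypothetical parameters for illustration.
pitch_period, num_pulses = 80, 8
alpha, beta = 0.5, 0.9
nfft = 4096

# p[n]: decaying impulse train, alpha**k at n = k * pitch_period.
p = np.zeros((num_pulses - 1) * pitch_period + 1)
p[::pitch_period] = alpha ** np.arange(num_pulses)

# h[n]: truncated geometric sequence standing in for g * v * r.
h = beta ** np.arange(64)

s = np.convolve(p, h)   # s[n] = p[n] * h[n]

p_hat = complex_cepstrum_principal(p, nfft)
h_hat = complex_cepstrum_principal(h, nfft)
s_hat = complex_cepstrum_principal(s, nfft)

# Convolution in time becomes addition in the complex cepstrum.
additive = np.allclose(s_hat, p_hat + h_hat)

# Impulses at multiples of the pitch period; the series for
# -log(1 - alpha z^-P) gives roughly alpha**k / k at n = k * P.
pitch_peaks = p_hat[[pitch_period, 2 * pitch_period, 3 * pitch_period]]

# h_hat decays roughly like beta**n / n, so it is tiny near the pitch period.
h_hat_at_pitch = h_hat[pitch_period]
```

Because h_hat is already negligible at n = pitch_period, the pitch impulses stand out in s_hat, which is what makes the two components separable in the cepstral domain.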
The real cepstrum is the even part of the complex cepstrum
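This follows because log|X| is the real part of the complex log of X, and the inverse transform of the real part of a spectrum is the even part of the sequence. A small sketch checking it on x[n] = delta[n] - a delta[n-1], a hypothetical minimum-phase test sequence chosen so that no phase unwrapping is needed:

```python
import numpy as np

nfft = 1024
a = 0.5                      # |a| < 1 keeps the sequence minimum phase
x = np.array([1.0, -a])      # x[n] = delta[n] - a * delta[n-1]

X = np.fft.fft(x, nfft)

# Complex cepstrum: inverse DFT of log X (principal value suffices here).
complex_cep = np.fft.ifft(np.log(X)).real

# Real cepstrum: inverse DFT of log |X|.
real_cep = np.fft.ifft(np.log(np.abs(X))).real

# Even part with circular indexing: (x_hat[n] + x_hat[-n]) / 2.
reversed_cep = np.roll(complex_cep[::-1], 1)
even_part = 0.5 * (complex_cep + reversed_cep)

matches = np.allclose(real_cep, even_part)
```

For this sequence the complex cepstrum is one-sided (about -a**n / n for n >= 1), while the real cepstrum spreads half of each value to negative time, so phase information is lost in the real cepstrum.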
Lecture-oct4-a
03 October 2010 11:20 Class A Page 1