GCT535 - Sound Technology for Multimedia: Time-Stretching and Pitch-Shifting
Juhan Nam, Graduate School of Culture Technology, KAIST
Outline
Resampling
Overlap and Add (OLA) methods: SOLA, WSOLA, PSOLA
Phase Vocoder
[Figure: four interpolation kernels plotted against time in samples (-5 to 5): windowed sinc, linear, 3rd-order B-spline, 3rd-order Lagrange]
[Figure, from the DaFX book: original signal; speed down (up-sampling); speed up (down-sampling)]
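The resampling idea in the figure can be sketched in a few lines. This is a minimal illustration using linear interpolation only (one of the kernels listed earlier), not the lecture's own code; the function name `resample_linear` and its `ratio` parameter are hypothetical. Played back at the original sample rate, the up-sampled signal is slower and lower in pitch, and the down-sampled signal is faster and higher in pitch. Note that this sketch applies no low-pass filter, so down-sampling may alias.

```python
import numpy as np

def resample_linear(x, ratio):
    """Resample x by linear interpolation (hypothetical helper).

    ratio > 1 yields more samples (up-sampling, i.e. speed down);
    ratio < 1 yields fewer samples (down-sampling, i.e. speed up).
    """
    x = np.asarray(x, dtype=float)
    n_out = int(round(len(x) * ratio))
    # Fractional read positions in the input, one per output sample.
    positions = np.arange(n_out) / ratio
    # np.interp clamps positions beyond the last input sample.
    return np.interp(positions, np.arange(len(x)), x)

ramp = np.arange(10.0)
slow = resample_linear(ramp, 2.0)   # twice as many samples
fast = resample_linear(ramp, 0.5)   # half as many samples
```

Because speed and pitch change together here, resampling alone cannot stretch time without shifting pitch, which is what motivates the OLA family of methods.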
[Figure: sample blocks, analysis hop size, synthesis hop size]
Hop-size ratio: $I_s / I_a$ ($I_s$: synthesis hop size, $I_a$: analysis hop size)
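A minimal sketch of plain OLA time stretching follows. It assumes the stretch ratio is the synthesis hop size divided by the analysis hop size, uses a Hann window, and normalizes by the summed window; the function name, the default frame length, and the normalization step are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def ola_time_stretch(x, ratio, frame_len=1024, synthesis_hop=256):
    """Plain overlap-and-add time stretching (hypothetical helper).

    ratio is assumed to be synthesis_hop / analysis_hop, so ratio > 1
    lengthens the signal and ratio < 1 shortens it.
    """
    x = np.asarray(x, dtype=float)
    analysis_hop = max(1, int(round(synthesis_hop / ratio)))
    window = np.hanning(frame_len)
    n_frames = max(1, (len(x) - frame_len) // analysis_hop + 1)
    out_len = (n_frames - 1) * synthesis_hop + frame_len
    y = np.zeros(out_len)
    norm = np.zeros(out_len)
    for m in range(n_frames):
        a = m * analysis_hop          # read position (analysis)
        s = m * synthesis_hop         # write position (synthesis)
        frame = x[a:a + frame_len]
        w = window[:len(frame)]
        y[s:s + len(frame)] += frame * w
        norm[s:s + len(frame)] += w
    # Divide by the summed window to undo amplitude modulation;
    # leave samples with no window coverage untouched.
    norm[norm < 1e-8] = 1.0
    return y / norm

stretched = ola_time_stretch(np.ones(8192), 2.0)
```

Plain OLA ignores the waveform when placing blocks, so periodic signals can pick up phase discontinuities at block boundaries. The synchronized variants (SOLA, WSOLA, PSOLA) address this.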
[Figure: overlapping blocks with fade-in and fade-out, showing analysis hop size and synthesis hop size]
[Figure: analysis hop size, synthesis hop size, and length L]
Synchronization by cross-correlation

$$\mathrm{Xcorr}(l) = \sum_{n=0}^{L-1} x_1(n)\, x_2(n+l)$$

Find the lag $l$ where the cross-correlation is maximum
Shift the next frame by the lag
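The lag search can be sketched directly from the cross-correlation sum. This is an illustrative sketch only: the function name `best_lag`, the `max_lag` search range, and the restriction to non-negative lags are assumptions not stated on the slide.

```python
import numpy as np

def best_lag(x1, x2, max_lag):
    """Return the lag l in [0, max_lag] that maximizes
    sum_{n=0}^{L-1} x1[n] * x2[n + l], where L = len(x1).
    Also returns the cross-correlation values (hypothetical helper).
    """
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    L = len(x1)
    if len(x2) < L + max_lag:
        raise ValueError("x2 must have at least len(x1) + max_lag samples")
    xcorr = np.array([np.dot(x1, x2[l:l + L]) for l in range(max_lag + 1)])
    return int(np.argmax(xcorr)), xcorr

# Example: x2 is x1's sinusoid delayed by 7 samples.
n1 = np.arange(200)
n2 = np.arange(300)
x1 = np.sin(2 * np.pi * n1 / 50)
x2 = np.sin(2 * np.pi * (n2 - 7) / 50)
lag, xcorr = best_lag(x1, x2, max_lag=20)
```

In a SOLA-style loop, `x1` would be the tail of the output written so far and `x2` the candidate region of the next frame; the next frame is then shifted by `lag` before cross-fading.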
PSOLA analysis
[Figure: pitch marks, segments]
PSOLA time stretching
[Figure: pitch marks, segments, synthesis pitch marks, overlap and add]
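The PSOLA time-stretching steps (pitch marks, segments, synthesis pitch marks, overlap and add) might be sketched as below. This is a simplified illustration under several assumptions: the pitch marks are already given as strictly increasing sample indices, each segment spans the two periods around a mark and is weighted by a Hann-shaped window, and each synthesis mark reuses the nearest analysis mark. The function name and these choices are hypothetical, not the lecture's implementation.

```python
import numpy as np

def psola_time_stretch(x, pitch_marks, ratio):
    """Simplified PSOLA time stretching (hypothetical helper).

    Synthesis marks keep the local pitch period, so pitch is preserved;
    segments are repeated (ratio > 1) or skipped (ratio < 1).
    """
    x = np.asarray(x, dtype=float)
    marks = np.asarray(pitch_marks, dtype=int)
    if len(marks) < 3 or np.any(np.diff(marks) <= 0):
        raise ValueError("need at least 3 strictly increasing pitch marks")
    out_len = int(round(len(x) * ratio))
    y = np.zeros(out_len)
    t_syn = float(marks[0]) * ratio            # first synthesis pitch mark
    while t_syn < out_len:
        # Analysis mark nearest to the corresponding input time.
        i = int(np.argmin(np.abs(marks - t_syn / ratio)))
        i = min(max(i, 1), len(marks) - 2)     # need a neighbour on each side
        left = int(marks[i] - marks[i - 1])    # period before the mark
        right = int(marks[i + 1] - marks[i])   # period after the mark
        seg = x[marks[i] - left:marks[i] + right]
        # Window rising over `left` samples and falling over `right` samples.
        win = np.concatenate([np.hanning(2 * left + 1)[:left],
                              np.hanning(2 * right + 1)[right:-1]])
        start = int(round(t_syn)) - left
        lo, hi = max(start, 0), min(start + len(seg), out_len)
        if hi > lo:
            y[lo:hi] += (seg * win)[lo - start:hi - start]
        t_syn += right                         # advance by one pitch period
    return y

# Example: a cosine with a 100-sample period and marks on its peaks.
n = np.arange(4000)
tone = np.cos(2 * np.pi * n / 100)
marks = np.arange(100, 3901, 100)
longer = psola_time_stretch(tone, marks, 1.5)
```

In this example the output is 1.5 times longer while its period stays at 100 samples. In practice the hard part is estimating reliable pitch marks, which this sketch takes as input.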
PSOLA pitch shifting
[Figure: pitch marks, segments, synthesis pitch marks, overlap and add]