SLIDE 3 3
Digital Audio Effects DAFx-2008 220 440 880 0.00 0.50 1.00 1.50 2.00 Fundamental Frequency [Hz] wH / wI
×106
Pitch manipulation
- Pitch-dependent feature function
– approximates timbral features over pitches by polynomial function
- power of harmonics ( )
- the ratio of harmonic energy to inharmonic energy ( )
- Manipulating the spectral envelope
– by multiplying the pitch trajectory ( ) by a desired ratio – Obtain timbral features from pitch-dependent feature function ) ( ' r µ ) ( ' r µ
n
v'
I H w
w /
n
v
Frequency
n
v
) (r µ ) (r µ
…
Amplitude ) (r µ
Pitch trajectory Power of harmonics Power of harmonics
220 440 880 −0.05 0.00 0.05 0.10 Fundamental Frequency [Hz] v of 4th 220 440 880 0.00 0.20 0.40 0.60 0.80 1.00 Fundamental Frequency [Hz] v of 1st
pitch trajectory [Hz] pitch trajectory [Hz] pitch trajectory [Hz] Power of 1 th harmonics Power of 4 th harmonics The ratio of harmonic en. to inharmonic en.
Digital Audio Effects DAFx-2008
Duration manipulation
Th r E dr r dE > < ) ( , ) ( ε
- Manipulating the temporal envelope ( )
– by expanding or shrinking between onset ( ) and offset ( )
r
r
Time Preserve ) (r E ) (r E
– Pitch trajectory ( ) is analyzed and synthesized by sinusoidal model
Time ) (r µ ) (r µ Smoothing Preserve Expand Preserve Preserve Synthesize Analyze
r
r
detection equation:
Detect Detect Amplitude Frequency ) (r µ
) (r E
Synthesized Pitch trajectory
Temporal envelope
Original Pitch trajectory
Digital Audio Effects DAFx-2008
Synthesis from harmonics and inharmonics ∑
=
n n n H
t j t A t s )] ( exp[ ) ( ) ( φ
) ( ' ) ( t E v w t A
n n H n
=
∫
+ =
t n n
d n t ) ( ' ) ( ) ( τ τ µ φ φ
) (t sH ) (t sI ) (t s ) (t sH ) (t sI ) (t s
Instance amplitude: Instance phase: Harmonic signal:
H
w
Harmonic energy:
– using sinusoidal model
– from inharmonic model weighted by inharmonic energy ( )
– obtained by adding these two signals
Equations for harmonic signal
Power of harmonics: Temporal envelope:
※” ‘ ” parameter is a manipulated parameter.
Pitch trajectory:
'
n
v ) (t En ) ( ' τ µ '
I
w
Digital Audio Effects DAFx-2008
Evaluation in pitch manipulation
– Our method without pitch-dependent feature function
– Spectral distance: evaluation of harmonic component difference – Mel-Frequency Cepstrum Coefficient (MFCC) distance:
- quantitative auditory measurement
- evaluation of harmonic and inharmonic components differences
- Conditions
– 32 instruments from RWC-MDB (forte, normal articulation)
- 3 individuals for each instrument
– 10-fold cross validation (10%:90% = [evaluation data]:[learning data])
∑
− =
t f syn real
T r f C r f C D
, 2 /
)) , ( ) , ( (
Synthesis sound Real sound Frames i
C
Spectrum or MFCC
= Sophisticated sinusoidal model