Unsupervised Piano Music Transcription
Taylor Berg-Kirkpatrick Jacob Andreas and Dan Klein CMU UC Berkeley
Unsupervised Piano Music Transcription Taylor Berg-Kirkpatrick - - PowerPoint PPT Presentation
Unsupervised Piano Music Transcription Taylor Berg-Kirkpatrick Jacob Andreas and Dan Klein CMU UC Berkeley Piano Music Transcription note time Supervised Transcription Supervised Transcription w Model
Taylor Berg-Kirkpatrick Jacob Andreas and Dan Klein CMU UC Berkeley
time note
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
time freq
Symbolic Music
Audio signal
Piano Parameters Symbolic Music
Generative Model
Audio signal
Piano Parameters Symbolic Music
Generative Model
Audio signal
time note
Mn
n
Note events
time velocity
Note events
time
Activation
time
Spectrogram
time freq
Component spectrogram
time freq freq time
duration velocity
REST PLAY
X
Parameters Latent variables
Sn An Mn
µn αn σn
E1 E2 E3
V2
V3
Event type
V1
Velocity
PLAY REST PLAY
duration velocity
REST PLAY
D1
Duration
D2 D3
Mn
µn
V2
V3
V1 D1 D2 D3
An
Activation
V2 V3 V1 D1 D2 D3
copy temporal shape αn
Temporal shape
An
Activation
V2 V3 V1 D1 D2 D3
αn truncate to duration
Temporal shape
An
Activation
V2 V3 V1 D1 D2 D3
αn scale to velocity
Temporal shape
An
Activation
V2 V3 V1 D1 D2 D3
αn add Gaussian noise
Temporal shape
An
Activation
σn
Spectral shape
Sn
Component spectrogram
Poisson noise
. . .
σ1 σN S1 SN
Total spectrogram
X
A1 AN
Note events
time
Activation
time
Spectrogram
time freq
Component spectrogram
time freq freq time
duration velocity
REST PLAY
X
Parameters Latent variables
Sn An Mn
µn αn σn
Semi-Markov dynamic program Closed form update Exponentiated gradient ascent Exponentiated gradient ascent
α|A, M
Temporal shapes update:
A|M, X, α, σ
Activations update:
σ|A, X
Spectral shapes update: Note events update:
M|A, α, µ
time note
50 60 70 80 82.1 70.4
69.0
68.6 58.3
Onset F1
MAPS Corpus
O’Hanlon 2014 Benetos 2014 Vincent 2013
Unsupervised*
[Berg-Kirkpatrick et al. 2014] [Valentin et al. 2010]
Supervised
Predicted Reference
Grieg input Grieg resynth piano Grieg resynth guitar
Chopin input Chopin resynth piano Chopin resynth guitar
Beethoven input Beethoven resynth piano Beethoven resynth guitar