Automatic Transcription and Separation of the Main Melody from - PowerPoint PPT Presentation

Automatic Transcription and Separation of the Main Melody from Polyphonic Music Signals Jean-Louis Durrieu, PhD candidate TSI Department, Telecom ParisTech http://perso.telecom-paristech.fr/durrieu/en/ 07/09/09

Automatic Transcription and Separation of the Main Melody from Polyphonic Music Signals 1.Introduction 2.Signal Models 3.Transcription of the Melody 4.“Solo/Accompaniment” Separation page 2 direction ou services

Introduction  Blind Audio Source Separation (BASS) for music and Music Information Retrieval (MIR) : → Inter-related Fields  Polyphonic music recordings: a BASS/MIR hybrid approach to main melody transcription/separation  Applications page 4 direction ou services

Introduction: link between BASS/MIR MIR BASS Approaches Approaches • Perceptually • Based on models motivated • Data-driven • Knowledge driven • “Low-level” • “High-level” (signal level) (semantic level) “Breaking” music into “atomic” elements Separated • Transcription Musical Sources • Indexing page 5 direction ou services

Introduction: Bridging BASS/MIR “gap”  Improving BASS with MIR, and MIR with BASS  2 instruments transcription/separation example: MIR BASS MIR  Hybrid approaches: • E. Vincent , “Musical Source Separation Using Time- Frequency Source Priors”, IEEE Transactions on Audio, Speech and Language Processing, vol. 14, No 1 • Singing voice signals? page 6 direction ou services

Introduction: Main Melody Transcription, Main Instrument Separation  Definitions: • [MIREX] “Audio Melody Extraction”: extract the main melody from polyphonic audio signals. • [Paiva2007]: “[The Main] Melody is the dominant individual pitched line in a musical ensemble”.  Addressing 2 tasks: • Main Melody Transcription: identify and transcribe the sequence of fundamental frequencies played by the main instrument in a polyphonic music signal (mono or stereo), • Main Instrument/Accompaniment Separation: separate the instrument playing the main melody from the other accompaniment instruments. page 7 direction ou services

Introduction: Applications  Transcribed Melody • Indexing large music database, • Musical transcription into “human readable” score, • ...  Separating the Main Instrument from the Accompaniment: • Generate accompaniments for solo performers • Pre-Processing for MIR applications (chord detection, instrument classification, etc.) • ... page 8 direction ou services

Introduction: Presentation Outline  Signal Models Source/Filter model for the main instrument, NMF for the other instruments; estimation algorithm for the corresponding parameters,  Melody transcription Viterbi smoothing of the melody sequence,  Main Instrument/Accompaniment Separation (also referred to as Solo/Accompaniment Separation) Wiener filters to estimate the separated sources,  Conclusion/Discussions page 9 direction ou services

Introduction: System Outline (ICASSP09) page 10 direction ou services

Introduction: Contributors at Telecom ParisTech  Supervisors: • Bertrand DAVID, • Gaël RICHARD.  Team members: • Nancy BERTIN, • Cédric FEVOTTE, • Alexey OZEROV, • And all the other Audiosig project team members... page 11 direction ou services

Signal Models  Audio signals: • Time-Frequency Representation, • Statistical modeling.  Mixture model  Source/Filter model for the main instrument • Motivations • Characterizing the main melody instrument  NMF -based model for the accompaniment • Decomposition on limited dictionary • Link between NMF and our framework  Parameter estimation • NMF -like algorithm: multiplicative gradient approach page 13 direction ou services

Signal Models: Time-Frequency Representation  Digital audio: waveform  Time-frequency representation: • Evolution of frequency content, Frequency (Hz) • Human auditory system.  Short-Time Fourier Transform (STFT): Time (s) page 14 direction ou services

Signal Models: Complex Proper Gaussians  Model for complex spectrum:  Independence across time and frequencies:  For stationary processes, power spectrum density (PSD) of  Variance/PSD matrix s.t. page 15 direction ou services

Signal Models: Mixture Model  Mixture = Solo + Accompaniment Voice Music  Each signal centered-Gaussian, with resp. variances :  Independence between V and M: NMF decomposition of the power spectrum Source/Filter model page 16 direction ou services

Signal Models: Source/Filter Model for the Main Instrument  Motivations: • Singing voice often main instrument, • Source/Filter widely used, suitable for wide range of other instruments, • Separately modeling pitched aspects (source) from timbre aspects (filter). Human vocal tract (from Wikipedia) page 18 direction ou services

Signal Models: Source/Filter Principle (Glottal) (Vocal Tract) Source Filter Frequency (Hz) Frequency (Hz) Frequency (Hz) page 19 direction ou services

Signal Models: Source/Filter Variability A Vocal Signal (by Tamy - from MTG MASS database) Frequency (Hz) Time (s) page 20 direction ou services

Signal Models: Source/Filter Variability  Human singer: • Independent evolution of pitches and filters (vowel), • Continuous pitch variations, • Limited set of vowels (smooth filters), • Unvoiced parts...  Proposed Model for Main Instrument: • Discrete range of possible for voiced source component, log-spaced s.t. 96 per octave, • Limited number of “smooth” filters , • Unvoiced source component integrated later in the estimation process. page 21 direction ou services

Signal Models: Source Component (1/2)  Voiced source component : • KLGLOTT88 (Glottal source) model, [Klatt90]: spectral comb dictionary , “notes”, • Freq. , Pitch : power spectrum , • Pitch , Frame : activation coefficients , • Nonnegative combination of the element of the dictionary  Unvoiced source • In dictionary , “unvoiced” component such that: • Activation coefficient estimated only after filter part. page 22 direction ou services

Signal Models: Source Component (2/2) f0 number Frequency (Hz) Frequency (Hz) Time (s) f0 number page 23 direction ou services

Signal Models: Filter Component (1/2)  Filter component: • Dictionary of filters , • Freq. , Filter number : freq. response , • Filter , Frame : activation , • Combination:  Filter smoothness: • Decomposition on spectral dictionary of smooth “atomic” elements , activations , • That is to say: page 24 direction ou services

Signal Models: Filter Component (2/2) filter p Time (s) Frequency (Hz) Frequency (Hz) Frequency (Hz) Time (s) page 25 direction ou services

Signal Models: Source/Filter Summary  Source contribution:  Filter contribution:  Main Instrument contribution to the mixture power spectrum:  Parameters: • Fixed parameters: dictionaries and • To estimate : page 26 direction ou services

Signal Models: Accompaniment (1/2)  Accompaniment/Background Music component : • Power spectrum dictionary , with elements, • Activation matrix , • Nonnegative combination of the element of the dictionary  Equivalence between [Fevotte09]: • Maximum Likelihood (ML) estimation of and with • NMF minimizing the Itakura-Saito divergence between and the matrix product page 28 direction ou services

Signal Models: Accompaniment (2/2) r Frequency (Hz) Frequency (Hz) Time (s) r page 29 direction ou services

Signal Models: Mixture model summary  Mixture variance/PSD matrix: • Main Instrument: • Accompaniment: • Mixture:  Parameters: • Fixed Parameters: • To be estimated: page 30 direction ou services

Signal Models: Parameter Estimation  Maximum Likelihood (ML) estimation: • Log-likelihood of the observations : • With the parameterized variance:  NMF inspired algorithm: • Itakura-Saito divergence between and • Multiplicative updates for parameter estimation page 31 direction ou services

Automatic Transcription and Separation of the Main Melody from - PowerPoint PPT Presentation

Automatic Transcription and Separation of the Main Melody from Polyphonic Music Signals Jean-Louis Durrieu, PhD candidate TSI Department, Telecom ParisTech http://perso.telecom-paristech.fr/durrieu/en/ 07/09/09 Automatic Transcription and

Automatic Drum Transcription E6820 Project Proposal Ron Weiss ronw@ee.columbia.edu Automatic

Unsupervised Piano Music Transcription Taylor Berg-Kirkpatrick Jacob Andreas and Dan Klein

Separation energies A = 21 isobaric chain one-nucleon separation energies two-nucleon separation

TFClass a classifjcation of transcription factors Jrgen Dnitz, Edgar Wingender T

MELODY TONG GE JINGSI LI SHUO YANG Music programming language .mc .csv .midi

Wipes Clog Pipes! Central Sans Approach to the War on Wipes Melody LaBella Melody LaBella

Overview of CCCSDs Recycled Water Program Melody LaBella, Melody LaBella, P.E. P.E. LAFCO

FROM DRUM TRANSCRIPTION TO DRUM PATTERN VARIATION Richard Vogl richard.vogl@tuwien.ac.at PART 1

Automatic Verification of Automatic Verification of Automatic Verification of Automatic

Transcription: Pausing and Backtracking: Error Correction Mamata Sahoo and Stefan Klumpp Theory

Theoretical Biology 2016 Transcription factors bind DNA to block or enhance transcription

Transform. Innovate. How DevOps and Cloud are changing Engineering Melody Meckfessel

Melody Control Giacomo Driussi / Philipp Hoffmann Interaktion Engineering / WS 2018/2019 Prof.

Frances Willard Our SMART Journey Hello! Lance Clark: Principal Nicole Melody: Assistant

November 2019 Building Planning Team Melody Davis Building Principal Ad Hoc Committee

I nc ub a tio n: Phila de lphia s E me rg e nt Me ta lsmiths Elisabeth R. Agro Presented

SIZE OPTIMIZATION OF WIND PHOTOVOLTAIC SYSTEMS Dr. Fatih Onur HOCAO LU Afyon Kocatepe

Specification of APERTIF Polyphase Filter Bank in C aSH Rinse Wester a , Dimitrios Sarakiotis a

INTRODUCING INNOVATIVE RAINWATER HARVESTING TECHNOLOGIES TO IMPROVE ACCESS TO SAFE WATER IN

Filling the Performance Gap in Convolution Implementations for NVIDIA GPUs Antonio J. Pea,

Vertjcal Profjles of WaveCoherent Momentum Flux and Velocity Variances in the Marine

Hierarchical Dirichlet Processes Sharing Clusters Among Related Groups Dongruo Zhou 1 Difan Zou 2

Far-from-equilibrium dynamics of molecules in 4 He nanodroplets: a quasiparticle perspective

A RECY A RECYCL CLING ING SO SOCIETY CIETY TH THE E NATIO TIONAL AL WASTE WASTE PLA