The journey to open-sourcing the code and models FOSDEM 2020 Anis - PowerPoint PPT Presentation

The journey to open-sourcing the code and models FOSDEM 2020 Anis Khlif, Félix Voituret

Who’s been involved Romain Hennequin - Lead research scientist Laure Prétet - Former intern Anis Khlif - Research Engineer Félix Voituret - Research Engineer Manuel Moussallam - Head of Deezer Research Spleeter by Deezer

What is it all about ? Spleeter by Deezer

Large impact on tech audience 9500+ stars on Github 200k+ views 100k+ read on deezer.io Spleeter by Deezer

Myth busting Deezer solved source separation Spleeter performs better than all other solutions Spleeter by Deezer

What did we bring ? State of the art Fast MIT Licensed Spleeter by Deezer

Primer on source separation

Waveform Time Primer on source separation

Time-frequency representation Primer on source separation

Magnitude spectrogram Frequencies Time Primer on source separation

Magnitude spectrogram Frequencies Harmonic content Time Primer on source separation

Magnitude spectrogram Frequencies In-harmonic Percussive content Time Primer on source separation

Magnitude spectrogram Frequencies Vocal content Time Primer on source separation

Magnitude spectrogram Learn a mask for each instrument ! What fraction of the energy at each time and each frequency bin should be assigned to this instrument. Primer on source separation

Magnitude spectrogram Primer on source separation

Spleeter models 2, 4 & 5 stems

A deep learning approach to mask prediction Vocal mask Deep learning model Instruments mask Spleeter models

4-stems Vocal mask Drums mask Deep learning Bass mask model Others mask Spleeter models

5-stems Vocal mask Drums mask Deep Bass mask learning model Piano mask Others mask Spleeter models

Quick introduction to TensorFlow Input Operation Output s s ● Build computation graph that represent a parametrized function Parameters (or weights ) can be modified (trained) to fit an optimization ● function A model can be run in any tensorflow environment ● Some graph operations can be run very efficiently on GPU ● Spleeter models

Quick introduction to TensorFlow Input Operation Output s s model = computation graph (network architecture) + weights (parameters) Spleeter models

Overview Voice - * L1 loss unet masks - * Instruments Spleeter models

Overview Example 1 Voice - * L1 loss unet masks - * Instruments Spleeter models

Overview Example 1 Parameter update Voice - * L1 loss unet masks - * Instruments Spleeter models

Overview Example 2 Voice - * L1 loss unet masks - * Instruments Spleeter models

Overview Example 2 Parameter update Voice - * L1 loss unet masks - * Instruments Spleeter models

Overview Example N Voice - * L1 loss unet masks - * Instruments Spleeter models

Overview Example N Parameter update Voice - * L1 loss unet masks - * Instruments Spleeter models

Dataset In-house dataset of tracks ~24k tracks with stems ~80 hours of recording … that we are not allowed to release! Spleeter models

BUT...

Training We can release learned weights 2 5 4 One mask per channel per source. ● 1 branch predicts masks for 1 source, with 2 channels ● ~10M parameters per branch ● Spleeter models

Open-sourcing Spleeter Packaging & distribution

Packaging constraints Predefined configurations On demand model downloading Oneliner command Open-sourcing Spleeter

Predefined configurations JSON formatted file ● Spleeter Mostly model related parameters ● Provided as ● Embedded configuration files ... File path ○ Configuration name ○ Open-sourcing Spleeter

Using GitHub releases as model hub Spleeter deezer/spleeter Open-sourcing Spleeter

Separate source from command line Separate with default 2stems configuration $ spleeter separate -i input_file.mp3 -o output_dir Separate with specific embedded configuration $ spleeter separate -i input_file.mp3 -o output_dir -p spleeter:4stems Open-sourcing Spleeter

Distribution constraints Critical dependencies to manage : FFmpe Spleeter CPU version g TensorFlow GPU version Cross platform ● Cross hardware ● User friendly ● Open-sourcing Spleeter

Distribution channels Open-sourcing Spleeter

Continuous integration and delivery Open-sourcing Spleeter

Legal considerations No Intellectual Property consensus on weights ● We decided to open-source model ● Open-sourcing Spleeter

Bibliography and references Industrial integrations : AconDigital plugins ● Various public web applications ● ~ 30 projects referenced as using Spleeter on GitHub Research publications : https://ieeexplore.ieee.org/document/8683555 ● ● http://archives.ismir.net/ismir2019/latebreaking/000036.pdf Open-sourcing Spleeter

Demo Spleeter Live

Thank you research@deezer.com

The journey to open-sourcing the code and models FOSDEM 2020 Anis - PowerPoint PPT Presentation

The journey to open-sourcing the code and models FOSDEM 2020 Anis Khlif, Flix Voituret Whos been involved Romain Hennequin - Lead research scientist Laure Prtet - Former intern Anis Khlif - Research Engineer Flix Voituret - Research

A Musical Journey A Musical Journey A Musical Journey A Musical Journey A Musical Journey A

Event Sourcing at Studyflow.nl Sourcing intro Event Sourcing architecture Joost Diepenmaat

Global Sourcing Local Solutions www.ncsourcing.com NC Sourcing Your industrial

Global Sourcing & Technology Changes: Reboot Your Sourcing Strategies May 8, 2014 1 Mayer

Using Social Media for Health Studies Ingmar Weber Social Computing, Qatar Computing Research

Since 2012 Sourcing from producing Information Sourcing countries Education Analyses

Event Sourcing Greg Young Event Sourcing says all state is transient and you only store facts.

EVERYONE IS APPROACHABLE: SOURCING TALENT IN THE 21st CENTURY Josef jos Kadlec Founder

Bazel { fast, correct } choose two Klaus Aehlig February 45, 2017 Bazel How Bazel Works

Use Case Study: Journey - Effects and Terrain Presented by Madis Janno Use Case Study: Journey -

Beyond Beyond Journey Journey Times Times Bluetooth journey time process Moving beyond basic

UNILEVERS JOURNEY IN SUSTAINABLE PALM OIL PERPETUA GEORGE, Sustainability Sourcing Manager,

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

monolith to micro-services? event sourcing can help Doug Client legacy Culture Amp

The Heros Journey The Heros Journey the hero's journey, is the common template of stories

Formal Analysis of the Entropy / Security Trade-off in First-Order Masking Countermeasures

A machine learning approach against a masked AES L. LERMAN , S. FERNANDES MEDEIROS, G. BONTEMPI,

1. procedure ONE TO ALL BC( d , my id , X ) 2. begin mask := 2 d 1; 3. /* Set all d bits of

Travel Sentiment Study Wave 16 JULY 21, 2020 COVID-19 TRAVEL SENTIMENT STUDY WAVE 16 Fielded

for Sound Object Initialization Xin Qi and Andrew C. Myers Cornell University Friday, June 3,

Image Masking Schemes for Local Manifold Learning Methods Marco F. Duarte Joint work with

An Approximate Perspective on Word Prediction in Context: Ontological Semantics meets BERT

How to hand wash How to hand rub COVID-19 personal protective equipment when working in health

The journey to open-sourcing the code and models FOSDEM 2020 Anis - PowerPoint PPT Presentation

The journey to open-sourcing the code and models FOSDEM 2020 Anis Khlif, Flix Voituret Whos been involved Romain Hennequin - Lead research scientist Laure Prtet - Former intern Anis Khlif - Research Engineer Flix Voituret - Research

A Musical Journey A Musical Journey A Musical Journey A Musical Journey A Musical Journey A

Event Sourcing at Studyflow.nl Sourcing intro Event Sourcing architecture Joost Diepenmaat

Global Sourcing Local Solutions www.ncsourcing.com NC Sourcing Your industrial

Global Sourcing &amp; Technology Changes: Reboot Your Sourcing Strategies May 8, 2014 1 Mayer

Using Social Media for Health Studies Ingmar Weber Social Computing, Qatar Computing Research

Since 2012 Sourcing from producing Information Sourcing countries Education Analyses

Event Sourcing Greg Young Event Sourcing says all state is transient and you only store facts.

EVERYONE IS APPROACHABLE: SOURCING TALENT IN THE 21st CENTURY Josef jos Kadlec Founder

Bazel { fast, correct } choose two Klaus Aehlig February 45, 2017 Bazel How Bazel Works

Use Case Study: Journey - Effects and Terrain Presented by Madis Janno Use Case Study: Journey -

Beyond Beyond Journey Journey Times Times Bluetooth journey time process Moving beyond basic

UNILEVERS JOURNEY IN SUSTAINABLE PALM OIL PERPETUA GEORGE, Sustainability Sourcing Manager,

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

monolith to micro-services? event sourcing can help Doug Client legacy Culture Amp

The Heros Journey The Heros Journey the hero's journey, is the common template of stories

Formal Analysis of the Entropy / Security Trade-off in First-Order Masking Countermeasures

A machine learning approach against a masked AES L. LERMAN , S. FERNANDES MEDEIROS, G. BONTEMPI,

1. procedure ONE TO ALL BC( d , my id , X ) 2. begin mask := 2 d 1; 3. /* Set all d bits of

Travel Sentiment Study Wave 16 JULY 21, 2020 COVID-19 TRAVEL SENTIMENT STUDY WAVE 16 Fielded

for Sound Object Initialization Xin Qi and Andrew C. Myers Cornell University Friday, June 3,

Image Masking Schemes for Local Manifold Learning Methods Marco F. Duarte Joint work with

An Approximate Perspective on Word Prediction in Context: Ontological Semantics meets BERT

How to hand wash How to hand rub COVID-19 personal protective equipment when working in health

Global Sourcing & Technology Changes: Reboot Your Sourcing Strategies May 8, 2014 1 Mayer