Deep Learning Techniques for Music Generation Compound and GAN (6) - PowerPoint PPT Presentation

Deep Learning Techniques for Music Generation Compound and GAN (6) Jean-Pierre Briot Jean-Pierre.Briot@lip6.fr Laboratoire d’Informatique de Paris 6 (LIP6) Sorbonne Université – CNRS Programa de Pós-Graduação em Informática (PPGI) UNIRIO Deep Learning – Music Generation – 2019 Jean-Pierre Briot

Architectures Deep Learning – Music Generation – 2019 2 Jean-Pierre Briot

Architectures • Feedforward mini-bach.py • Autoencoder auto-bach.py – Variational Autoencoder (VAE) VRAE • Recurrent (RNN) – LSTM lstm.py, Celtic • Generative Adversarial Networks (GAN) • Restricted Boltzmann Machine (RBM) • Reinforcement Learning (RL) Deep Learning – Music Generation – 2019 3 Jean-Pierre Briot

Compound Architectures • Autoencoder Stack = Autoencoder n 784 400 – DeepHear, auto-bach.py 200 100 • Autoencoder(RNN, RNN) = RNN Encoder-Decoder – VRAE • RNN Variational Encoder-Decoder – Music-VAE Deep Learning – Music Generation – 2019 4 Jean-Pierre Briot

Generative Adversarial Networks (GAN) [Goodfellow et al., 2014] Fake Generator 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� 𝐸 𝐸 Real 𝜄 �𝐸� 𝐻 𝜄 �𝐸� 𝐻 or 𝜄 �� 𝜄 �� Fake ? ℝ � ℝ � ℝ � ℝ � ℝ � ℝ � Discriminator 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� 𝐸 𝜄 �𝐸� 𝐻 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� 𝜄 �� 𝐸 Real 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� ℝ � ℝ � 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� 𝜄 �𝐸� 𝐻 𝐸 ℝ � 𝜄 �� 𝐸 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� 𝜄 �𝐸� 𝐻 𝜄 �𝐸� ℝ � 𝐻 ℝ � 𝜄 �� 𝐸 Real Data Base 𝜄 �� ℝ � 𝜄 �𝐸� 𝐻 ℝ � ℝ � ℝ � ℝ � 𝜄 �� Deep Learning – Music Generation – 2019 5 Jean-Pierre Briot ℝ � ℝ � ℝ � ℝ � ℝ �

Generative Adversarial Networks (GAN) [Goodfellow et al., 2014] • Training Simultaneously 2 Neural Networks – Generator » Transforms Random noise Vectors into Faked Samples – Discriminator » Estimates probability that the Sample came from training data rather than from G – Minimax 2-player game Prediction by D D( x ): P D (x from real data) (Correct) D(G( z )): P D (G( z ) from real data) (Incorrect) 1 - D(G( z )): P D (G( z ) from Generator) (Correct) P=1 P=0 [Nam Hyuk Ahn, 2017] Deep Learning – Music Generation – 2019 6 Jean-Pierre Briot

GAN Equation • Binary Cross-Entropy: • H B (y, y) = - (y log y + (1-y) log (1-y)) • D (x) = 1 P D ( x from real data) Correct • H B ( D (x), D (x)) = - ( D (x) log D (x) + (1- D (x)) log (1- D (x))) • H B ( D (x), D (x)) = - log D (x) • D ( G (z)) = 0 P D (G( z ) from real data) Incorrect • H B ( D ( G (z)), D ( G (z))) = - ( D ( G (z)) log D ( G (z)) + (1- D ( G (z))) log (1- D (G(z)))) • H B ( D ( G (z)), D ( G (z))) = - log (1- D (G(z))) • H B ( D (x), D (x)) + H B ( D ( G (z)), D ( G (z))) = - (log D (x) + log (1- D (G(z)))) Deep Learning – Music Generation – 2019 7 Jean-Pierre Briot

GAN and Turing Test ar�i�� rendi�io 𝐸 𝐻�𝑨� or 𝐸 𝑦 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� Generator 𝐸 𝜄 �𝐸� 𝐻 𝜄 �� 𝐻 𝑨 or 𝑦 ℝ � ℝ � ℝ � 𝐻 𝑨, 𝜄 �� 𝐸 𝑦, 𝜄 �𝐸� Discriminator 𝐸 𝑨 𝜄 �𝐸� 𝐻 [Goodfellow, 2016] 𝜄 �� Deep Learning – Music Generation – 2019 8 Jean-Pierre Briot ℝ � ℝ � ℝ �

GAN Basic Training Algorithm Initialize � �� , � �� • • For � � 1: 𝑐: 𝑈 Initialize Δ� �� 0 • • For 𝑗 � �: � � 𝑐 � 1 • Sample � � ~ 𝑞�� 𝑙 • Compute 𝐸 𝐻 � � , 𝐸�� ← Compute gradient of Discriminator loss , 𝐾 � � � , � �� • Δ� � Δ� �� ← Δ� �� Δ� � � • Update � �� • 𝑙 � 1 Initialize Δ� �� 0 • • For 𝑘 � �: � � 𝑐 � 1 • Sample � � ~ 𝑞�� • Compute 𝐸 𝐻 � � , 𝐸�� ← Compute gradient of Generator loss, 𝐾 � � � , � �� • Δ� � Δ� �� ← Δ� �� Δ� � • � • Update � �� Deep Learning – Music Generation – 2019 9 Jean-Pierre Briot

Examples of GAN Generated Images [Brundage et al., 2018] Synthetic (Generated) Celebrity images CelebFaces Attributes Dataset (CelebA) > 200K celebrity images [Karras et al., 2018] Deep Learning – Music Generation – 2019 10 Jean-Pierre Briot

Using StyleGAN [Karras et al., 2018] [Xu, 2018] Deep Learning – Music Generation – 2019 11 Jean-Pierre Briot

C-RNN-GAN [Mogren, 2016] GAN(Bidirectional-LSTM 2 , LSTM 2 ) • Discriminator considers the hidden layers (forward and backward) values to be (or not) representative of the Real data – Analog to RNN Encoder-Decoder which considers the hidden layer as the summary of a sequence • Classical music Training Dataset Deep Learning – Music Generation – 2019 Jean-Pierre Briot 13

MidiNet [Yang et al., 2017] • Conditioning information – Previous measure – Chord sequence • Scope: – Previous measure (1D conditions) – Various previous measures (2D conditions) • Fine control: – Conditioning on previous measure 1D/2D and on chord sequence 1D/2D for one/all convolutional layers – Ex: previous measure 1D and on chord sequence 2D for all convolutional layers » Follows more chord sequence https://soundcloud.com/vgtsv6jf5fwq/model3 – Pop music dataset Deep Learning – Music Generation – 2019 Jean-Pierre Briot 14

GAN Examples – Celtic Melodies (500 Epochs) Deep Learning – Music Generation – 2019 15 Jean-Pierre Briot

GAN Examples – Celtic Melodies (5000 Epochs) Deep Learning – Music Generation – 2019 16 Jean-Pierre Briot

GAN Examples – Bach Chorales Deep Learning – Music Generation – 2019 17 Jean-Pierre Briot

GAN Mode Collapse (1/3) [Jonathan Hui, 2016] Deep Learning – Music Generation – 2019 18 Jean-Pierre Briot

GAN Mode Collapse (2/4) [Jonathan Hui, 2016] Corpus Conformance Variability Generator>Discriminator Discriminator>Generator Deep Learning – Music Generation – 2019 19 Jean-Pierre Briot

GAN Mode Collapse (2/4) Corpus Conformance Variability Generator>Discriminator Discriminator>Generator Deep Learning – Music Generation – 2019 20 Jean-Pierre Briot

GAN Mode Collapse (3/3) • G is trained extensively without sufficient updates to D • The generated samples will converge to find the optimal content x* that fools D the most, the most realistic sample from the discriminator perspective • In this extreme case (single point mode collapse), x* will be independent of z [Hui, 2018] • Approach: Constantly update D • Heuristic/Empirical approach • High hyperparameters sensivity Deep Learning – Music Generation – 2019 21 Jean-Pierre Briot

Explanation and Direction [Li, 2019] Generated samples move toward the closest boundary This ensures that each generated sample has a nearby data example But it does not ensure that each real data has a nearby generated sample [Li, 2019] Deep Learning – Music Generation – 2019 22 Jean-Pierre Briot

Implicit Maximum Likelihood Estimation (IMLE) [Li, 2019] 1) For each real data, what is the closest generated sample? 2) The generated sample moves toward that real data Deep Learning – Music Generation – 2019 23 Jean-Pierre Briot

VAE vs GAN • VAE (Variational Autoencoder) and GAN (Generative Adversarial Networks) Some Similarities: • Are both generative architectures • Generate from random latent variables [Dykeman, 2016] Differences: • VAE is representational of the whole training dataset • GAN is not • Smooth control interface for exploring latent data space • GAN has (ex: interpolation) but not as for VAE • GAN produces better quality content (ex: better resolution images) Deep Learning – Music Generation – 2019 24 Jean-Pierre Briot

Compound Architectures • Composition – Bidirectional RNN, combining two RNNs, forward and backward in time – RNN-RBM [Boulanger-Lewandowski et al., 2012], combining an RNN (horizontal/sequence) and an RBM (vertical/chords) • Refinement – Sparse autoencoder – Variational autoencoder (VAE) = Variational(Autoencoder) • Nested Stacked autoencoder = Autoencoder n – – RNN Encoder-Decoder = Autoencoder(RNN, RNN) • Pattern instantiation – C-RBM [Lattner et al., 2016] = Convolutional(RBM) C-RNN-GAN [Mogren, 2016] = GAN(Bidirectional-LSTM 2 , LSTM 2 ) – – Anticipation-RNN [Hadjeres & Nielsen, 2017] = Conditioning(RNN, RNN) Deep Learning – Music Generation – 2019 25 Jean-Pierre Briot

Deep Learning Techniques for Music Generation Compound and GAN (6) - PowerPoint PPT Presentation

Deep Learning Techniques for Music Generation Compound and GAN (6) Jean-Pierre Briot Jean-Pierre.Briot@lip6.fr Laboratoire dInformatique de Paris 6 (LIP6) Sorbonne Universit CNRS Programa de Ps-Graduao em Informtica (PPGI)

MUSIC THERAPY MUSIC THERAPY What is music therapy? Music therapy is simply the process of using

Deep Learning Techniques for Music Generation Compound and GAN (6) Jean-Pierre Briot

What does compound mean? Aron Ralston had a compound fracture in his arm before he severed

JEWISH MUSIC 101: WHAT IS JEWISH MUSIC? A PROGRAM OF THE LOWELL MILKEN FUND FOR AMERICAN JEWISH

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &

COMPOUND INTEREST FORMULA How to calculate compound interest Courtesy o of Ms.V .Veronica Mo

Deep Learning Techniques for Music Generation 3. Generation by Feedforward Architectures

Deep Learning Techniques for Music Generation 3. Generation by Feedforward Architectures

Music and Pain: A Music Therapy Perspective Deborah Salmon, MA, MTA, CMT BRAMS, Universit de

FOLK MUSIC AT KMH A presentation of the Folk Music Department at the Royal College of Music,

Music, Language and Computation Aline Honingh LoLaCo Guestlecture 2012 Outline Music at the

Music Generation Using Machine Learning Seminar Computer Music SS 2017 Michael Krause RWTH

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

A Musical Future Options for Studying Music at UWA Why choose Music at UWA? Music at UWA

Sample Complexity and Expressiveness Roi Livni and Yishay Mansour Discrimination A

Multi-objective training of Generative Adversarial Networks with multiple discriminators Isabela

Adversarial Methods Graham Neubig Site https://phontron.com/class/nn4nlp2017/ Generative Models

Radon control options for new home construction William J. Angell, Professor University of

Machine Learning Lecture 13: Generative Adversarial Networks (I) Nevin L. Zhang

CoFiGAN: Collaborative Filtering by Generative and Discriminative Training for One-Class

GANs + Final practice questions Lecture 23 CS 753 Instructor: Preethi Jyothi Final Exam

Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

Deep Learning Techniques for Music Generation Compound and GAN (6) - PowerPoint PPT Presentation

Deep Learning Techniques for Music Generation Compound and GAN (6) Jean-Pierre Briot Jean-Pierre.Briot@lip6.fr Laboratoire dInformatique de Paris 6 (LIP6) Sorbonne Universit CNRS Programa de Ps-Graduao em Informtica (PPGI)

MUSIC THERAPY MUSIC THERAPY What is music therapy? Music therapy is simply the process of using

Deep Learning Techniques for Music Generation Compound and GAN (6) Jean-Pierre Briot

What does compound mean? Aron Ralston had a compound fracture in his arm before he severed

JEWISH MUSIC 101: WHAT IS JEWISH MUSIC? A PROGRAM OF THE LOWELL MILKEN FUND FOR AMERICAN JEWISH

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &amp;

COMPOUND INTEREST FORMULA How to calculate compound interest Courtesy o of Ms.V .Veronica Mo

Deep Learning Techniques for Music Generation 3. Generation by Feedforward Architectures

Deep Learning Techniques for Music Generation 3. Generation by Feedforward Architectures

Music and Pain: A Music Therapy Perspective Deborah Salmon, MA, MTA, CMT BRAMS, Universit de

FOLK MUSIC AT KMH A presentation of the Folk Music Department at the Royal College of Music,

Music, Language and Computation Aline Honingh LoLaCo Guestlecture 2012 Outline Music at the

Music Generation Using Machine Learning Seminar Computer Music SS 2017 Michael Krause RWTH

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

A Musical Future Options for Studying Music at UWA Why choose Music at UWA? Music at UWA

Sample Complexity and Expressiveness Roi Livni and Yishay Mansour Discrimination A

Multi-objective training of Generative Adversarial Networks with multiple discriminators Isabela

Adversarial Methods Graham Neubig Site https://phontron.com/class/nn4nlp2017/ Generative Models

Radon control options for new home construction William J. Angell, Professor University of

Machine Learning Lecture 13: Generative Adversarial Networks (I) Nevin L. Zhang

CoFiGAN: Collaborative Filtering by Generative and Discriminative Training for One-Class

GANs + Final practice questions Lecture 23 CS 753 Instructor: Preethi Jyothi Final Exam

Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &