Adversarial Generation of Time-Frequency Features with application - - PowerPoint PPT Presentation

adversarial generation of time frequency features
SMART_READER_LITE
LIVE PREVIEW

Adversarial Generation of Time-Frequency Features with application - - PowerPoint PPT Presentation

Adversarial Generation of Time-Frequency Features with application in audio synthesis Speaker: Andr es Marafioti Co-Authors: Nathana el Perraudin, Nicki Holighaus, Piotr Majdak Acoustics Research Institute, Vienna Austrian Academy of


slide-1
SLIDE 1

Adversarial Generation of Time-Frequency Features

with application in audio synthesis

Speaker: Andr´ es Marafioti Co-Authors: Nathana¨ el Perraudin, Nicki Holighaus, Piotr Majdak

Acoustics Research Institute, Vienna Austrian Academy of Sciences

International Conference on Machine Learning Long Beach, California, June 11th, 2019

slide-2
SLIDE 2

Time to time-frequency

Marafioti (ARI) Adversarial Generation of TF Features ARI 2 / 6

slide-3
SLIDE 3

Time-frequency to time

Marafioti (ARI) Adversarial Generation of TF Features ARI 3 / 6

slide-4
SLIDE 4

Is it consistent?

Marafioti (ARI) Adversarial Generation of TF Features ARI 4 / 6

slide-5
SLIDE 5

Applied to GANs

Marafioti (ARI) Adversarial Generation of TF Features ARI 5 / 6

slide-6
SLIDE 6

Evaluation

We trained on a dataset of spoken English digits [0-9]. We evaluated our results with perceptual tests. Audio examples and implementations are available at tifgan.github.io WaveGAN digits TiFGAN-M digits vs TiFGAN vs WaveGAN Real 86% 94% TiFGAN – 75% WaveGAN 25% – Thank you for your attention!

Supported by the Austrian Science Fund (FWF; MERLIN, I 3067-N30).

Marafioti (ARI) Adversarial Generation of TF Features ARI 6 / 6