for Scalable Joint Distribution Matching Ziliang Chen *, Zhanfu - - PowerPoint PPT Presentation

for scalable joint distribution matching
SMART_READER_LITE
LIVE PREVIEW

for Scalable Joint Distribution Matching Ziliang Chen *, Zhanfu - - PowerPoint PPT Presentation

Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching Ziliang Chen *, Zhanfu Yang*, Xiaoxi Wang*, Xiaodan Liang, Xiaopeng Yan, Guanbin Li, Liang Lin Sun Yat-sen University, Purdue University Motivation Implicit


slide-1
SLIDE 1

Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching

Ziliang Chen*, Zhanfu Yang*, Xiaoxi Wang*, Xiaodan Liang, Xiaopeng Yan, Guanbin Li, Liang Lin Sun Yat-sen University, Purdue University

slide-2
SLIDE 2

Motivation

Implicit Generative Models (IGM), e.g., GAN [1] and CycleGAN [2], boil down to an

  • domain joint distribution matching (JDM):

[1] I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, Generative Adversarial Nets. Proceedings Neural Information Processing Systems Conference, 2014

[2] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. ArXiv, 2017

slide-3
SLIDE 3

Motivation

[1] I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, Generative Adversarial Nets. Proceedings Neural Information Processing Systems Conference, 2014

[2] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. ArXiv, 2017

However, if > 2, Implicit Generative Models (IGM), e.g., GAN [1] and CycleGAN [2], boil down to an

  • domain joint distribution matching (JDM):
slide-4
SLIDE 4

Motivation

However, if > 2,

Solution 1: CycleGAN, JointGAN[3] They suffer combinatorial explosion in their parameters Solution 2: StarGAN [4] and its variants The domain-shared model lacks theoretical support, fragile in model collapse

[1] I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, Generative Adversarial Nets. Proceedings Neural Information Processing Systems Conference, 2014

[2] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. ArXiv, 2017 [4] Choi Y, Choi M, Kim M, et al. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. CVPR 2018. [3] Pu Y, Dai S, Gan Z, et al. JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets ICML. 2018.

Implicit Generative Models (IGM), e.g., GAN [1] and CycleGAN [2], boil down to an

  • domain joint distribution matching (JDM):
slide-5
SLIDE 5

Solution 3: ALI Ensemble

Adversarially Learned Inference (ALI) Model [5]

[5] Dumoulin, V., Belghazi, I., Poole, B., Mastropietro, O., Lamb, A., Arjovsky, M., and Courville, A. Adversarially learned inference. arXiv preprint arXiv:1606.00704, 2016.

slide-6
SLIDE 6

Solution 3: ALI Ensemble

ALI ensemble across domains Advantages:

(1). Linear-parameter scalability as increases. (2). Generative model capability:

slide-7
SLIDE 7

Solution 3: ALI Ensemble

ALI ensemble across domains Advantages:

(1). Linear-parameter scalability as increases. (2). Generative model capability:

slide-8
SLIDE 8

JDM Criteria

In supervised learning, data are drawn from , each of them presenting as –tuple. So the criterion can be written as In unsupervised learning, no access is provided to draw -tuple from . Extending the observation from [6], the criterion is to minimize the conditional entropy

[6] Li, C., Liu, H., Chen, C., Pu, Y., Chen, L., Henao, R., and Carin, L. Alice: Towards understanding adversarial learning for joint distribution matching. In Advances in Neural Information Processing Systems, pp. 5501–5509, 2017.

slide-9
SLIDE 9

Multivariate Mutual Information

Mutual Information: Multivariate Mutual Information: Our method aims to achieve:

slide-10
SLIDE 10

Multivariate Mutual Information Leads To JDM

slide-11
SLIDE 11

Multivariate Mutual Information Leads To JDM

slide-12
SLIDE 12

Adversarial Ensemble Learning

s.t.

slide-13
SLIDE 13

Experiments

Groundtruth Generation

slide-14
SLIDE 14

Experiments

slide-15
SLIDE 15

Experiments

Supervised Learning Unsupervised Learning

slide-16
SLIDE 16

Thank You!

Collaborators

Zhanfu Yang Xiaoxi Wang Xiaodan Liang Xiaopeng Yan Guanbin Li Liang Lin