Semantic Image Analogy with a Conditional Single-Image GAN Ji a - - PowerPoint PPT Presentation

semantic image analogy with a conditional single image gan
SMART_READER_LITE
LIVE PREVIEW

Semantic Image Analogy with a Conditional Single-Image GAN Ji a - - PowerPoint PPT Presentation

Semantic Image Analogy with a Conditional Single-Image GAN Ji a cheng Li , Zhiwei Xiong, Dong Liu, Xuejin Chen, Zheng-Jun Zh a ACM MM 2020 P P analogous I I Image Analogy A : A :: B : B : :: : :: A A A A :


slide-1
SLIDE 1

Semantic Image Analogy with a Conditional Single-Image GAN

Jiacheng Li, Zhiwei Xiong, Dong Liu, Xuejin Chen, Zheng-Jun Zha ACM MM 2020

analogous

I ⇒ I′ P ⇒ P′

slide-2
SLIDE 2
  • A. Hertzmann, et al. 2001. Image analogies. ACM Trans. Graph.

A : A′ :: B : B′

: :: :

A B B′ A′

: :: :

A B B′ A′

Image Analogy

slide-3
SLIDE 3

: :: :

A B B′ A′

Image Analogy

A : A′ :: B : B′

slide-4
SLIDE 4

::

P P′ I′ I

P ⇒ P′ :: I ⇒ I′

Semantic Image Analogy

⇒ ⇒

Segmentation Domain Image Domain

slide-5
SLIDE 5

analogous

I ⇒ I′ P ⇒ P′ P

Semantic Image Analogy

P ⇒ P′ :: I ⇒ I′

P I

slide-6
SLIDE 6
  • T. Park, et al. 2019. Semantic Image Synthesis With Spatially-Adaptive Normalization. In CVPR.
  • T. Shaham, et al. 2019. SinGAN: Learning a Generative Model From a Single Natural Image. In ICCV.

Conditional GANs Single-Image GANs

Semantic Image Synthesis Retargeting Super-Resolution Unconditional Sampling …

In-the-wild Images ADE20k Cityscapes COCO CelebA …

slide-7
SLIDE 7
  • T. Park, et al. 2019. Semantic Image Synthesis With Spatially-Adaptive Normalization. In CVPR.
  • T. Shaham, et al. 2019. SinGAN: Learning a Generative Model From a Single Natural Image. In ICCV.

Conditional GANs Single-Image GANs

Semantic Image Synthesis Retargeting Super-Resolution Unconditional Sampling …

In-the-wild Images ADE20k Cityscapes COCO CelebA …

Can we achieve the best from both worlds?

slide-8
SLIDE 8

Conditional Single-Image GAN

Can we achieve the best from both worlds?

Self-Supervised Training Semantic Feature Translation (SFT) Loss Terms

slide-9
SLIDE 9

Self-Supervised Learning: Alternating Optimization

Psource ⇒ Paug :: Isource ⇒ Iaug

⇒ ⇒ ⇒ ⇒

Psource ⇒ Psource :: Isource ⇒ Isource

Sampling Mode Reconstruction Mode

slide-10
SLIDE 10

share weights

Eseg Faug Fsource

(γseg, βseg)

Self-Supervised Learning: Reconstruction Mode

⇒ ⇒

Psource Isource

G SFT

(γimg, βimg)

Psource ⇒ Psource :: Isource ⇒ Isource

Psource Isource

slide-11
SLIDE 11

SFT block SFT block

Semantic Feature Translation (SFT) Module

Fl

source

Fl

aug

Segmentation Features

βl

img

γl

img

Transformation Parameters

Fl

img

Image Features

γl

seg ≈ Fl scale

βl

seg ≈ Fl shift

Fl

scale =

Fl

aug

Fl

source

Fl

shift = Fl aug − Fl source

Transformation Parameters Linear Linear

slide-12
SLIDE 12

Paug Psource Isource Itarget

share weights

Eseg

G SFT

Faug Fsource

(γimg, βimg) (γseg, βseg)

Loss Terms

homogeneous appearance

slide-13
SLIDE 13

Paug Psource Isource Itarget

share weights

Eseg

G SFT

Faug Fsource

(γimg, βimg) (γseg, βseg)

Loss Terms

aligned semantic layout homogeneous appearance

slide-14
SLIDE 14

Paug Psource Isource Itarget

share weights

Eseg

G SFT

Faug Fsource

(γimg, βimg) (γseg, βseg)

Loss Terms

Patch Coherence Loss

Isource Itarget

1 N ∑

V⊂Itarget

min

U⊂Isource

d(V, U)

V U

slide-15
SLIDE 15

Paug Psource Isource Itarget

share weights

Eseg

G SFT

Semantic Alignment Loss

Iaug

GAN Loss

D

Feature Matching Loss

Real/Fake Fake Real

Faug Fsource

(γimg, βimg) Segmentation Network

S

Ppredict

(γseg, βseg)

Loss Terms

Patch Coherence Loss

slide-16
SLIDE 16

Psource Isource Itarget

share weights

Eseg

G SFT

Faug Fsource

(γimg, βimg) (γseg, βseg)

Loss Terms

Reconstruction Loss

Paug

γimg → 1 βimg → 0

Fixed-Point Loss

slide-17
SLIDE 17

Psource Isource Itarget

share weights

Eseg

G SFT

Faug Fsource

(γimg, βimg) (γseg, βseg)

Loss Terms

Reconstruction Loss

Paug Isource

GAN Loss

D

Real/Fake Fake Real γimg → 1 βimg → 0

Fixed-Point Loss

slide-18
SLIDE 18

Evaluation

slide-19
SLIDE 19

User Study Interface

Pleas rank A, B and C by appearance similarity with the left side image.

  • A. Hertzmann, et al. 2001. Image analogies. ACM Trans. Graph.
  • J. Liao, et al. 2001. Visual attribute transfer through deep image analogy. ACM Trans. Graph.
slide-20
SLIDE 20

Quantitative Comparisons

15 30 45 60 Mean IOU Pixel-wise Accuracy IA DIA Ours IA DIA Ours 0% 25% 50% 75% 100% Rank #1 Rank #2 Rank #3

  • A. Hertzmann, et al. 2001. Image analogies. ACM Trans. Graph.
  • J. Liao, et al. 2001. Visual attribute transfer through deep image analogy. ACM Trans. Graph.
slide-21
SLIDE 21

Source Target DIA IA Target Layout Ours

Comparisons with Previous Image Analogies

  • A. Hertzmann, et al. 2001. Image analogies. ACM Trans. Graph.
  • J. Liao, et al. 2001. Visual attribute transfer through deep image analogy. ACM Trans. Graph.
slide-22
SLIDE 22

Ours Source SinGAN IA Target Layout Edited Source Ours

  • A. Hertzmann, et al. 2001. Image analogies. ACM Trans. Graph.
  • T. Shaham, et al. 2019. SinGAN: Learning a Generative Model From a Single Natural Image. In ICCV.

Comparisons with Single-Image GANs

slide-23
SLIDE 23

Source Ours SPADE IA Target Layout

  • A. Hertzmann, et al. 2001. Image analogies. ACM Trans. Graph.
  • T. Park, et al. 2019. Semantic Image Synthesis With Spatially-Adaptive Normalization. In CVPR.

Comparisons with Conditional GANs

slide-24
SLIDE 24

Source Target #3 Target #1 Target #2

Semantic Manipulation Results

slide-25
SLIDE 25

Applications

slide-26
SLIDE 26

Isource Psource Ptarget Itarget

Object Removal Results

slide-27
SLIDE 27

Source Target #1 Target #3 Target #2

Face Editing Results

slide-28
SLIDE 28

Isource Psource Ptarget Itarget

Sketch-to-Image Synthesis Results

slide-29
SLIDE 29

Isource Psource Ptarget Itarget

Failure Cases

slide-30
SLIDE 30

Thank you!

analogous

I ⇒ I′ P ⇒ P′