Semantic Image Analogy with a Conditional Single-Image GAN
Jiacheng Li, Zhiwei Xiong, Dong Liu, Xuejin Chen, Zheng-Jun Zha ACM MM 2020
analogous
I ⇒ I′ P ⇒ P′
Semantic Image Analogy with a Conditional Single-Image GAN Ji a - - PowerPoint PPT Presentation
Semantic Image Analogy with a Conditional Single-Image GAN Ji a cheng Li , Zhiwei Xiong, Dong Liu, Xuejin Chen, Zheng-Jun Zh a ACM MM 2020 P P analogous I I Image Analogy A : A :: B : B : :: : :: A A A A :
Jiacheng Li, Zhiwei Xiong, Dong Liu, Xuejin Chen, Zheng-Jun Zha ACM MM 2020
analogous
I ⇒ I′ P ⇒ P′
: :: :
: :: :
: :: :
::
Segmentation Domain Image Domain
analogous
I ⇒ I′ P ⇒ P′ P
P I
In-the-wild Images ADE20k Cityscapes COCO CelebA …
In-the-wild Images ADE20k Cityscapes COCO CelebA …
Psource ⇒ Paug :: Isource ⇒ Iaug
Psource ⇒ Psource :: Isource ⇒ Isource
share weights
Eseg Faug Fsource
(γseg, βseg)
Psource Isource
G SFT
(γimg, βimg)
Psource ⇒ Psource :: Isource ⇒ Isource
Psource Isource
SFT block SFT block
Fl
source
Fl
aug
Segmentation Features
βl
img
γl
img
Transformation Parameters
Fl
img
Image Features
γl
seg ≈ Fl scale
βl
seg ≈ Fl shift
Fl
scale =
Fl
aug
Fl
source
Fl
shift = Fl aug − Fl source
Transformation Parameters Linear Linear
Paug Psource Isource Itarget
share weights
Eseg
G SFT
Faug Fsource
(γimg, βimg) (γseg, βseg)
homogeneous appearance
Paug Psource Isource Itarget
share weights
Eseg
G SFT
Faug Fsource
(γimg, βimg) (γseg, βseg)
aligned semantic layout homogeneous appearance
Paug Psource Isource Itarget
share weights
Eseg
G SFT
Faug Fsource
(γimg, βimg) (γseg, βseg)
Patch Coherence Loss
Isource Itarget
1 N ∑
V⊂Itarget
min
U⊂Isource
d(V, U)
V U
Paug Psource Isource Itarget
share weights
Eseg
G SFT
Semantic Alignment Loss
Iaug
GAN Loss
D
Feature Matching Loss
Real/Fake Fake Real
Faug Fsource
(γimg, βimg) Segmentation Network
S
Ppredict
(γseg, βseg)
Patch Coherence Loss
Psource Isource Itarget
share weights
Eseg
G SFT
Faug Fsource
(γimg, βimg) (γseg, βseg)
Reconstruction Loss
Paug
γimg → 1 βimg → 0
Fixed-Point Loss
Psource Isource Itarget
share weights
Eseg
G SFT
Faug Fsource
(γimg, βimg) (γseg, βseg)
Reconstruction Loss
Paug Isource
GAN Loss
D
Real/Fake Fake Real γimg → 1 βimg → 0
Fixed-Point Loss
Pleas rank A, B and C by appearance similarity with the left side image.
15 30 45 60 Mean IOU Pixel-wise Accuracy IA DIA Ours IA DIA Ours 0% 25% 50% 75% 100% Rank #1 Rank #2 Rank #3
Source Target DIA IA Target Layout Ours
Ours Source SinGAN IA Target Layout Edited Source Ours
Source Ours SPADE IA Target Layout
Source Target #3 Target #1 Target #2
Isource Psource Ptarget Itarget
Source Target #1 Target #3 Target #2
Isource Psource Ptarget Itarget
Isource Psource Ptarget Itarget
analogous
I ⇒ I′ P ⇒ P′