Costa Colbert
GAN Fashion Photo Shoot: Garment to Model Images Using Conditional GANs.
Costa M. Colbert, Chief Scientist MAD Street Den Inc.
GAN Fashion Photo Shoot: Garment to Model Images Using Conditional - - PowerPoint PPT Presentation
GAN Fashion Photo Shoot: Garment to Model Images Using Conditional GANs. Costa M. Colbert, Chief Scientist MAD Street Den Inc. Costa Colbert Studies show higher purchase rates when clothing is shown on human figures. Li Live e model el
Costa Colbert
Costa M. Colbert, Chief Scientist MAD Street Den Inc.
and styling
items)
If quality and control of images is sufficient...
Garment images Generated images
Catalog photo Garment image Generated image Generated image
Catalog photo Garment image Generated image Generated image
Catalog photo Garment image Generated image Generated image
Varying pose
Real / Fake Discriminator Training Dataset providing real samples x Samples z from prior distribution e.g. N(0,1) Generator z G(z) approximates a sample from x D(x) decides if sample is from x
Also use L1 reconstruction loss term: abs(G(z)-x)
Generator CNN Discriminator CNN
Real or Fake ? G(z) approximates a sample from x Training Dataset providing real samples x ~ X Samples z from prior distribution e.g., garments, pose, other labels D(x, garment) decides if sample is from x, also requiring correct garment
Discriminator CNN
Input is Model Image concatenated to Garment Image (Patch GAN, Isola et al. 2016) Real/Fake is determined by
patches of limited extent. 6 CNN layers Convolution Instance Normalization Dropout
Hmm.., maybe that global discriminator term wasn’t such a bad idea..
Latent Vector 4x3x512 Encoder Decoder Pose Garment Image Encoder 6 CNN layers Convolution Instance Normalization Adam Optimizer Fashion Model Image Decoder 6 CNN layers Deconvolution/Unpool Dropout Instance Normalization GTX1080ti 2-4 GB
Pose Interpolation
Latent Vector 512x4x3 Encoder Decoder
Enc Dec Enc Dec
X1 X2 LV2 LV1 X1garment X2garment
Enc Dec Enc Dec
X1garment X2garment X1 F(LV1,LV2) LV2
Dec
XFi,n LV1 Fi,n(x,y) = x + (y-x)*i/n X2
Latent Variable Interpolation Shoes Neckline Hemline
Latent Variable Interpolation Hemline
Latent Variable Interpolation Color Background Note sleeves.
Enc Dec
X1garment X1 F(LV1,i)
Dec
XF,
i
LV1 Latent Vector 512x4x3
PCA is a dimension-reduction tool that can reduce a large set
information in the large set. PCA transforms a number of (possibly) correlated variables into a (smaller) number of uncorrelated variables (principal components). PCA determines the new dimensions on the basis of variance.
Use PCA to go from 512x4x3 (~6k) dimensions to 100. LV6k PCA LV100 Choose an entry, scale by +/- 10 PCA LV6k Inv(PCA) LV100
Dec
XF,
i
PCA Latent Variable Interpolation Skin color Model build
PCA Latent Variable Interpolation Shoes
Conditional GAN’s are well-suited for image generation in well-defined domains. Good enough for the casual observer not to notice. GAN’s have many “moving parts,” but we are getting better at using them. More work needed on accessories, choosing specific shoes, handbags, etc. Requires more thought on implementing conditioning labels.
A big thanks to Preferred Networks
l support@madstreetden.com