Non-photorealistic Rendering of Images: from Image Processing to Machine Learning. Dr. Yu-Kun Lai, Cardiff University


  1. Non-photorealistic Rendering of Images: from Image Processing to Machine Learning. Dr. Yu-Kun Lai, Cardiff University. GAMES webinar, 4/7/2019.

  2. Non-Photorealistic Rendering. A well-researched topic that mimics artistic styles using computer algorithms. Figure labels: photo, cartoon, painting, pen & ink.

  3. Non-Photorealistic Rendering: Applications. Technical illustrations; scientific visualisation.

  4. Our Work: Traditional Methods. Dedicated algorithms are developed to create specific styles. References: Rosin & Lai, Computational Aesthetics, 2013; Rosin & Lai, Graphical Models, 2013; Rosin & Lai, Computational Aesthetics, 2015; Lai & Rosin, IEEE Trans. Image Processing, 2014.

  5. Artistic Minimal Rendering. Key observations: using a small number of tones; simple primitives (lines and tonal blocks). Figure labels: portrait by Andy Warhol; input; single-scale; multi-scale. Reference: Artistic minimal rendering with lines and blocks, Graphical Models, 2013.

  6. Artistic Minimal Rendering: Pipeline

  7. Artistic Minimal Rendering: 3-tone lines. Figure labels: input; lines only (2-tone); 3-tone; posterised into 3 levels.
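
As a rough illustration of posterising into 3 levels, a greyscale image can be quantised into black, mid-grey and white using two thresholds. This is a minimal sketch, not the method from the paper (the slide gives no details); the function name and the threshold values `low` and `high` are hypothetical.

```python
def posterise_3_tone(image, low=85, high=170):
    """Quantise 8-bit grey values into three tones: 0, 128 or 255.

    `low` and `high` are illustrative thresholds, not values from the talk.
    """
    def tone(value):
        if value < low:
            return 0      # dark tone
        if value < high:
            return 128    # mid tone
        return 255        # light tone

    return [[tone(value) for value in row] for row in image]


image = [[10, 90, 200],
         [84, 85, 170]]
print(posterise_3_tone(image))  # [[0, 128, 255], [0, 128, 255]]
```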

  8. Artistic Minimal Rendering: Tonal Blocks

  9. Artistic Minimal Rendering: Tone Overlay

  10. Results and Extensions

  11. Results and Extensions

  12. GAN-based Image Synthesis: Pix2Pix, CycleGAN.

  13. CartoonGAN. We address cartoon stylisation of images using unpaired content and style image sets. Training data are easy to obtain. Content: normal photos from Flickr. Style: key frames from cartoon films. Reference: CartoonGAN: Generative Adversarial Networks for Photo Cartoonization. CVPR, 2018.

  14. CartoonGAN: architecture. Generator: similar to [Johnson et al. 2016]. Discriminator: differentiates real and synthesised images.

  15. CartoonGAN: content loss. Use L1 instead of L2 to cope with large local differences (recover flat shading).
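
To see why L1 copes better with large local differences, the sketch below compares mean absolute error with mean squared error on two short lists of values that differ at a single position. The lists stand in for whatever features the content loss is computed on; the numbers and function names are made up for illustration.

```python
def l1_loss(a, b):
    """Mean absolute difference between two equal-length sequences."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def l2_loss(a, b):
    """Mean squared difference between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


photo_features = [0.0, 0.0, 0.0, 0.0]
cartoon_features = [0.0, 0.0, 0.0, 4.0]  # one large local difference

print(l1_loss(photo_features, cartoon_features))  # 1.0
print(l2_loss(photo_features, cartoon_features))  # 4.0
```

The single large difference is squared under L2, so it dominates the loss far more than under L1.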

  16. CartoonGAN: adversarial loss. Edges are often lost since they cover only a small number of pixels. We add an edge-promoting term to the adversarial loss by penalising cartoon images whose edges have been smoothed.
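
One way to read the edge-promoting term is as a third group of samples in the usual GAN log-likelihood objective: real cartoon images count as real, while both edge-smoothed cartoon images and generated images count as fake. The sketch below assumes that form; the function and argument names are hypothetical, and the inputs are discriminator output probabilities strictly between 0 and 1.

```python
import math


def mean(values):
    return sum(values) / len(values)


def edge_promoting_adv_loss(d_cartoon, d_smoothed, d_generated):
    """Adversarial objective that the discriminator tries to maximise.

    d_cartoon:   D outputs on real cartoon images (should be near 1)
    d_smoothed:  D outputs on edge-smoothed cartoon images (should be near 0)
    d_generated: D outputs on generator results (should be near 0)
    """
    real_term = mean([math.log(p) for p in d_cartoon])
    edge_term = mean([math.log(1.0 - p) for p in d_smoothed])
    fake_term = mean([math.log(1.0 - p) for p in d_generated])
    return real_term + edge_term + fake_term


# A discriminator that rejects edge-smoothed cartoons scores higher
# than one that accepts them.
rejects_smoothed = edge_promoting_adv_loss([0.9], [0.1], [0.1])
accepts_smoothed = edge_promoting_adv_loss([0.9], [0.9], [0.1])
print(rejects_smoothed > accepts_smoothed)  # True
```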

  17. CartoonGAN: initialisation. Traditional random initialisation does not give good results. To prevent the GAN model from getting stuck at poor local minima, we start with a generator training phase that only aims to reconstruct the content of the input images.
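
The initialisation idea can be sketched as a two-phase schedule for the generator loss: content reconstruction only at first, then the full objective. The phase length `init_epochs`, the weight `content_weight` and the way the two losses are combined are assumptions made for this example, not values taken from the slides.

```python
def generator_loss(epoch, content_loss, adversarial_loss,
                   init_epochs=10, content_weight=10.0):
    """Loss used to update the generator at a given epoch.

    During the first `init_epochs` epochs only the content loss is used,
    so the generator learns to reconstruct its input before adversarial
    training starts. All default values are hypothetical.
    """
    if epoch < init_epochs:
        return content_loss
    return adversarial_loss + content_weight * content_loss


print(generator_loss(epoch=0, content_loss=0.5, adversarial_loss=2.0))   # 0.5
print(generator_loss(epoch=10, content_loss=0.5, adversarial_loss=2.0))  # 7.0
```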

  18. CartoonGAN: Results

  19. CartoonGAN: Results

  20. APDrawingGAN. Portrait drawings are a longstanding and distinct art form, which typically uses a sparse set of continuous graphical elements (e.g., lines) to capture the distinctive appearance of a person. Reference: APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs. CVPR, 2019 (oral).

  21. APDrawingGAN: Challenges. Artistic portrait drawings (APDrawings) are substantially different from the painting styles studied in previous work: highly abstract, with sparse but continuous graphical elements; stronger semantic constraints for portraits; different rendering for different facial parts; elements not located precisely by artists; conceptual lines not directly related to low-level features.

  22. APDrawingGAN: Challenges

  23. APDrawingGAN: Method. Highlights: hierarchical structure; novel distance transform (DT) loss; a new APDrawing dataset.

  24. APDrawingGAN: Hierarchical Structure. We propose a hierarchical structure for both the generator and the discriminator, each of which includes a global network and six local networks.

  25. APDrawingGAN: Hierarchical Generator G

  26. APDrawingGAN: Hierarchical Discriminator D

  27. APDrawingGAN: Loss Function. There are four terms in the loss function: adversarial loss, pixel-wise loss, distance transform loss and local transfer loss. $L(G, D) = L_{adv}(G, D) + \lambda_1 L_{\mathcal{L}_1}(G, D) + \lambda_2 L_{DT}(G, D) + \lambda_3 L_{local}(G, D)$
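
The four terms on this slide combine as a weighted sum, with the adversarial term unweighted and one weight for each of the other three. A minimal sketch follows; the function name, argument names and default weight values are placeholders, since the slide does not state the weights.

```python
def total_loss(l_adv, l_l1, l_dt, l_local,
               lambda1=1.0, lambda2=1.0, lambda3=1.0):
    """Weighted sum of adversarial, pixel-wise (L1), distance transform
    and local transfer losses. The default weights are placeholders."""
    return l_adv + lambda1 * l_l1 + lambda2 * l_dt + lambda3 * l_local


# Example with made-up loss values and weights.
print(total_loss(1.0, 2.0, 3.0, 4.0, lambda1=0.5, lambda2=2.0, lambda3=0.25))  # 9.0
```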

  28. APDrawingGAN: Distance Transform (DT) loss. Motivation: elements in APDrawings are not located precisely. Small misalignments exist! L1 loss: penalises minor misalignments; treats small misalignments and big misalignments as the same… DT loss: based on distance; tolerates small misalignments; penalises big misalignments.
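
The contrast between the two losses can be reproduced on a toy example. The sketch below is a simplified, chamfer-style stand-in for a distance-based loss, not the formulation from the paper: it uses a brute-force nearest-pixel search instead of a precomputed distance transform, and it assumes binary images that each contain at least one line pixel. All names are hypothetical.

```python
import math


def line_pixels(img):
    """Coordinates of line pixels (value 1) in a binary image."""
    return [(r, c) for r, row in enumerate(img)
            for c, v in enumerate(row) if v == 1]


def one_way_distance(src, dst):
    """Mean distance from each line pixel in src to the nearest one in dst."""
    src_px, dst_px = line_pixels(src), line_pixels(dst)
    return sum(min(math.dist(p, q) for q in dst_px) for p in src_px) / len(src_px)


def dt_loss(a, b):
    """Symmetric distance-based loss between two line drawings."""
    return one_way_distance(a, b) + one_way_distance(b, a)


def l1_loss(a, b):
    """Sum of absolute pixel differences."""
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def vertical_line(col, size=6):
    """A size x size image with a vertical line in the given column."""
    return [[1 if c == col else 0 for c in range(size)] for _ in range(size)]


target = vertical_line(1)
near = vertical_line(2)  # line misaligned by 1 pixel
far = vertical_line(5)   # line misaligned by 4 pixels

print(l1_loss(target, near), l1_loss(target, far))  # 12 12
print(dt_loss(target, near), dt_loss(target, far))  # 2.0 8.0
```

L1 gives the same value for both misalignments, while the distance-based loss grows with the size of the shift.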

  29. APDrawingGAN: Dataset. We build an artistic portrait drawing dataset containing 140 pairs of high-resolution portrait photos and corresponding professional artistic drawings.

  30. APDrawingGAN: Pre-training. 6655 pairs of face photos and NPR results are used for pre-training. Figure labels: face photo; NPR result; NPR result; ours with jaw line.

  31. APDrawingGAN: Results on photos without GT

  32. APDrawingGAN: Results on photos without GT

  33. APDrawingGAN: Results on photos without GT

  34. APDrawingGAN: Results on photos without GT

  35. APDrawingGAN: Ablation study

  36. APDrawingGAN: Comparison with Gatys, CycleGAN, Pix2Pix

  37. APDrawingGAN: Comparison with CNNMRF, Deep Image Analogy and Headshot Portrait

  38. APDrawingGAN: More Results

  39. APDrawingGAN: User study. Ranking percentages per method (figure: ANOVA boxplot):

      Method     Rank 1   Rank 2   Rank 3
      CycleGAN   14.45%   30.90%   54.65%
      Pix2Pix    14.16%   44.92%   40.92%
      Ours       71.39%   24.18%    4.43%

  40. Conclusions. We show how traditional image processing techniques and machine learning can be used for non-photorealistic rendering of images. Machine learning is more flexible in coping with different styles. We demonstrate that they can be combined to perform effective non-photorealistic rendering. Many challenges remain: quality of results, robustness, evaluation.

  41. Conclusions: quality evaluation. Evaluation is subjective, typically using a few examples to demonstrate that the method works. One approach is to create benchmark datasets: Mould & Rosin, An image stylization benchmark, Expressive, 2016; Rosin et al., Benchmarking Non-Photorealistic Rendering of Portraits, Expressive, 2017.

  42. Thank you!
