Automatic Colorization Gustav Larsson TTI Chicago / University of - - PowerPoint PPT Presentation

automatic colorization
SMART_READER_LITE
LIVE PREVIEW

Automatic Colorization Gustav Larsson TTI Chicago / University of - - PowerPoint PPT Presentation

Automatic Colorization Gustav Larsson TTI Chicago / University of Chicago Joint work with Michael Maire and Greg Shakhnarovich NVIDIA @ SIGGRAPH 2016 Colorization Let us first define colorization Colorization Definition 1: The inverse


slide-1
SLIDE 1

Automatic Colorization

Gustav Larsson

TTI Chicago / University of Chicago

Joint work with Michael Maire and Greg Shakhnarovich

NVIDIA @ SIGGRAPH 2016

slide-2
SLIDE 2

Colorization

Let us first define “colorization”

slide-3
SLIDE 3

Colorization

Definition 1: The inverse of desaturation. Original

slide-4
SLIDE 4

Colorization

Definition 1: The inverse of desaturation. Original Desaturate Grayscale

slide-5
SLIDE 5

Colorization

Definition 1: The inverse of desaturation. Grayscale

slide-6
SLIDE 6

Colorization

Definition 1: The inverse of desaturation. Original Colorize Grayscale

slide-7
SLIDE 7

Colorization

Definition 1: The inverse of desaturation. (Note: Impossible!) Original Colorize Grayscale

slide-8
SLIDE 8

Colorization

Definition 2: An inverse of desaturation, that... Grayscale

slide-9
SLIDE 9

Colorization

Definition 2: An inverse of desaturation, that... Our Method Colorize Grayscale ... is plausible and pleasing to a human observer.

slide-10
SLIDE 10

Colorization

Definition 2: An inverse of desaturation, that... Our Method Colorize Grayscale ... is plausible and pleasing to a human observer.

  • Def. 1: Training + Quantitative Evaluation
  • Def. 2: Qualitative Evaluation
slide-11
SLIDE 11

Manual colorization

I thought I would give it a quick try...

slide-12
SLIDE 12

Manual colorization

Grass is green

(low-level: grass texture / mid-level: tree recognition / high-level: scene understanding)

slide-13
SLIDE 13

Manual colorization

Sky is blue

slide-14
SLIDE 14

Manual colorization

Mountains are... brown?

slide-15
SLIDE 15

Manual colorization

Use the original luminosity

slide-16
SLIDE 16

Manual colorization

Manual (≈ 15 s)

slide-17
SLIDE 17

Manual colorization

Manual (≈ 15 s) Manual (≈ 3 min)

slide-18
SLIDE 18

Manual colorization

Manual (≈ 15 s) Manual (≈ 3 min) Automatic (< 1 s)

slide-19
SLIDE 19

A brief history

The history of computer-aided colorization in 3 slides.

slide-20
SLIDE 20

Method 1: Scribbles

Manual Automatic

User-defined scribbles define colors. Algorithm fills it in. Input Output

Levin et al. (2004)

→ Levin et al. (2004); Huang et al. (2005); Qu et al. (2006); Luan et al. (2007)

slide-21
SLIDE 21

Method 2: Transfer

Manual Automatic

Reference image(s) is provided. Scribbles are automatically created from correspondences. Reference Input Output

Charpiat et al. (2008)

→ Welsh et al. (2002); Irony et al. (2005); Charpiat et al. (2008); Morimoto et al. (2009); Chia et al. (2011)

slide-22
SLIDE 22

Method 2: Transfer

Manual Automatic

Reference image(s) is provided. Scribbles are automatically created from correspondences. Reference Input Output

Charpiat et al. (2008)

→ Welsh et al. (2002); Irony et al. (2005); Charpiat et al. (2008); Morimoto et al. (2009); Chia et al. (2011)

slide-23
SLIDE 23

Method 3: Prediction

Manual Automatic

Fully parametric prediction. colorize                         = Automatic colorization is gaining interest recently:

→ Deshpande et al., Cheng et al.

  • ICCV 2015

; Iizuka & Simo-Serra et al.

  • SIGGRAPH 2016 (2pm, Ballroom E)

Zhang et al., Larsson et al.

  • ECCV 2016
slide-24
SLIDE 24

Method 3: Prediction

Manual Automatic

Fully parametric prediction. colorize pixel                         = (60, 87, 44) Automatic colorization is gaining interest recently:

→ Deshpande et al., Cheng et al.

  • ICCV 2015

; Iizuka & Simo-Serra et al.

  • SIGGRAPH 2016 (2pm, Ballroom E)

Zhang et al., Larsson et al.

  • ECCV 2016
slide-25
SLIDE 25

Model

Design principles:

  • Semantic knowledge
slide-26
SLIDE 26

Model

Design principles:

  • Semantic knowledge → Leverage ImageNet-based classifier

p

VGG-16-Gray Input: Grayscale Image conv1 1 conv5 3 (fc6) conv6 (fc7) conv7

slide-27
SLIDE 27

Model

Design principles:

  • Semantic knowledge → Leverage ImageNet-based classifier
  • Low-level/high-level features

p

VGG-16-Gray Input: Grayscale Image conv1 1 conv5 3 (fc6) conv6 (fc7) conv7

slide-28
SLIDE 28

Model

Design principles:

  • Semantic knowledge → Leverage ImageNet-based classifier
  • Low-level/high-level features → Zoom-out/Hypercolumn architecture

p

VGG-16-Gray Input: Grayscale Image conv1 1 conv5 3 (fc6) conv6 (fc7) conv7 Hypercolumn

slide-29
SLIDE 29

Model

Design principles:

  • Semantic knowledge → Leverage ImageNet-based classifier
  • Low-level/high-level features → Zoom-out/Hypercolumn architecture
  • Colorization not unique

p

VGG-16-Gray Input: Grayscale Image conv1 1 conv5 3 (fc6) conv6 (fc7) conv7 Hypercolumn

slide-30
SLIDE 30

Model

Design principles:

  • Semantic knowledge → Leverage ImageNet-based classifier
  • Low-level/high-level features → Zoom-out/Hypercolumn architecture
  • Colorization not unique → Predict histograms

p

VGG-16-Gray Input: Grayscale Image Output: Color Image conv1 1 conv5 3 (fc6) conv6 (fc7) conv7 Hypercolumn h fc1 Hue Chroma Ground-truth Lightness

slide-31
SLIDE 31

Instantiation

Going from histogram prediction to RGB:

  • Sample
slide-32
SLIDE 32

Instantiation

Going from histogram prediction to RGB:

  • Sample
  • Mode
slide-33
SLIDE 33

Instantiation

Going from histogram prediction to RGB:

  • Sample
  • Mode
  • Median
slide-34
SLIDE 34

Instantiation

Going from histogram prediction to RGB:

  • Sample
  • Mode
  • Median
  • Expectation
slide-35
SLIDE 35

Instantiation

Going from histogram prediction to RGB:

  • Sample
  • Mode
  • Median ← Chroma
  • Expectation ← Hue
slide-36
SLIDE 36

Instantiation

Going from histogram prediction to RGB:

  • Sample
  • Mode
  • Median ← Chroma
  • Expectation ← Hue

The histogram representation is rich and flexible:

slide-37
SLIDE 37

Instantiation

Going from histogram prediction to RGB:

  • Sample
  • Mode
  • Median ← Chroma
  • Expectation ← Hue

The histogram representation is rich and flexible:

slide-38
SLIDE 38

Results

Significant improvement over state-of-the-art:

10 15 20 25 30 35

PSNR

0.00 0.05 0.10 0.15 0.20 0.25

Frequency Cheng et al. Our method

Cheng et al. (2015)

0.0 0.2 0.4 0.6 0.8 1.0

RMSE (αβ)

0.0 0.2 0.4 0.6 0.8 1.0

% Pixels No colorization Welsh et al. Deshpande et al. Ours Deshpande et al. (GTH) Ours (GTH)

Deshpande et al. (2015)

slide-39
SLIDE 39

Comparison

Model AuC CMF VGG Top-1 Turk non-rebal rebal Classification Labeled Real (%) (%) (%) Accuracy (%) mean std Ground Truth 100.00 100.00 68.32 50.00 – Gray 89.14 58.01 52.69 – – Random 84.17 57.34 41.03 12.99 2.09 Dahl 90.42 58.92 48.72 18.31 2.01 Zhang et al. 91.57 65.12 56.56 25.16 2.26 Zhang et al. (rebal) 89.50 67.29 56.01 32.25 2.41 Ours 91.70 65.93 59.36 27.24 2.31

Table: Source: Zhang et al. (2016)

slide-40
SLIDE 40
slide-41
SLIDE 41
slide-42
SLIDE 42

Examples

Input Our Method Ground-truth Input Our Method Ground-truth

slide-43
SLIDE 43

Figure: Failure modes. Figure: B&W photographs.

slide-44
SLIDE 44

Self-supervision (ongoing work)

Colorization as a means to learn visual representations:

  • 1. Train colorization from scratch
slide-45
SLIDE 45

Self-supervision (ongoing work)

Colorization as a means to learn visual representations:

  • 1. Train colorization from scratch
  • 2. Use network for segmentation, detection, style transfer, texture generation, etc.
slide-46
SLIDE 46

Self-supervision (ongoing work)

Colorization as a means to learn visual representations:

  • 1. Train colorization from scratch
  • 2. Use network for segmentation, detection, style transfer, texture generation, etc.

Initialization Architecture XImageNet YImageNet Color mIU (%) Classifier (ours) VGG-16 ✓ ✓ 64.0 Random VGG-16 32.5 Classifier AlexNet ✓ ✓ ✓ 48.0 Random AlexNet ✓ 19.8

Table: VOC 2012 segmentation validation set.

slide-47
SLIDE 47

Self-supervision (ongoing work)

Colorization as a means to learn visual representations:

  • 1. Train colorization from scratch
  • 2. Use network for segmentation, detection, style transfer, texture generation, etc.

Initialization Architecture XImageNet YImageNet Color mIU (%) Classifier (ours) VGG-16 ✓ ✓ 64.0 Random VGG-16 32.5 Classifier AlexNet ✓ ✓ ✓ 48.0 BiGAN (Donahue et al.) AlexNet ✓ ✓ 34.9 Inpainter (Deepak et al.) AlexNet ✓ ✓ 29.7 Random AlexNet ✓ 19.8

Table: VOC 2012 segmentation validation set.

slide-48
SLIDE 48

Self-supervision (ongoing work)

Colorization as a means to learn visual representations:

  • 1. Train colorization from scratch
  • 2. Use network for segmentation, detection, style transfer, texture generation, etc.

Initialization Architecture XImageNet YImageNet Color mIU (%) Classifier (ours) VGG-16 ✓ ✓ 64.0 Colorizer VGG-16 ✓ 50.2 Random VGG-16 32.5 Classifier AlexNet ✓ ✓ ✓ 48.0 BiGAN (Donahue et al.) AlexNet ✓ ✓ 34.9 Inpainter (Deepak et al.) AlexNet ✓ ✓ 29.7 Random AlexNet ✓ 19.8

Table: VOC 2012 segmentation validation set.

slide-49
SLIDE 49

Questions?

Try it out yourself:

http://colorize.ttic.edu

slide-50
SLIDE 50

References

Charpiat, G., Hofmann, M., and Sch¨

  • lkopf, B. (2008). Automatic image colorization via multimodal predictions. In ECCV.

Cheng, Z., Yang, Q., and Sheng, B. (2015). Deep colorization. In ICCV. Chia, A. Y.-S., Zhuo, S., Gupta, R. K., Tai, Y.-W., Cho, S.-Y., Tan, P., and Lin, S. (2011). Semantic colorization with internet images. ACM Transactions on Graphics (TOG), 30(6). Deshpande, A., Rock, J., and Forsyth, D. (2015). Learning large-scale automatic image colorization. In ICCV. Huang, Y.-C., Tung, Y.-S., Chen, J.-C., Wang, S.-W., and Wu, J.-L. (2005). An adaptive edge detection based colorization algorithm and its

  • applications. In ACM international conference on Multimedia.

Irony, R., Cohen-Or, D., and Lischinski, D. (2005). Colorization by example. In Eurographics Symp. on Rendering. Levin, A., Lischinski, D., and Weiss, Y. (2004). Colorization using optimization. ACM Transactions on Graphics (TOG), 23(3). Luan, Q., Wen, F., Cohen-Or, D., Liang, L., Xu, Y.-Q., and Shum, H.-Y. (2007). Natural image colorization. In Eurographics conference on Rendering Techniques. Morimoto, Y., Taguchi, Y., and Naemura, T. (2009). Automatic colorization of grayscale images using multiple images on the web. In SIGGRAPH: Posters. Qu, Y., Wong, T.-T., and Heng, P.-A. (2006). Manga colorization. ACM Transactions on Graphics (TOG), 25(3). Welsh, T., Ashikhmin, M., and Mueller, K. (2002). Transferring color to greyscale images. ACM Transactions on Graphics (TOG), 21(3). Zhang, R., Isola, P., and Efros, A. A. (2016). Colorful image colorization. In ECCV.