DEEP LEARNING applications Julia Rabetti Giannella Research - - PowerPoint PPT Presentation

deep learning
SMART_READER_LITE
LIVE PREVIEW

DEEP LEARNING applications Julia Rabetti Giannella Research - - PowerPoint PPT Presentation

DEEP LEARNING applications Julia Rabetti Giannella Research assistant at VISGRAF Lab PhD in Design and Technology (PPDESDI-UERJ) juliagiannella@gmail.com APPLICATIONS Colorization of Black and White Images Adding Sounds To Silent


slide-1
SLIDE 1

DEEP LEARNING

applications

Julia Rabetti Giannella Research assistant at VISGRAF Lab PhD in Design and Technology (PPDESDI-UERJ) juliagiannella@gmail.com

slide-2
SLIDE 2

APPLICATIONS

  • Colorization of Black and White Images
  • Adding Sounds To Silent Movies
  • Object Classification in Photographs
  • Automatic Handwriting Generation
  • Character Text Generation.
  • Image Caption Generation.
  • Automatic Game Playing
  • Artistic style transfer

Source: http://machinelearningmastery.com/inspirational-applications-deep-learning/

slide-3
SLIDE 3

1) Colorization of Black and White Images

  • problem of adding color to black and white photographs
  • traditionally, this was done by hand with human effort
  • CV task attacked by different approaches
  • topic of relative importance in SIGGRAPH and EUROGRAPH
  • DL approach involves the use of very large CNN 


and supervised layers that recreate the image with 
 the addition of color

slide-4
SLIDE 4

Paper Colorful Image Colorization (ECCV, 2016)

Source: http://richzhang.github.io/colorization/

slide-5
SLIDE 5

Network architecture

Source: https://arxiv.org/pdf/1603.08511.pdf

slide-6
SLIDE 6

Semantic interpretability of results

Source: http://richzhang.github.io/colorization/

slide-7
SLIDE 7

[Algorithmia] Demo

Source: http://demos.algorithmia.com/colorize-photos/

slide-8
SLIDE 8

Dana Keller - designer and photo colorizer

Source: https://www.youtube.com/watch?v=bYHnWhZkAIc Source: http://www.danarkeller.com/about/

slide-9
SLIDE 9

Comparing

Keller Algorithmia

slide-10
SLIDE 10

Comparing

Keller Algorithmia

slide-11
SLIDE 11

Comparing

Keller Algorithmia

slide-12
SLIDE 12

Comparing

Keller Algorithmia

slide-13
SLIDE 13

Comparing

Keller Algorithmia

slide-14
SLIDE 14

Comparing

Keller Algorithmia

slide-15
SLIDE 15

2) Object Classification in Photographs

  • task requires the classification of objects within a

photograph as one of a set of previously known objects

  • State-of-the-art results have been achieved on benchmark

examples of this problem using very large CNN

  • derives from image classification task
  • breakthrough: ImageNet Classification with Deep

Convolutional Neural Networks (Krizhevsky et al., 2012)

  • AlexNet won ILSVRC-2012 challenge

Source: http://www.cs.toronto.edu/~fritz/absps/imagenet.pdf

slide-16
SLIDE 16

Classification with localization

  • more complex variation
  • f this task involves

specifically identifying

  • ne or more objects

within the scene of the photograph and drawing a box around them

  • GoogLeNet won

ILSVRC-2014 challenge in this task

Source: https://research.googleblog.com/2014/09/building-deeper-understanding-of-images.html

slide-17
SLIDE 17

2.1) DL and RIO2016

  • VISGRAF project (out 2016)
  • task: automatically classify and cluster images by subject

features related to the Olympic Games, Olympic Torch

  • CNN model and supervised learning
  • TensorFlow (open source software library)
  • Inception-v3 (Going Deeper with Convolutions, 2015)
  • transfer learning (manually labeled 100 examples)

Source: http://lvelho.impa.br/dl_rio2016/index.html Source: https://arxiv.org/abs/1409.4842

slide-18
SLIDE 18

Confidence score

Source: http://lvelho.impa.br/dl_rio2016/metodologia.html

A subset of 12 from 2091 images with confidence score over 83% for the Olympic torch category

slide-19
SLIDE 19

Torch Mosaic

Source: http://lvelho.impa.br/dl_rio2016/mosaico.html

slide-20
SLIDE 20

Torch Mosaic

Source: http://lvelho.impa.br/dl_rio2016/mosaico.html

slide-21
SLIDE 21

2.2) Twitter Facial Analysis Reveals Demographics

  • f Presidential Campaign Followers
  • (Mit Technology Review, march 2016)
  • IN: Conference on Web and Social Media
  • understand follower demographics of Trump and Clinton by

crossing Twitter metadata and facial features

  • a CNN model on followers’ profile images extracts

information on gender, race and age

Source: https://www.technologyreview.com/s/601074/twitter-facial-analysis-reveals-demographics-of-presidential- campaign-followers/?utm_campaign=add_this&utm_source=email&utm_medium=post Source: https://arxiv.org/abs/1603.03097

slide-22
SLIDE 22

A Comparison of the Trumpists and Clintonists

Source: https://arxiv.org/abs/1603.03097

C"lintonists"
 in the Twitter Sphere

slide-23
SLIDE 23

2.3) NVIDIA DRIVENet Demo - Visualizing 
 a Self-Driving Car

Source: https://www.youtube.com/watch?v=HJ58dbd5g8g

slide-24
SLIDE 24

3) Artistic style transfer

  • task: separate and recombine content and style of arbitrary

images, providing a neural algorithm for the creation of artistic images

  • A Neural Algorithm of Artistic Style (Gatys et al., 2015)

Source: https://arxiv.org/abs/1508.06576

slide-25
SLIDE 25

Convolutional Neural Network (CNN)

Source: https://arxiv.org/abs/1508.06576

slide-26
SLIDE 26

An example

Source: https://research.googleblog.com/2016/02/exploring-intersection-of-art-and.html

The style transfer algorithm crosses a photo with a painting style; for example Neil deGrasse Tyson in the style of Kadinsky’s Jane Rouge Bleu. Photo by Guillaume Piolle, used with permission.

slide-27
SLIDE 27

3.1) DeepDream

Source: http://deepdreamgenerator.com/ Source: https://en.wikipedia.org/wiki/DeepDream

  • computer vision program created by Google
  • given an input image returns a version with h"allucinogenic"

appearance

  • originates in a CNN codenamed Inception after the film of

the same name developed for the ILSVRC-2014

  • CNN can also be run in reverse, to do synthesis
  • enhance faces and certain animals -> pareidolia results
slide-28
SLIDE 28

3.1) DeepDream

Source: http://deepdreamgenerator.com/ Source: https://en.wikipedia.org/wiki/DeepDream

slide-29
SLIDE 29

3.2) Prisma App

Source: http://prisma-ai.com/ Source: https://en.wikipedia.org/wiki/Prisma_(app)

  • photo-editing application that utilizes a neural network and

to transform the image into an artistic effect

  • became popular on July 2016
  • created by Alexey Moiseenkov
  • reference A Neural Algorithm of Artistic Style (2016)
slide-30
SLIDE 30

3.2) Prisma App

slide-31
SLIDE 31

3.2) Prisma App

slide-32
SLIDE 32

3.3) Artistic style transfer (video)

Source: https://arxiv.org/abs/1604.08610 Source: https://www.youtube.com/watch?v=Khuj4ASldmU

  • Artistic style transfer for videos (Ruder et al.,2016)
slide-33
SLIDE 33

3.4) Supercharging Style Transfer for video

Source: https://arxiv.org/abs/1610.07629 Source: https://research.googleblog.com/2016/10/supercharging-style-transfer.html

  • A Learned Representation For Artistic Style (Dumoulin et al.,

2016)

  • CNN that learns multiple styles at the same time
  • method enables style interpolation
slide-34
SLIDE 34

3.4) Supercharging Style Transfer for video

Source: https://www.youtube.com/watch?v=6ZHiARZmiUI