deep learning
play

DEEP LEARNING applications Julia Rabetti Giannella Research - PowerPoint PPT Presentation

DEEP LEARNING applications Julia Rabetti Giannella Research assistant at VISGRAF Lab PhD in Design and Technology (PPDESDI-UERJ) juliagiannella@gmail.com APPLICATIONS Colorization of Black and White Images Adding Sounds To Silent


  1. DEEP LEARNING applications Julia Rabetti Giannella Research assistant at VISGRAF Lab PhD in Design and Technology (PPDESDI-UERJ) juliagiannella@gmail.com

  2. APPLICATIONS • Colorization of Black and White Images • Adding Sounds To Silent Movies • Object Classification in Photographs • Automatic Handwriting Generation • Character Text Generation. • Image Caption Generation. • Automatic Game Playing • Artistic style transfer Source: http://machinelearningmastery.com/inspirational-applications-deep-learning/

  3. 1) Colorization of Black and White Images • problem of adding color to black and white photographs • traditionally, this was done by hand with human effort • CV task attacked by different approaches • topic of relative importance in SIGGRAPH and EUROGRAPH • DL approach involves the use of very large CNN 
 and supervised layers that recreate the image with 
 the addition of color

  4. Paper Colorful Image Colorization (ECCV, 2016) Source: http://richzhang.github.io/colorization/

  5. Network architecture Source: https://arxiv.org/pdf/1603.08511.pdf

  6. Semantic interpretability of results Source: http://richzhang.github.io/colorization/

  7. [Algorithmia] Demo Source: http://demos.algorithmia.com/colorize-photos/

  8. Dana Keller - designer and photo colorizer Source: https://www.youtube.com/watch?v=bYHnWhZkAIc Source: http://www.danarkeller.com/about/

  9. Comparing Keller Algorithmia

  10. Comparing Keller Algorithmia

  11. Comparing Keller Algorithmia

  12. Comparing Keller Algorithmia

  13. Comparing Keller Algorithmia

  14. Comparing Keller Algorithmia

  15. 2) Object Classification in Photographs • task requires the classification of objects within a photograph as one of a set of previously known objects • State-of-the-art results have been achieved on benchmark examples of this problem using very large CNN • derives from image classification task • breakthrough: ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky et al., 2012) • AlexNet won ILSVRC-2012 challenge Source: http://www.cs.toronto.edu/~fritz/absps/imagenet.pdf

  16. Classification with localization • more complex variation of this task involves specifically identifying one or more objects within the scene of the photograph and drawing a box around them • GoogLeNet won ILSVRC-2014 challenge in this task Source: https://research.googleblog.com/2014/09/building-deeper-understanding-of-images.html

  17. 2.1) DL and RIO2016 • VISGRAF project (out 2016) • task: automatically classify and cluster images by subject features related to the Olympic Games, Olympic Torch • CNN model and supervised learning • TensorFlow (open source software library) • Inception-v3 ( Going Deeper with Convolutions , 2015) • transfer learning (manually labeled 100 examples) Source: http://lvelho.impa.br/dl_rio2016/index.html Source: https://arxiv.org/abs/1409.4842

  18. Confidence score A subset of 12 from 2091 images with confidence score over 83% for the Olympic torch category Source: http://lvelho.impa.br/dl_rio2016/metodologia.html

  19. Torch Mosaic Source: http://lvelho.impa.br/dl_rio2016/mosaico.html

  20. Torch Mosaic Source: http://lvelho.impa.br/dl_rio2016/mosaico.html

  21. 2.2) Twitter Facial Analysis Reveals Demographics of Presidential Campaign Followers • (Mit Technology Review, march 2016) • IN: Conference on Web and Social Media • understand follower demographics of Trump and Clinton by crossing Twitter metadata and facial features • a CNN model on followers’ profile images extracts information on gender, race and age Source: https://www.technologyreview.com/s/601074/twitter-facial-analysis-reveals-demographics-of-presidential- campaign-followers/?utm_campaign=add_this&utm_source=email&utm_medium=post Source: https://arxiv.org/abs/1603.03097

  22. A Comparison of the Trumpists and Clintonists C"lintonists" 
 in the Twitter Sphere Source: https://arxiv.org/abs/1603.03097

  23. 2.3) NVIDIA DRIVENet Demo - Visualizing 
 a Self-Driving Car Source: https://www.youtube.com/watch?v=HJ58dbd5g8g

  24. 3) Artistic style transfer • task: separate and recombine content and style of arbitrary images, providing a neural algorithm for the creation of artistic images • A Neural Algorithm of Artistic Style (Gatys et al., 2015) Source: https://arxiv.org/abs/1508.06576

  25. Convolutional Neural Network (CNN) Source: https://arxiv.org/abs/1508.06576

  26. An example The style transfer algorithm crosses a photo with a painting style; for example Neil deGrasse Tyson in the style of Kadinsky’s Jane Rouge Bleu. Photo by Guillaume Piolle, used with permission. Source: https://research.googleblog.com/2016/02/exploring-intersection-of-art-and.html

  27. 3.1) DeepDream • computer vision program created by Google • given an input image returns a version with h"allucinogenic" appearance • originates in a CNN codenamed Inception after the film of the same name developed for the ILSVRC-2014 • CNN can also be run in reverse, to do synthesis • enhance faces and certain animals -> pareidolia results Source: http://deepdreamgenerator.com/ Source: https://en.wikipedia.org/wiki/DeepDream

  28. 3.1) DeepDream Source: http://deepdreamgenerator.com/ Source: https://en.wikipedia.org/wiki/DeepDream

  29. 3.2) Prisma App • photo-editing application that utilizes a neural network and to transform the image into an artistic effect • became popular on July 2016 • created by Alexey Moiseenkov • reference A Neural Algorithm of Artistic Style (2016) Source: http://prisma-ai.com/ Source: https://en.wikipedia.org/wiki/Prisma_(app)

  30. 3.2) Prisma App

  31. 3.2) Prisma App

  32. 3.3) Artistic style transfer (video) • Artistic style transfer for videos (Ruder et al.,2016) Source: https://arxiv.org/abs/1604.08610 Source: https://www.youtube.com/watch?v=Khuj4ASldmU

  33. 3.4) Supercharging Style Transfer for video • A Learned Representation For Artistic Style (Dumoulin et al., 2016) • CNN that learns multiple styles at the same time • method enables style interpolation Source: https://arxiv.org/abs/1610.07629 Source: https://research.googleblog.com/2016/10/supercharging-style-transfer.html

  34. 3.4) Supercharging Style Transfer for video Source: https://www.youtube.com/watch?v=6ZHiARZmiUI

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend