saliency
play

Saliency Prof. Xavier Gir, Prof. Kevin McGuinness Student: Junting - PowerPoint PPT Presentation

Day 3 Lecture 12 Saliency Prof. Xavier Gir, Prof. Kevin McGuinness Student: Junting Pan Elisa Sayrol Saliency 2 Saliency W hat have you seen ? 3 Saliency Lighthouse 4 Saliency Lighthouse House 5 Saliency Lighthouse House Rocks 6


  1. Day 3 Lecture 12 Saliency Prof. Xavier Giró, Prof. Kevin McGuinness Student: Junting Pan Elisa Sayrol

  2. Saliency 2

  3. Saliency W hat have you seen ? 3

  4. Saliency Lighthouse 4

  5. Saliency Lighthouse House 5

  6. Saliency Lighthouse House Rocks 6

  7. Saliency 7

  8. Saliency Map The Goal is to obtain the Saliency Map of an Image. Regression problem, not Classification Original Image Ground Truth Saliency Map (Eye-Fixation Map)

  9. Data Bases: Groundtruth generation Eye Tracker Mouse Click 9

  10. DataBases TRAIN VALIDATION TEST SALICON [Jiang’15] 10,000 5,000 5,000 iSun [Xu’15] 6,000 926 2,000 CAT2000 [Borji’15] 2,000 - 2,000 MIT300 [Judd’12] 300 - - Pascal-S 850 Other databases: http://saliency.mit.edu/datasets.html 10

  11. Architectures: Junting Net (Shallow Network) 2D Upsample map + filter 96x96 2340=48x48 11

  12. Architectures: Junting Net (Shallow Network) Winner of the LSUN Challenge 2015!! Loss function Mean Square Error (MSE) Weight Gaussian distribution initialization Learning rate 0.03 to 0.0001 Mini batch size 128 Training time 7h (SALICON) / 4h (iSUN) Acceleration SGD+ nesterov momentum (0.9) Regularisation Maxout norm GPU NVidia GTX 980 Shallow and Deep Convolutional Networks for Saliency Prediction Junting Pan, Kevin McGuinness, Elisa Sayrol, Noel O'Connor, Xavier Giro-i-Nieto , CVPR 2016

  13. Architectures: SalNet (Deep Network) Loss function Mean Square Error (MSE) Weight First 3 layers pre-trained with VGG, initialization the rest of the layers random distribution Learning rate 0,01(halved every 100 iterations) Mini batch size 2 images for 24.000 iterations Training time 15h Acceleration SGD+ nesterov momentum (0.9) Regularisation L2 weight GPU NVidia GTX Titan Shallow and Deep Convolutional Networks for Saliency Prediction Junting Pan, Kevin McGuinness, Elisa Sayrol, Noel O'Connor, Xavier Giro-i-Nieto , CVPR 2016

  14. Quality Results

  15. Architectures: Junting Net (Shallow Network) Winner of the LSUN Challenge 2015!! Results from CVPR LSUN Challenge 2015 (iSUN Database) 15

  16. Quantitative Results Metrics: Saliency and Human Fixations: State-of-the-art and Study of Comparison Metrics Nicolas Riche, Matthieu Duvinage, Matei Mancas, Bernard Gosselin and Thierry Dutoit, iccv 2013

  17. Architectures: Saliency Unified ( Very Deep Network) Similar to VGG_16 Saliency Unified: A Deep Architecture for simultaneous Eye Fixation Prediction and Salient Object Segmentation Srinivas S S Kruthiventi, Vennela Gudisa, Jaley H Dholakiya and R. Venkatesh Babu, CVPR 2016

  18. Quantitative Results Saliency Unified: A Deep Architecture for simultaneous Eye Fixation Prediction and Salient Object Segmentation Srinivas S S Kruthiventi, Vennela Gudisa, Jaley H Dholakiya and R. Venkatesh Babu, CVPR 2016

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend