wrap up lecture
play

Wrap Up Lecture Instructor - Simon Lucey 16-423 - Designing - PowerPoint PPT Presentation

Wrap Up Lecture Instructor - Simon Lucey 16-423 - Designing Computer Vision Apps Today Review - Project Presentations Emerging Trends in Mobile Vision Project Discussions Reminder - Project Presentation Each team will be


  1. Wrap Up Lecture Instructor - Simon Lucey 16-423 - Designing Computer Vision Apps

  2. Today • Review - Project Presentations • Emerging Trends in Mobile Vision • Project Discussions

  3. Reminder - Project Presentation • Each team will be given approximately 2.5 minutes per member to present (for example a 2 member team will have 5 minutes allotted). • Each team will fill out the following form, providing a short (must be shorter than your allotted time) YouTube clip describing your App in action. • Teams can submit their YouTube clips through the form http://goo.gl/forms/YoeQt0c1Hf. • 16423 staff will select the best presentations, with the winner receiving the the best project prize.

  4. Today • Review - Project Presentations • Emerging Trends in Mobile Vision • Project Discussions

  5. Why is Mobile CV Different? 5

  6. Why is Mobile CV Different? 5

  7. Why is Mobile CV Different? 6

  8. Why is Mobile CV Different? 6

  9. Why is Mobile CV Different? 6

  10. Balancing Power versus Perception 7

  11. Algorithm Software Architecture SOC Hardware

  12. Correlation Filters with Limited Boundaries Hamed Kiani Galoogahi Terence Sim Simon Lucey Algorithm Istituto Italiano di Tecnologia National University of Singapore Carnegie Mellon University Genova, Italy Singapore Pittsburgh, USA hamed.kiani@iit.it tsim@comp.nus.edu.sg slucey@cs.cmu.edu Abstract Correlation filters take advantage of specific proper- ties in the Fourier domain allowing them to be estimated efficiently: O ( ND log D ) in the frequency domain, ver- sus O ( D 3 + ND 2 ) spatially where D is signal length, and N is the number of signals. Recent extensions to cor- Software (a) (b) relation filters, such as MOSSE, have reignited interest of their use in the vision community due to their robustness and attractive computational properties. In this paper we demonstrate, however, that this computational efficiency 1 comes at a cost. Specifically, we demonstrate that only D proportion of shifted examples are unaffected by boundary effects which has a dramatic effect on detection/tracking � � (c) (d) performance. In this paper, we propose a novel approach to correlation filter estimation that: (i) takes advantage of Figure 1. (a) Defines the example of fixed spatial support within inherent computational redundancies in the frequency do- the image from which the peak correlation output should occur. main, (ii) dramatically reduces boundary effects, and (iii) (b) The desired output response, based on (a), of the correlation is able to implicitly exploit all possible patches densely ex- filter when applied to the entire image. (c) A subset of patch ex- tracted from training examples during learning process. Im- amples used in a canonical correlation filter where green denotes pressive object tracking and detection results are presented a non-zero correlation output, and red denotes a zero correlation in terms of both accuracy and computational efficiency. output in direct accordance with (b). (d) A subset of patch ex- Architecture amples used in our proposed correlation filter. Note that our pro- posed approach uses all possible patches stemming from different 1. Introduction parts of the image, whereas the canonical correlation filter simply employs circular shifted versions of the same single patch. The Correlation between two signals is a standard approach central dilemma in this paper is how to perform (d) efficiently in to feature detection/matching. Correlation touches nearly the Fourier domain. The two last patches of (d) show that D − 1 T every facet of computer vision from pattern detection to ob- patches near the image border are affected by circular shift in our ject tracking. Correlation is rarely performed naively in the method which can be greatly diminished by choosing D << T , where D and T indicate the length of the vectorized face patch in spatial domain. Instead, the fast Fourier transform (FFT) (a) and the whole image in (a), respectively. affords the efficient application of correlating a desired tem- plate/filter with a signal. Correlation filters, developed initially in the seminal proach is that it attempts to learn the filter in the frequency work of Hester and Casasent [15], are a method for learning domain due to the efficiency of correlation in that domain. a template/filter in the frequency domain that rose to some prominence in the 80s and 90s. Although many variants Interest in correlation filters has been reignited in the vi- have been proposed [15, 18, 20, 19], the approach’s central sion world through the recent work of Bolme et al. [5] on SOC Hardware tenet is to learn a filter, that when correlated with a set of Minimum Output Sum of Squared Error (MOSSE) correla- training signals, gives a desired response, e.g. Figure 1 (b). tion filters for object detection and tracking. Bolme et al.’s Like correlation, one of the central advantages of the ap- work was able to circumvent some of the classical problems

  13. Algorithm Software Ax = b Architecture SOC Hardware

  14. Algorithm Software Architecture Hardware

  15. Algorithm Software Architecture Hardware

  16. Algorithm Software SIMD (Single Instruction, Multiple Data) Architecture SOC Hardware

  17. Algorithm 4-way x + Software SIMD (Single Instruction, Multiple Data) � � Architecture � (length 2, 4, 8, …) vectors of integers or floats � Names: MMX, SSE, SSE2, … � SOC Hardware � �

  18. Algorithm Software Architecture SOC Hardware

  19. Algorithm Software Architecture SOC Hardware APIs in the current versions of OpenGL ES do not have the “scatter”

  20. Algorithm Software Architecture SOC Hardware APIs in the current versions of OpenGL ES do not have the “scatter”

  21. Algorithm Software Architecture SOC Hardware

  22. Algorithm Optimize Software Architecture SOC Hardware

  23. Algorithm Software Architecture SOC Hardware Optimize

  24. Algorithm Optimize Software Architecture SOC Hardware

  25. MATLAB OpenCV

  26. MATLAB OpenCV

  27. MATLAB OpenCV

  28. Some Insights for Mobile CV • Very difficult to write the fastest code. • When you are prototyping an idea you should not worry about this, but • You have to be aware of where bottle necks can occur. • This is what you will learn in this course. • Highest performance in general is non-portable. • If you want to get the most out of your system it is good to go deep. • However, options like OpenCV are good when you need to build something quickly that works. • To build good computer vision apps you need to know them algorithmically. • Simply knowing how to write fast code is not enough. • You need to also understand computer vision algorithmically. • OpenCV can be dangerous here. Some insights taken from Markus Püschel’s lectures on “How to Write fast Numerical Code”.

  29. Source: http://www.slashgear.com/iphone-7-potential-wanes-as-android-n-starts-to-tango-20440932/

  30. Source: http://www.slashgear.com/iphone-7-potential-wanes-as-android-n-starts-to-tango-20440932/

  31. Better Selfies Ohad Fried, Eli Shechtman, Dan B Goldman, and Adam Finkelstein. Perspective-aware Manipulation of Portrait Photos. ACM Transactions on Graphics (Proc. SIGGRAPH), July 2016.

  32. Better Selfies Ohad Fried, Eli Shechtman, Dan B Goldman, and Adam Finkelstein. Perspective-aware Manipulation of Portrait Photos. ACM Transactions on Graphics (Proc. SIGGRAPH), July 2016.

  33. Emerging Trends - Low Power 18 Taken from: http://lpirc.net/

  34. Emerging Trends - High Speed Camera iPhone 6 Samsung Galaxy S5 19

  35. Emerging Trends - High Speed Camera iPhone 6 Samsung Galaxy S5 19

  36. Depth From Shake H. Alismail, B. Browning, S. Lucey. “Photometric Bundle Adjustment" ACCV 2016.

  37. Depth From Shake H. Alismail, B. Browning, S. Lucey. “Photometric Bundle Adjustment" ACCV 2016.

  38. Results

  39. Results

  40. Results

  41. Results

  42. Emerging Trends - Depth Cameras 23

  43. Kinect 2 Sensor Standard Basketball Hoop 24

  44. Ball Tracking

  45. Ball Tracking

  46. Ball Tracking

  47. 26

  48. 26

  49. Limitations - Range 27

  50. Limitations - Ambient Light • A sunny day on Earth can reach up to 1120Wm -2 • Tabletop projector releases on average 10W of light. Spectral Irradiance (in Wm − 2 nm − 1 ) 2.5 Extraterrestrial Radiation Direct + Circumsolar Irradiance 2 1.5 1 0.5 0 0 500 1000 1500 2000 2500 3000 3500 4000 Wavelength (in nm) 28

  51. The Future

  52. The Future

  53. Emerging Trends - Augmented Reality 30

  54. Emerging Trends - Mobile SLAM 33 P. Tanskanen, K. Kolev, L. Meier, F. Camposeco, O. Saurer, M. Pollefeys : Live metric 3d reconstruction on mobile phones. (ICCV 2013)

  55. Emerging Trends - Mobile SLAM 33 P. Tanskanen, K. Kolev, L. Meier, F. Camposeco, O. Saurer, M. Pollefeys : Live metric 3d reconstruction on mobile phones. (ICCV 2013)

  56. Emerging Trends - Mobile SLAM 33 P. Tanskanen, K. Kolev, L. Meier, F. Camposeco, O. Saurer, M. Pollefeys : Live metric 3d reconstruction on mobile phones. (ICCV 2013)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend