cs325 artificial intelligence ch 24 computer vision i
play

CS325 Artificial Intelligence Ch. 24, Computer Vision I Object - PowerPoint PPT Presentation

CS325 Artificial Intelligence Ch. 24, Computer Vision I Object Recognition Cengiz Gnay, Emory Univ. Spring 2013 Gnay Ch. 24, Computer Vision I Object Recognition Spring 2013 1 / 27 Computer Vision Done with games, except


  1. CS325 Artificial Intelligence Ch. 24, Computer Vision I – Object Recognition Cengiz Günay, Emory Univ. Spring 2013 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 1 / 27

  2. Computer Vision Done with games, except homework :) Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

  3. Computer Vision Done with games, except homework :) Vision is one of our main perceptions Computer vision is what robots use to understand their surrounding Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

  4. Computer Vision Done with games, except homework :) Vision is one of our main perceptions Computer vision is what robots use to understand their surrounding 3 lectures: 1 Object recognition (today) 2 3D reconstruction 3 Motion analysis Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

  5. Entry/Exit Surveys Exit survey: Advanced Planning Why isn’t classical planning schema adequate for resource planning? What is the advantage gained in abstract plans by having surely-reachable versus potentially-reachable states? Entry survey: Computer Vision I – Image Processing (0.25 points) List three specific tasks where computer vision would be desirable. What do you think are the major hurdles in computer vision? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 3 / 27

  6. How Machines See: Cameras A charge-coupled device (CCD) photo sensor array: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 4 / 27

  7. Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

  8. Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f What can we can figure out from this? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

  9. Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f What can we can figure out from this? Object’s distance ( Z ) & height ( X ) based on projection height ( x ) and focal distance ( f ) Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

  10. We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  11. We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  12. We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  13. We All See a Perspective Projection Vanishing points from parallel lines: Giant panda, or just close? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

  14. Object Recognition: How Hard Can It Be? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

  15. Object Recognition: How Hard Can It Be? Problems? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

  16. Object Recognition: How Hard Can It Be? Problems? Rotation, scale, illumination, occlusion, viewpoint, deformation Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

  17. Not Hard for Us Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  18. Not Hard for Us Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  19. Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  20. Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow How does our brain do it? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  21. Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow How does our brain do it? Will have examples later. Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

  22. Invariance is Crucial for Computer Vision Must recognize objects invariant of their: Rotation, scale, illumination, occlusion, viewpoint, deformation Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 9 / 27

  23. Invariance is Crucial for Computer Vision Must recognize objects invariant of their: Rotation, scale, illumination, occlusion, viewpoint, deformation Let’s start by simplifying: 1 Greyscale (monochrome) images 2 Pixels can have values: 0. . . 255 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 9 / 27

  24. Even Terminator Has Monochrome Vision Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 10 / 27

  25. Extracting Features: Edge Detection Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  26. Extracting Features: Edge Detection Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  27. Extracting Features: Edge Detection How to detect the vertical edge? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  28. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  29. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? 2 Filter with mask: + 1 − 1 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  30. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? 2 Filter with mask: + 1 − 1 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

  31. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27 2 Filter with mask: + 1 − 1

  32. Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27 2 Filter with mask: + 1 − 1

  33. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  34. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  35. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  36. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  37. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  38. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  39. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . What’s the advantage? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  40. Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . What’s the advantage? Works in parallel! Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

  41. Neurons Can Do It Faster? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 13 / 27

  42. Detect Only Vertical Edges? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 14 / 27

  43. Detect Only Vertical Edges? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 14 / 27

  44. Horizontal and Vertical Gradients Original: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 15 / 27

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend