scene understanding
play

Scene Understanding Introduction & Overview Outline Motivation - PowerPoint PPT Presentation

Scene Understanding Introduction & Overview Outline Motivation The problems Scene perception Approaches Objects in context Objects in context Scene classification Natural / Urban Sunny / Cloudy


  1. Scene Understanding Introduction & Overview

  2. Outline • Motivation • The problems • Scene perception • Approaches

  3. Objects in context

  4. Objects in context

  5. Συνεργός

  6. Scene classification • Natural / Urban • Sunny / Cloudy • Open / Closed • Object content • Location • …

  7. Spatial boundary

  8. 3D scene understanding (from a single image)

  9. 3D scene understanding (from a single image)

  10. 3D scene understanding (from a single image)

  11. Image memorability • We see hundreds of images a day • We even remember some of them (but forget many others) • Subjective? Not completely • “ not an inexplicable phenomenon”

  12. Image memorability

  13. Image memorability • Understanding and Predicting Image Memorability at a Large Scale (ICCV 2015)

  14. Scene perception • 1969: Can remember and describe images seen for 0.1 seconds • 1981: Three levels of representation 1. Prominent object 2. Multiple-component 3. Global scene-emergent features • Object-centered vs. Space-centered

  15. Time frame of scene perception • Global-to-local analysis • 20-30 ms: Natural/Urban, Open/Closed • 100ms: Basic category (beach, forest, etc.) • 200ms+: Object identities

  16. Temporal context • Classification of ambiguous images depend on previously perceived scenes

  17. GIST descriptor • Meaningful information from a glimpse • “naturalness, openness, roughness, expansion, ruggedness”

  18. GIST descriptor • Segment image by 4x4 grid, compute oriented histograms

  19. 3D Scene from single images • Geometric classes • Support, Vertical, Sky • Vertical subclasses: Left, center, right Porous, solid

  20. 3D Scene from single images • Divide image into “superpixels” • Group superpixels into larger regions • Compute features for each region • Compute confidence levels for different geometric classes

  21. Photo pop-up • Find connected components for vertical regions • Fit line segments to base of each component • Compute depth at each point

  22. References Barenholtz, E. (2013). Quantifying the role of context in visual object recognition. Visual Cognition, 22(1), 30-56. • Biederman, I. (1981). On the semantics of a glance at a scene. In M. Kubovy & J. R. Pomerantz (Eds.), Perceptual organization (pp. 213–263). • Hillsdale, NJ: Lawrence Erlbaum Associates. Fei-Fei, L., Iyer, A., Koch, C., & Perona, P . (2007). What do we perceive in a glance of a real-world scene? Journal of Vision, 7(1), 1–29. • Isola, P ., et al. (2011). Understanding the intrinsic memorability of images. Advances in Neural Information Processing Systems, 24. • Jahangiri, E., et al. (2014). Object-Level Generative Models for 3D Scene Understanding. SUNw: Scene Understanding Workshop. • Khosla, A., Raju, A. S., Torralba, A., & Oliva, A. (2015). Understanding and Predicting Image Memorability at a Large Scale. 2015 IEEE • International Conference on Computer Vision (ICCV). Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of • Computer Vision, 42, 145–175. Oliva, A., & Torralba, A. (2006). Building the gist of a scene: The role of global image features in recognition. Progress in Brain Research • Visual Perception - Fundamentals of Awareness: Multi-Sensory Integration and High-Order Perception, 23-36. Oliva, A., & Torralba, A. (2007). The role of context in object recognition. Trends in Cognitive Sciences, 11(12), 520-527. • Oliva, A. (2014). Scene Perception. In L. M. Chalupa & J. S. Warner (Eds.), The New Visual Neurosciences (pp. 725-732). Cambridge, MA: MIT • Press. Park, S., et al. (2011). Disentangling Scene Content from Spatial Boundary: Complementary Roles for the Parahippocampal Place Area and • Lateral Occipital Complex in Representing Real-World Scenes. Journal of Neuroscience, 31(4), 1333-1340. Potter, M. C., & Levy, E. I. (1969). Recognition memory for a rapid sequence of pictures. Journal of Experimental Psychology, 81, 10–15. •

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend