Visual Object Recognition Computational Models and - - PowerPoint PPT Presentation

visual object recognition computational models and
SMART_READER_LITE
LIVE PREVIEW

Visual Object Recognition Computational Models and - - PowerPoint PPT Presentation

Visual Object Recognition Computational Models and Neurophysiological Mechanisms Neurobiology 130/230. Harvard College/GSAS 78454 Web site : http://tinyurl.com/visionclass (Class notes, readings, etc) Location: Biolabs 2062 Mondays 03:30


slide-1
SLIDE 1

Visual Object Recognition Computational Models and Neurophysiological Mechanisms Neurobiology 130/230. Harvard College/GSAS 78454

Web site: http://tinyurl.com/visionclass (Class notes, readings, etc) Location: Biolabs 2062 Time: Mondays 03:30 – 05:30 Lectures: Faculty: Gabriel Kreiman and invited guests Contact information:

Gabriel Kreiman Joseph Olson gabriel.kreiman@tch.harvard.edu josepholson@fas.harvard.edu 617-919-2530 Office Hours: After Class. Mon 05:30-06:30

slide-2
SLIDE 2

Visual Object Recognition Computational Models and Neurophysiological Mechanisms Neurobiology 230. Harvard College/GSAS 78454

Class 1. Sep-12 Introduction to pattern recognition. Why is vision difficult? Visual input. Natural image

  • statistics. The retina.

Class 2. Sep-19 Lesion studies in animal models. Neurological studies of cortical visual deficits in humans. Class 3. Sep-26 Psychophysics of visual object recognition [Joseph Olson] Class 4. Oct-03 Introduction to the thalamus and primary visual cortex [Camille Gomez-Laberge] Oct-10 Columbus Day. No class. Class 5. Oct-17 Adventures into terra incognita. Neurophysiology beyond V1 [Hanlin Tang] Class 6. Oct-24 First steps into inferior temporal cortex [Carlos Ponce] Class 7. Oct-31 From the highest echelons of visual processing to cognition [Leyla Isik] Class 8. Nov-07 Correlation and causality. Electrical stimulation in visual cortex. Class 9. Nov-14 Theoretical neuroscience. Computational models of neurons and neural networks. [Bill Lotter] Class 10. Nov-21 Computer vision. Towards artificial intelligence systems for cognition [David Cox] Class 11. Nov-28 Computational models of visual object recognition. [Kreiman] Class 12. Dec-05 [Extra class] Towards understanding subjective visual perception. Visual consciousness.

slide-3
SLIDE 3

Psychophysics: The study of the dependencies of

psychological experiences upon the physical stimuli that generate them

Basic measures:

  • Reaction time — The time taken by subjects to perform a task or make a judgment can

give an indication (or at least an upper bound) of how long the necessary psychological (and hence neural) processing takes.

  • Performance — Often inversely related to reaction time. There are techniques for

mitigating response biases.

  • Threshold — Stimuli can be varied to determine the threshold for detection, discrimination,
  • r some more complex psychological phenomenon.
slide-4
SLIDE 4
  • What are the theories / evidence / questions driving the

motivation behind some psychophysics experiments of visual recognition? – Atoms of recognition – Gestalt (whole vs sum of the parts) – Context – Tolerance and Invariance to image transformations – Fundamental properties of visual system (e.g. speed)

slide-5
SLIDE 5

Gestalt laws of grouping

Basic phenomenological constraints

  • Law of Closure — The mind may experience elements it does not

perceive through sensation, in order to complete a regular figure (that is, to increase regularity).

  • Law of Similarity — The mind groups similar elements into collective

entities or totalities. This similarity might depend on relationships of form, color, size, or brightness.

  • Law of Proximity — Spatial or temporal proximity of elements may

induce the mind to perceive a collective or totality.

  • Law of Symmetry (Figure ground relationships)— Symmetrical

images are perceived collectively, even in spite of distance.

  • Law of Continuity — The mind continues visual, auditory, and kinetic

patterns.

  • Law of Common Fate — Elements with the same moving direction

are perceived as a collective or unit.

slide-6
SLIDE 6

Law of closure: perceiving objects as whole even if they are not complete

The mind may experience elements it does not perceive through sensation, in order to complete a regular figure (that is, to increase regularity)

slide-7
SLIDE 7

Law of closure: perceiving objects as whole even if they are not complete

The mind may experience elements it does not perceive through sensation, in order to complete a regular figure (that is, to increase regularity)

slide-8
SLIDE 8

Law of similarity

The mind groups similar elements into collective entities or totalities. This similarity might depend on relationships of form, color, size, or brightness

slide-9
SLIDE 9

Law of proximity

  • Spatial or temporal proximity of elements may induce

the mind to perceive a collective or totality.

slide-10
SLIDE 10

Law of symmetry

[ ] { } [ ] { } [ ] { } [ { } { } { } { } { } { } { | | | | | | | | | | | | |

http://isle.hanover.edu/Ch05O bject/Ch05SymmetryLaw.html

  • The Law of Symmetry is the gestalt grouping law that states that elements

that are symmetrical to each other tend to be perceived as a unified group

slide-11
SLIDE 11

Law of continuity

The mind continues visual, auditory, and kinetic patterns

slide-12
SLIDE 12

Law of continuity

The mind continues visual, auditory, and kinetic patterns

slide-13
SLIDE 13

Law of common fate

slide-14
SLIDE 14

MIRCs Minimal Recognizable Configurations

slide-15
SLIDE 15

Holistic representation of faces

McKone et al, Frontiers in Psychology, 2013

slide-16
SLIDE 16

Holistic representation of faces

McKone et al, Frontiers in Psychology, 2013

slide-17
SLIDE 17

Holistic representation of faces

McKone et al, Frontiers in Psychology, 2013 Composite illusion

slide-18
SLIDE 18

Beyond pixels – Context matters

slide-19
SLIDE 19

Tolerance to image transformations

Scale Position Rotation (2D) Rotation (3D) – viewpoint Color Illumination Cues Clutter Occlusion Other non-rigid transformations (aging, expressions, etc)

slide-20
SLIDE 20

Scale tolerance

x

A A A A A

slide-21
SLIDE 21

One-shot learning for scale tolerance

Which one is it?

slide-22
SLIDE 22

Position tolerance

x

bd db bd bd bd db db db db bd bd

slide-23
SLIDE 23

Tolerance to viewpoint and illumination changes

slide-24
SLIDE 24

Recognition from minimal features

slide-25
SLIDE 25

Recognition of caricatures

Images: Hanoch Piven

slide-26
SLIDE 26

Visual recognition depends on experience

slide-27
SLIDE 27

Recognition of images flashed for ~100 ms (demo)

NEED MOVIE

slide-28
SLIDE 28

Visual recognition can be extremely fast

Kirchner, H., & Thorpe, S. J. (2006). Ultra-rapid object detection with saccadic eye movements: visual processing speed revisited. Vision Res, 46(11), 1762-1776.

slide-29
SLIDE 29

Is information integrated over time?

Singer and Kreiman, 2014

slide-30
SLIDE 30

Rapid decay in recognition of asynchronously presented object parts

Brief asynchronies disrupt object recognition

Singer and Kreiman, 2014

slide-31
SLIDE 31

The visual system has a very large capacity

slide-32
SLIDE 32

Occlusion

slide-33
SLIDE 33

Pattern completion: Objects can be recognized from partial information

slide-34
SLIDE 34

Amodal completion

slide-35
SLIDE 35

Object recognition from partial information

slide-36
SLIDE 36

Object completion task

slide-37
SLIDE 37

Object completion (unmasked condition)

NO MASK MASK Whole Partial

slide-38
SLIDE 38

Partial Information induces latencies

slide-39
SLIDE 39

Backward masking

10 ms 20 ms 30 ms 40 ms 50 ms 100 ms 200 ms

slide-40
SLIDE 40

Doubles?

Francois Brunelle

http://www.francoisbrunelle.com/

slide-41
SLIDE 41

Object completion task (masking)

slide-42
SLIDE 42

Object completion (unmasked condition)

NO MASK MASK Whole Partial Whole Partial Masked Unmasked

slide-43
SLIDE 43

Further reading

  • Regan, D. Human Perception of Objects (2000). Sinauer Associates. Sunderland,

Massachusets.

  • Frisby, JP and Stone JV. Seeing (2010). MIT Press. Cambridge, Massachusetts.

Original articles cited in class (see lecture notes for complete list)

  • Potter, MC (1969)

Recognition memory for a rapid sequence of pictures. Journal of Experimental Psychology 81:10-15.

  • Kirchner, H., & Thorpe, S. J. (2006). Ultra-rapid object detection with saccadic eye movements: visual processing speed
  • revisited. Vision Res, 46(11), 1762-1776.
  • Brady, T. F., Konkle, T., Alvarez, G. A., & Oliva, A. (2008). Visual long-term memory has a massive storage capacity for object
  • details. Proc Natl Acad Sci U S A, 105(38), 14325-14329
  • Mooney CM. (1957). Age in the development of closure ability in children. Canadian Journal of Psychology 11: 219-226
  • McKone et al, Frontiers in Psychology, 2013
  • Singer and Kreiman (2014). Short temporal asynchrony disrupts visual object recognition. Journal of Vision 12:14.
  • Tang, H., et al. (2014). "Spatiotemporal dynamics underlying object completion in human ventral visual cortex." Neuron 83: 736-

748.

  • Tang, H., et al. (2014). "A role for recurrent processing in object completion: neurophysiological, psychophysical and

computational evidence." CBMM Memo(9).