The Human Visual System EE367/CS448I: Computational Imaging and - - PowerPoint PPT Presentation

the human visual system
SMART_READER_LITE
LIVE PREVIEW

The Human Visual System EE367/CS448I: Computational Imaging and - - PowerPoint PPT Presentation

an engineering-focused introduction to The Human Visual System EE367/CS448I: Computational Imaging and Display stanford.edu/class/ee367 Lecture 2 Gordon Wetzstein Stanford University nautilus eye, wikipedia Dawkins,


slide-1
SLIDE 1

The Human Visual System

Gordon Wetzstein Stanford University an engineering-focused introduction to EE367/CS448I: Computational Imaging and Display stanford.edu/class/ee367 Lecture 2

slide-2
SLIDE 2

nautilus eye, wikipedia

slide-3
SLIDE 3

Dawkins, “Climbing Mount Improbable”, Norton & Company, 1997

slide-4
SLIDE 4

reptile eye, http://pichost.me/1608580/

slide-5
SLIDE 5

Evolution of the Eye

wikipedia

  • wl, https://www.pinterest.com/pin/452400725039917330/
slide-6
SLIDE 6

pigeon, http://globe-views.com/dreams/pigeon.html

slide-7
SLIDE 7

jumping spider, wikipedia

slide-8
SLIDE 8

national geographics

slide-9
SLIDE 9

Summary of Human Visual System (HVS)

  • visual acuity

visual acuity: 20/20 is ~1 arc min

  • field of view

field of view: ~190° monocular, ~120° binocular, ~135° vertical

  • temporal resolution

temporal resolution: ~60 Hz (depends on contrast, luminance)

  • dynamic range

dynamic range: instantaneous 6.5 f-stops, adapt to 46.5 f-stops

  • color

color: everything in the CIE xy diagram; distances are linear in CIE Lab

  • depth cues in 3D displays

depth cues in 3D displays: vergence, focus, conflicts, (dis)comfort

  • accommodation range

accommodation range: ~8cm to ∞, degrades with age

slide-10
SLIDE 10

Overview

sensors network high-level processing low-level processing compute

wikipedia

slide-11
SLIDE 11

Overview

primary visual cortex ventral stream: recognition, object identification dorsal stream: spatial awareness

wikipedia

slide-12
SLIDE 12

Anatomy of the Human Eye

slide-13
SLIDE 13

The Retina

slide-14
SLIDE 14

The Retina

Roorda & Williams, 1999, Nature

5 arcmin visual angle

slide-15
SLIDE 15

Color Perception

slide-16
SLIDE 16

Color Perception - Sensitivity of Cones

reddit.com

slide-17
SLIDE 17

Oculumotor Processes

near focus far focus

adithyakiran.wordpress.com

16 years: ~8cm to ∞ 50 years: ~50cm to ∞ (mostly irrelevant)

slide-18
SLIDE 18

Oculumotor Processes + Visual Cues

slide-19
SLIDE 19

Visual Field / Field of View

monocular visual field

Ruch & Fulton, 1960

binocular visual field

slide-20
SLIDE 20

Immersive VR – How Important is the FOV?

slide-21
SLIDE 21

Visual Acuity

characters are 5 arc min, need to resolve 1 arc min to read

Snellen chart

slide-22
SLIDE 22

Retina Displays

Apple Inc. d p=?? eye α p=2d tan(α/2)

Steve Jobs: 300 dpi is retina resolution

p=2*12”*tan(1 arc min /2)=0.0035”

tablet, 12” away, resolvable pixel:

  • ur math: ~286 dpi
slide-23
SLIDE 23

Dynamic Range

slide-24
SLIDE 24

High Dynamic Range Displays

slide-25
SLIDE 25

Refractive Errors

slide-26
SLIDE 26

300 dpi or higher

Vision-Correcting Displays

slide-27
SLIDE 27

Eye vs Camera

wikipedia [Williams 91]

slide-28
SLIDE 28

Contrast

Which image has a higher contrast? What is contrast? global vs. local, Weber contrast: Michelson contrast:

I − I

slide-29
SLIDE 29

Contrast Sensitivity Function

peak at ~4-6 cpd spatial frequency contrast

Campbell & Robson, 1968; Daly, 1993

shifts depending on viewing distance! packing density of cones ~60 cpd

slide-30
SLIDE 30

Hybrid Images

Oliva, Torralba, & Schyns, 2006, ACM SIGGRAPH

slide-31
SLIDE 31

Hybrid Images

Oliva, Torralba, & Schyns, 2006, ACM SIGGRAPH

slide-32
SLIDE 32

Depth Perception

wikipedia

slide-33
SLIDE 33

Depth Perception

monocular cues monocular cues

  • perspective
  • relative object size
  • absolute size
  • cclusion
  • accommodation
  • retinal blur
  • motion parallax
  • texture gradients
  • shading

wikipedia

binocular cues binocular cues

  • (con)vergence
  • disparity / parallax
slide-34
SLIDE 34

binocular disparity binocular disparity motion parallax motion parallax accommodation/blur accommodation/blur convergence convergence current glasses-based (stereoscopic) displays current glasses-based (stereoscopic) displays near-term: light field displays near-term: light field displays longer-term: holographic displays longer-term: holographic displays

Depth Perception

slide-35
SLIDE 35

Depth Perception

Cutting & Vishton, 1995

slide-36
SLIDE 36

Visual Illusions – Perspective, Occlusion, Size

M.C. Escher

slide-37
SLIDE 37
slide-38
SLIDE 38

Visual Illusions – Which Cues are These?

Held et al., 2006, ACM SIGGRAPH

slide-39
SLIDE 39

Walker, Lewis E., 1865. Hon. Abraham Lincoln, President of the United States. Library of Congress Charles Wheatstone., 1841. Stereoscope.

Stereoscopic Displays

slide-40
SLIDE 40

Stereoscopic Displays

slide-41
SLIDE 41

Stereoscopic Displays

Charles Wheatstone 1838 stereoscopic displays 176 years later

slide-42
SLIDE 42

A Brief History of Virtual Reality

1838 1968 2012-2019

Stereoscopes

Wheatstone, Brewster, …

VR, AR,

Ivan Sutherland

VR explosion

Oculus, Sony, Valve, MS, …

Next-generation VR/AR Displays

slide-43
SLIDE 43

Vergence-Accommodation Conflict

Marty Banks, UC Berkeley

effects effects

  • visual discomfort
  • visual fatigue
  • nausea
  • diplopic vision
  • eyestrain
  • compromised

image quality

  • pathologies in

developing visual system

slide-44
SLIDE 44

Top View Real World:

  • Vergence & Accommodation Match!

Match!

slide-45
SLIDE 45

Top View Stereo Displays Today:

  • Vergence-Accommodation Mismatch!

Mismatch! Screen

slide-46
SLIDE 46

Zone of Comfort

Shibata et al, 2011, Journal of Vision

slide-47
SLIDE 47

VR/AR Displays with Focus Cues

Konrad et al., SIGCHI 2016; Padmanaban et al., PNAS 2017

Gaze-contingent Focus Displays Near-eye Light Field Displays Accommodation- invariant Displays

Huang et al., SIGGRAPH 2015; Wetzstein et al., SIGGRAPH 2011, 2012 Konrad et al., SIGGRAPH 2017

slide-48
SLIDE 48

Summary

  • visual acuity

visual acuity: 20/20 is ~1 arc min

  • field of view

field of view: ~190° monocular, ~120° binocular, ~135° vertical

  • temporal resolution

temporal resolution: ~60 Hz (depends on contrast, luminance)

  • dynamic range

dynamic range: instantaneous 6.5 f-stops, adapt to 46.5 f-stops

  • color

color: everything in the CIE xy diagram; distances are linear in CIE Lab

  • depth cues in 3D displays

depth cues in 3D displays: vergence, focus, conflicts, (dis)comfort

  • accommodation range

accommodation range: ~8cm to ∞, degrades with age

slide-49
SLIDE 49

Homework I

wikipedia

  • take a step back in evolution

take a step back in evolution

  • build a pinhole camera

build a pinhole camera

  • capture photos with it

capture photos with it

  • read instructions carefully!

read instructions carefully!

slide-50
SLIDE 50

light light leakage leakage

Homework I – Build a Pinhole Camera

digital camera digital camera blocked optical path blocked optical path

slide-51
SLIDE 51

Next: Digital Photography I

  • ptics
  • aperture
  • depth of field
  • field of view
  • noise
  • sensors
  • color filter arrays
slide-52
SLIDE 52

References and Further Reading

interesting textbooks on perception:

  • Wandell, “Foundations of Vision”, Sinauer Associates,1995
  • Howard, “Perceiving in Depth”, Oxford University Press, 2012

depth cues and more:

  • Cutting & Vishton,” Perceiving layout and knowing distances: The interaction, relative potency, and contextual use of different information about depth”, Epstein and Rogers

(Eds.), Perception of space and motion, 1995

  • Held, Cooper, O’Brien, Banks, “Using Blur to Affect Perceived Distance and Size”, ACM Transactions on Graphics, 2010
  • Hoffman and Banks, “Focus information is used to interpret binocular images”. Journal of Vision 10, 2010
  • Hoffman, Girshick, Akeley, and Banks, “Vergence-accommodation conflicts hinder visual performance and cause visual fatigue”. Journal of Vision 8, 2008
  • Huang, Chen, Wetzstein, “The Light Field Stereoscope”, ACM SIGGRAPH 2015

the retina and visual acuity:

  • Roorda, Williams, “The arrangement of the three cone classes in the living human eye”, Nature, Vol 397, 1999
  • Snellen chart: https://en.wikipedia.org/wiki/Snellen_chart

the visual field:

  • Ruch and Fulton, Medical physiology and biophysics, 1960
  • contrast sensitivity function & hybrid images:
  • Oliva

Oliva, , Torralba Torralba, , Schyns Schyns, “Hybrid Images”, ACM Transactions on Graphics (SIGGRAPH), 2006 , “Hybrid Images”, ACM Transactions on Graphics (SIGGRAPH), 2006

  • Spatio-temporal CSF: Kelly, Motion and Vision. II. Stabilized spatio-temporal threshold surface, Journal of the Optical Society of America,1979
  • Mantiuk, Kim, Rempel, Heidrich, “HDR-VDP-2: A calibrated visual metric for visibility and quality predictions in all luminance conditions”, SIGGRAPH 2011