3D Vision Viktor Larsson Spring 2019 Schedule Feb 18 - PowerPoint PPT Presentation

3D Vision Viktor Larsson Spring 2019

Schedule Feb 18 Introduction Feb 25 Geometry, Camera Model, Calibration Mar 4 Features, Tracking / Matching Mar 11 Project Proposals by Students Mar 18 Structure from Motion (SfM) + papers Mar 25 Dense Correspondence (stereo / optical flow) + papers Apr 1 Bundle Adjustment & SLAM + papers Apr 8 Student Midterm Presentations Apr 15 Multi-View Stereo & Volumetric Modeling + papers Easter break Apr 22 Apr 29 3D Modeling with Depth Sensors + papers May 6 3D Scene Understanding + papers May 13 4D Video & Dynamic Scenes + papers May 20 papers May 27 Student Project Demo Day = Final Presentations

3D Vision – Class 3 Features & Correspondences feature extraction, image descriptors, feature matching, feature tracking Chapters 4, 8 in Szeliski’s Book [Shi & Tomasi, Good Features to Track, CVPR 1994]

Overview • Local Features • Invariant Feature Detectors • Invariant Descriptors & Matching • Feature Tracking

Importance of Features Features are key component of many 3D Vision algorithms

Importance of Features Schönberger & Frahm, Structure-From-Motion Revisited, CVPR 2016

Feature Detectors & Descriptors • Detector : Find salient structures • Corners, blob-like structures, ... • Keypoints should be repeatable • Descriptor : Compact representation of image region around keypoint • Describes patch around keypoints • Establish matches between images by comparing descriptors

Feature Detectors & Descriptors (Lowe, Distinctive Image Features From Scale-Invariant Keypoints , IJCV’04)

Feature Matching vs. Tracking Matching Tracking • Extract features independently • Extract features in first image • Match by comparing descriptors • Find same feature in next view

Wide Baseline Matching • Requirement to cope with larger variations between images  • Translation, rotation, scaling geometric • Perspective foreshortening transformations  • Non-diffuse reflections photometric transformations • Illumination

Good Detectors & Descriptors? • What are the properties of good detectors and descriptors? • Invariances against transformations • How to design such detectors and descriptors? • This lecture: • Feature detectors & their invariances • Feature descriptors, invariances, & matching • Feature tracking

Overview • Local Features Intro • Invariant Feature Detectors • Invariant Descriptors & Matching • Feature Tracking

Good Feature Detectors? • Desirable properties? • Precise (sub-pixel perfect) localization • Repeatable detections under • Rotation • Translation • Illumination • Perspective distortions • … • Detect distinctive / salient structures

Feature Point Extraction • Find “distinct” keypoints (local image patches) • As different as possible from neighbors homogeneous edge corner

Comparing Image Regions • Compare intensities pixel-by-pixel I ´ (x,y) I(x,y) • Dissimilarity measure: Sum of Squared Differences / Distances ( SSD )     2 SSD  I ( x , y )  I ( x , y )  x y ฀

Finding Stable Features • Measure uniqueness of candidate • Approximate SSD for small displacement Δ    2 SSD  w ( x i ) I ( x i   )  I ( x i ) i     2 w ( x i ) I ( x i )   I  I     I ( x i )      x  y     i   ฀ 2 I x   I I x I x I y   w ( x i )  T      T M   x 2   I x I y I y i ฀ • possible weights ฀ ฀

Finding Stable Features homogeneous edge corner Suitable feature positions should maximize i.e. maximize smallest eigenvalue of M

Harris Corner Detector • Use small local window: • Directly computing eigenvalues λ 1 , λ 2 of M is computationally expensive • Alternative measure for “ cornerness ”: = 𝜇 1 ⋅ 𝜇 2 − 𝑙 𝜇 1 + 𝜇 2 2 • Homogeneous: 𝜇 1 , 𝜇 2 small ⇒ 𝑆 small 2 < 0 • Edge: 𝜇 1 ≫ 𝜇 2 ≈ 0 ⇒ 𝑆 = 𝜇 1 ⋅ 0 − 𝑙𝜇 1 • Corner: 𝜇 1 , 𝜇 2 large ⇒ 𝑆 large

Harris Corner Detector • Alternative measure for “ cornerness ” • Select local maxima as keypoints • Subpixel accuracy through second order surface fitting (parabola in 1D)

Harris Corner Detector • Keypoint detection: Select strongest features over whole image or over each tile (e.g. 1000 per image or 2 per tile) • Invariances against geometric transformations • Shift / translation?

Geometric Invariances Rotation Harris: Yes Scale Harris: No Affine (approximately invariant w.r.t. perspective/viewpoint) Harris: No

2D Transformations of a Patch Harris corners VIP Harris corners MSER SIFT

Scale-Invariant Feature Transform (SIFT) • Detector + descriptor (later) • Recover features with position, orientation and scale (Lowe, Distinctive Image Features From Scale-Invariant Keypoints , IJCV’04)

Position • Look for strong responses of Difference-of- Gaussian filter ( DoG ) 3 2  k • Approximates Laplacian of Gaussian ( LoG ) • Detects blob-like structures • Only consider local extrema

Scale • Look for strong DoG responses over scale space    ฀  1/2 image ( σ =2) ฀ ฀ ฀    ฀  ฀ orig. image   4 ฀ 2 Slide credits: Bastian Leibe, Krystian Mikolajczyk ฀ ฀

Scale • Only consider local maxima/minima in both position and scale • Fit quadratic around extrema for subpixel & sub-scale accuracy

Minimum Contrast and “ Cornerness ” all features

Minimum Contrast and “ Cornerness ” after suppressing edge-like features

Minimum Contrast and “ Cornerness ” after suppressing edge-like features + small contrast features

Invariants So Far • Translation? Yes • Scale? Yes • Rotation? Yes

Orientation Assignment • Compute gradient for each pixel in patch at selected scale • Bin gradients in histogram & smooth histogram • Select canonical orientation at peak(s) • Keypoint = 4D coordinate 2  0 (x, y, scale, orientation)

Invariants So Far • Translation • Scale • Rotation • Brightness changes: • Additive changes? • Multiplicative changes?

2D Transformations of a Patch Harris corners VIP Harris corners MSER SIFT

Affine Invariant Features Perspective effects can locally be approximated by affine transformation

Extreme Wide Baseline Matching • Detect stable keypoints using the Maximally Stable Extremal Regions ( MSER ) detector • Detections are regions , not points! (Matas et al., Robust Wide Baseline Stereo from Maximally Stable Extremal Regions, BMVC’02)

Maximally Stable Extremal Regions Extremal regions: • Much brighter than surrounding • Use intensity threshold

Maximally Stable Extremal Regions Extremal regions: • OR: Much darker than surrounding • Use intensity threshold

Maximally Stable Extremal Regions • Regions: Connected components at a threshold • Region size = #pixels • Maximally stable: Region constant near some threshold

A Sample Feature

A Sample Feature T is maximally stable wrt. surrounding

From Regions To Ellipses • Compute „ center of gravity “ • Compute Scatter (PCA / Ellipsoid)

From Regions To Ellipses • Ellipse abstracts from pixels! • Geometric representation: position/size/shape

Achieving Invariance • Normalize to „ default “ position, size, shape • For example: Circle of radius 16 pixels

• Normalize ellipse to circle (affine transformation) • 2D rotation still unresolved

• Same approach as for SIFT: Compute histogram of local gradients • Find dominant orientation in histogram • Rotate local patch into dominant orientation

Summary: MSER Features • Detect sets of pixels brighter/darker than surrounding pixels • Fit elliptical shape to pixel set • Warp image so that ellipse becomes circle • Rotate to dominant gradient direction (other constructions possible as well)

MSER Features - Invariants • Constant brightness changes (additive and multiplicative) • Rotation, translation, scale • Affine transformations  Affine normalization of feature leads to similar patches in different views !

2D Transformations of a Patch Harris corners VIP In practice hardly observable for small patches ! Harris corners MSER SIFT

Viewpoint Invariant Patches (VIP) • Use known planar geometry to remove perspective distortion • Or: Use vanishing points to rectify patch (Wu et al., 3D Model Matching with Viewpoint Invariant Patches (VIPs), CVPR’08)

Learning Feature Detectors • In the age of deep learning, can we learn good detectors from data? • How can we model repeatable feature detection? • Learn ranking function H(x|w): R 2 → [ -1, 1] with parameters w • Interesting points close to -1 or 1 (Savinov et al., Quad-networks: unsupervised learning to rank for interest point detection, CVPR’17)

3D Vision Viktor Larsson Spring 2019 Schedule Feb 18 - PowerPoint PPT Presentation

3D Vision Viktor Larsson Spring 2019 Schedule Feb 18 Introduction Feb 25 Geometry, Camera Model, Calibration Mar 4 Features, Tracking / Matching Mar 11 Project Proposals by Students Mar 18 Structure from Motion (SfM) + papers Mar 25

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

Branding Presentation VISION Mevushal VISION Muscat of Alexandria & Viognier VISION

Vision Services Vision Services & & Vision Therapy Vision Therapy February 2, 2007

Vision Our National Church partners .. Vision Our National Network partners Vision Getting

HIM Without Walls Realizing Our Vision! Realizing Our Vision Realize Our Vision Realizing Our

J J R R Our Vision . . . Our Vision . . . Our Vision . . . Our Vision . . . TO BE THE BEST

Post- -trauma vision trauma vision Post Post- -trauma vision trauma vision Post syndrome

2017 Humana Vision 130 LOOK Whats NEW! NEW RETAIL FRAME BENEFIT 2 Humana Vision 100

Vision What is the Vision? The American Fork Canyon Vision (Vision) will ho- Few places in the

Building Our Vision St. Andrews Vision and Mission Our Vision: Our Vision: The Tree of Life is

FLITTER FLITTER The Foldable Litter Pink B Our Vision Our Vision Our Vision Our Vision A

FOCUS AREAS FOCUS AREAS FOCUS AREAS FOCUS AREAS Our Our Vision Vision Our Our Vision

So What Has So, What Has So, What Has So What Has Vision Done For Vision Done For Vision Done

Analog night vision devices April, 2020 ANALOG NIGHT VISION DEVICES Night vision devices

No Excuse Vision Weekend October 27-28, 2018 Why Vision Weekend? Because vision is

VISION ZERO SF: ELIMINATING TRAFFIC DEATHS BY 2024 FEBRUARY 6, 2017 VISION ZERO VISION ZERO SF

E9 205 Machine Learning for Signal Processing 23-8-17 Outline Basics for Image Processing

ImageProof: Enabling Authentication for Large-Scale Image Retrieval Shangwei Guo 1 Jianliang Xu 1

Computational Photography Si Lu Spring 2018 http://web.cecs.pdx.edu/~lusi/CS510/CS510_Computati

Heaps and Heapsort 1 October 2020 OSU CSE 1 Heaps A heap is a binary tree of T that

Interactive Image Mining Annie Morin 1 , Nguyen-Khang Pham 1,2 1 TEXMEX/IRISA 2 Cantho

Learning Representations for Visual Object Class Recognition Marcin Marszaek Cordelia Schmid

Texture and materials Subhransu Maji CMPSCI 670: Computer Vision December 1, 2016 CMPSCI 670

image matching presented by Dmytro Mishkin joint work with Anastasia Mishchuk, Milan Pultar,

3D Vision Viktor Larsson Spring 2019 Schedule Feb 18 - PowerPoint PPT Presentation

3D Vision Viktor Larsson Spring 2019 Schedule Feb 18 Introduction Feb 25 Geometry, Camera Model, Calibration Mar 4 Features, Tracking / Matching Mar 11 Project Proposals by Students Mar 18 Structure from Motion (SfM) + papers Mar 25

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

Branding Presentation VISION Mevushal VISION Muscat of Alexandria &amp; Viognier VISION

Vision Services Vision Services &amp; &amp; Vision Therapy Vision Therapy February 2, 2007

Vision Our National Church partners .. Vision Our National Network partners Vision Getting

HIM Without Walls Realizing Our Vision! Realizing Our Vision Realize Our Vision Realizing Our

J J R R Our Vision . . . Our Vision . . . Our Vision . . . Our Vision . . . TO BE THE BEST

Post- -trauma vision trauma vision Post Post- -trauma vision trauma vision Post syndrome

2017 Humana Vision 130 LOOK Whats NEW! NEW RETAIL FRAME BENEFIT 2 Humana Vision 100

Vision What is the Vision? The American Fork Canyon Vision (Vision) will ho- Few places in the

Building Our Vision St. Andrews Vision and Mission Our Vision: Our Vision: The Tree of Life is

FLITTER FLITTER The Foldable Litter Pink B Our Vision Our Vision Our Vision Our Vision A

FOCUS AREAS FOCUS AREAS FOCUS AREAS FOCUS AREAS Our Our Vision Vision Our Our Vision

So What Has So, What Has So, What Has So What Has Vision Done For Vision Done For Vision Done

Analog night vision devices April, 2020 ANALOG NIGHT VISION DEVICES Night vision devices

No Excuse Vision Weekend October 27-28, 2018 Why Vision Weekend? Because vision is

VISION ZERO SF: ELIMINATING TRAFFIC DEATHS BY 2024 FEBRUARY 6, 2017 VISION ZERO VISION ZERO SF

E9 205 Machine Learning for Signal Processing 23-8-17 Outline Basics for Image Processing

ImageProof: Enabling Authentication for Large-Scale Image Retrieval Shangwei Guo 1 Jianliang Xu 1

Computational Photography Si Lu Spring 2018 http://web.cecs.pdx.edu/~lusi/CS510/CS510_Computati

Heaps and Heapsort 1 October 2020 OSU CSE 1 Heaps A heap is a binary tree of T that

Interactive Image Mining Annie Morin 1 , Nguyen-Khang Pham 1,2 1 TEXMEX/IRISA 2 Cantho

Learning Representations for Visual Object Class Recognition Marcin Marszaek Cordelia Schmid

Texture and materials Subhransu Maji CMPSCI 670: Computer Vision December 1, 2016 CMPSCI 670

image matching presented by Dmytro Mishkin joint work with Anastasia Mishchuk, Milan Pultar,

Branding Presentation VISION Mevushal VISION Muscat of Alexandria & Viognier VISION

Vision Services Vision Services & & Vision Therapy Vision Therapy February 2, 2007