CS 532: 3D Computer Vision Lecture 3 Enrique Dunn - PowerPoint PPT Presentation

1 CS 532: 3D Computer Vision Lecture 3 Enrique Dunn edunn@stevens.edu Lieb 310

Course TA Andy Wiggins <awiggins@stevens.edu> Office hours: Lieb lounge on Wednesdays &Thursdays 2pm-4pm 2

RANSAC Slides by R. Hartley, A. Zisserman and M. Pollefeys 3

Robust Estimation • What if set of matches contains gross outliers? 4

RANSAC Objective Robust fit of model to data set S which contains outliers Algorithm (i) Randomly select a sample of s data points from S and instantiate the model from this subset. (ii) Determine the set of data points S i which are within a distance threshold t of the model. The set S i is the consensus set of samples and defines the inliers of S. (iii) If the subset of S i is greater than some threshold T , re- estimate the model using all the points in S i and terminate (iv) If the size of S i is less than T , select a new subset and repeat the above. (v) After N trials the largest consensus set S i is selected, and the model is re-estimated using all the points in the subset S i 5

How Many Samples? Choose N so that, with probability p , at least one random sample is free from outliers. e.g. p =0.99 ( ) N Sampling Inlier Data point s ( ) 1 1 e 1 p − − = − Sampling All-Inlier Set ( ) s ( ) ( ) N log 1 p / log 1 1 e = − − − Sampling Contaminated Set proportion of outliers e s 5% 10% 20% 25% 30% 40% 50% 2 2 3 5 6 7 11 17 3 3 4 7 9 11 19 35 4 3 5 9 13 17 34 72 5 4 6 12 17 26 57 146 6 4 7 16 24 37 97 293 7 4 8 20 33 54 163 588 8 5 9 26 44 78 272 1177 6

Acceptable Consensus Set • Typically, terminate when inlier ratio reaches expected ratio of inliers ( ) n T = 1 e − 7

Adaptively Determining the Number of Samples e is often unknown a priori, so pick worst case, e.g. 50%, and adapt if more inliers are found, e.g. 80% would yield e =0.2 – N = ∞ , sample_count =0 – While N > sample_count repeat • Choose a sample and count the number of inliers • Set e=1-(number of inliers)/(total number of points) • Recompute N from e • Increment the sample_count by 1 ( ( ) ) s N log ( 1 p ) / log 1 ( 1 e ) = − − − – Terminate 8

Other robust algorithms • RANSAC maximizes number of inliers • LMedS minimizes median error • Not recommended: case deletion, iterative least-squares, etc. 9

Automatic Computation of H Objective Compute homography between two images Algorithm (i) Interest points: Compute interest points in each image (ii) Putative correspondences: Compute a set of interest point matches based on some similarity measure (iii) RANSAC robust estimation: Repeat for N samples (a) Select 4 correspondences and compute H (b) Calculate the distance d ⊥ for each putative match (c) Compute the number of inliers consistent with H ( d ⊥ < t ) Choose H with most inliers (iv) Optimal estimation: re-estimate H from all inliers by minimizing ML cost function with Levenberg-Marquardt (v) Guided matching: Determine more matches using prediction by computed H Optionally iterate last two steps until convergence 10

Determine Putative Correspondences • Compare interest points Similarity measure: – SAD, SSD, ZNCC in small neighborhood • If motion is limited, only consider interest points with similar coordinates 11

Example: robust computation Interest points (500/image) (640x480) #in 1-e adapt. N 6 2% 20M 10 3% 2.5M 44 16% 6,922 Putative correspondences (268) 58 21% 2,291 (Best match,SSD<20) 73 26% 911 Outliers (117) 151 56% 43 ( t =1.25 pixel; 43 iterations) Inliers (151) Final inliers (262) 12

Radial Distortion and Undistortion Slides by R. Hartley, A. Zisserman and M. Pollefeys 13

Radial Distortion short and long focal length 14

Typical Undistortion Model Correction of distortion Choice of the distortion function and center Computing the parameters of the distortion function (i) Minimize with additional unknowns (ii) Straighten lines (iii) … 17

Why Undistort? 18

Two-View Geometry Slides by R. Hartley, A. Zisserman and M. Pollefeys 19

Three questions: Correspondence geometry: Given an image point x in the first (i) image, how does this constrain the position of the corresponding point x’ in the second image? (ii) Camera geometry (motion): Given a set of corresponding image points {x i ↔ x’ i }, i=1,…,n, what are the cameras P and P’ for the two views? (iii) Scene geometry (structure): Given corresponding image points x i ↔ x’ i and cameras P, P’, what is the position of (their pre- image) X in space? 20

The Epipolar Geometry C, C’, x, x’ and X are coplanar 21

The Epipolar Geometry What if only C,C’,x are known? 22

The Epipolar Geometry All points on π project on l and l’ 23

The Epipolar Geometry Family of planes π and lines l and l’ Intersection in e and e’ 24

The Epipolar Geometry epipoles e, e’ = intersection of baseline with image plane = projection of projection center in other image = vanishing point of camera motion direction an epipolar plane = plane containing baseline (1-D family) an epipolar line = intersection of epipolar plane with image (always come in corresponding pairs) 25

Example: Converging Cameras 26

Exa Examp mple le: Mo Motio ion Pa Para ralle llel l to Ima mage Pla Plane (simple for stereo → rectification) 27

Example: Forward Motion e’ e 28

The Fundamental Matrix F algebraic representation of epipolar geometry x � l' we will see that mapping is a (singular) correlation (i.e. projective mapping from points to lines) represented by the fundamental matrix F 29

The Fundamental Matrix F correspondence condition The fundamental matrix satisfies the condition that for any pair of corresponding points x ↔ x’ in the two images x' T Fx 0 ( x' T = ) = l' 0 30

The Fundamental Matrix F F is the unique 3x3 rank 2 matrix that satisfies x’ T Fx=0 for all x ↔ x’ (i) Transpose: if F is fundamental matrix for (P,P’), then F T is fundamental matrix for (P’,P) (ii) Epipolar lines: l’=Fx & l=F T x’ (iii) Epipoles: on all epipolar lines, thus e’ T Fx=0, ∀ x ⇒ e’ T F=0, similarly Fe=0 (iv) F has 7 d.o.f. , i.e. 3x3-1(homogeneous)-1(rank2) (v) F is a correlation, projective mapping from a point x to a line l’=Fx (not a proper correlation, i.e. not invertible) 31

Two View Geometry Computation: Linear Algorithm x' T Fx 0 For every match (m,m´): = x ' xf x ' yf x ' f y ' xf y ' yf y ' f xf yf f 0 + + + + + + + + = 11 12 13 21 22 23 31 32 33 separate known from unknown T [ ][ ] x ' x , x ' y , x ' , y ' x , y ' y , y ' , x , y , 1 f , f , f , f , f , f , f , f , f 0 = 11 12 13 21 22 23 31 32 33 (data) (unknowns) (linear) x ' x x ' y x ' y ' x y ' y y ' x y 1 ⎡ ⎤ 1 1 1 1 1 1 1 1 1 1 1 1 � � � � � � � � � f 0 ⎢ ⎥ = ⎢ x ' x x ' y x ' y ' x y ' y y ' x y 1 ⎥ n n n n n n n n n n n n ⎣ ⎦ Af = 0 32

Benefits from having F • Given a pixel in one image, the corresponding pixel has to lie on epipolar line • Search space reduced from 2-D to 1-D 33

Image Pair Rectification simplify stereo matching by warping the images Apply projective transformation so that epipolar lines correspond to horizontal scanlines e e 1 ⎡ ⎤ map epipole e to (1,0,0) 0 He ⎢ ⎥ = 0 ⎢ ⎥ try to minimize image distortion ⎣ ⎦ problem when epipole in (or close to) the image 34

Pla Planar r Rect ctif ifica icatio ion (standard approach) Bring two views to standard stereo setup (moves epipole to ∞ ) (not possible when in/close to image) 35

The Essential Matrix ~fundamental matrix for calibrated cameras (remove K) T [ ] E t R R[R t] = = × × ˆ T ˆ x ' E x 0 = ( ) -1 -1 ˆ ˆ x K x; x ' K x' = = T E K' FK = 5 d.o.f. (3 for R; 2 for t up to scale) E is an essential matrix if and only if two singular values are equal (and the third=0) T E = Udiag(1,1, 0)V 37

Four Possible Solutions from E Given E and setting the first camera matrix P = [I | 0], there are four possible solutions for P’ (only one solution, however, where a reconstructed point is in front of both cameras) 38

Fundamental Matrix Estimation 39

Epipolar Geometry: Basic Equation x' T Fx 0 = x ' xf x ' yf x ' f y ' xf y ' yf y ' f xf yf f 0 + + + + + + + + = 11 12 13 21 22 23 31 32 33 separate known from unknown T [ ][ ] x ' x , x ' y , x ' , y ' x , y ' y , y ' , x , y , 1 f , f , f , f , f , f , f , f , f 0 = 11 12 13 21 22 23 31 32 33 (data) (unknowns) (linear) x ' x x ' y x ' y ' x y ' y y ' x y 1 ⎡ ⎤ 1 1 1 1 1 1 1 1 1 1 1 1 � � � � � � � � � f 0 ⎢ ⎥ = ⎢ x ' x x ' y x ' y ' x y ' y y ' x y 1 ⎥ ⎣ n n n n n n n n n n n n ⎦ Af = 0 40

CS 532: 3D Computer Vision Lecture 3 Enrique Dunn - PowerPoint PPT Presentation

1 CS 532: 3D Computer Vision Lecture 3 Enrique Dunn edunn@stevens.edu Lieb 310 Course TA Andy Wiggins <awiggins@stevens.edu> Office hours: Lieb lounge on Wednesdays &Thursdays 2pm-4pm 2 RANSAC Slides by R. Hartley, A.

Jodi Kaus Associate Director for OSFA Director of Powercat Financial 785-532-6420 785-532-2889

Wharton State Forest Disturbed Sites 532 532 [ 9 [ 534 [ }

10 East 40 th Street, Suite 3601, New York, NY 10016 Tel: +1.212.532.1158 | Fax: +1.212.532.1162 |

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

1 2 CS 532: 3D Computer Vision Lecture 2 Enrique Dunn edunn@stevens.edu Lieb 310 Image

Anthemius of Tralles and Isidorus of Miletus, Hagia Sophia, Istanbul, Turkey 532-537 Anthemius of

CS262: Computer Vision (and Human-Computer Interaction) John Magee 1 Computer Vision How are

Branding Presentation VISION Mevushal VISION Muscat of Alexandria & Viognier VISION

Vision Services Vision Services & & Vision Therapy Vision Therapy February 2, 2007

Vision Our National Church partners .. Vision Our National Network partners Vision Getting

Scalability and Replication Marco Serafini COMPSCI 532 Lecture 13 Scalability 2 Scalability

Resource Management Marco Serafini COMPSCI 532 Lecture 17 What Are the Functions of an OS?

TensorFlow Marco Serafini COMPSCI 532 Lecture 20 Motivations DistBelief: Previous

File Systems and Storage Marco Serafini COMPSCI 532 Lecture 14 2 Why GFS? Store the

Graph Processing Marco Serafini COMPSCI 532 Lecture 9 Graph Analytics Marco Serafini 2 2

Analytical Query Processing Marco Serafini COMPSCI 532 Lecture 7 Announcement Midterm date

Harnessing the Power of the Oceans SPEAKERS Panelists H.E. Nani Chrougha, Minister of Fisheries

Dynamics of axion strings and Implications for axion dark matter Toyokazu Sekiguchi (RESCEU,

Numerical Coarsening using Discontinuous Shape Functions Jiong Chen 1 , Hujun Bao 1 , Tianyu Wang

Depth 4 lower bounds for elementary symmetric polynomials Nutan Limaye CSE, IITB Joint work

Computer Vision Exercise Session 4 (Discussions) Institute of Visual Computing SIFT feature

Image S(tching Ali Farhadi CSE 576 Several

CS 730/830: Intro AI Unsuperv. Learning asst 11 posted Wheeler Ruml (UNH) Lecture 23, CS 730

Lecture: RANSAC and feature detectors 08-Oct-2019 Juan Carlos Niebles and Ranjay Krishna

CS 532: 3D Computer Vision Lecture 3 Enrique Dunn - PowerPoint PPT Presentation

1 CS 532: 3D Computer Vision Lecture 3 Enrique Dunn edunn@stevens.edu Lieb 310 Course TA Andy Wiggins <awiggins@stevens.edu> Office hours: Lieb lounge on Wednesdays &Thursdays 2pm-4pm 2 RANSAC Slides by R. Hartley, A.

Jodi Kaus Associate Director for OSFA Director of Powercat Financial 785-532-6420 785-532-2889

Wharton State Forest Disturbed Sites 532 532 [ 9 [ 534 [ }

10 East 40 th Street, Suite 3601, New York, NY 10016 Tel: +1.212.532.1158 | Fax: +1.212.532.1162 |

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

1 2 CS 532: 3D Computer Vision Lecture 2 Enrique Dunn edunn@stevens.edu Lieb 310 Image

*Anthemius of Tralles and Isidorus of Miletus, Hagia Sophia, Istanbul, Turkey 532-537 *Anthemius of

CS262: Computer Vision (and Human-Computer Interaction) John Magee 1 Computer Vision How are

Branding Presentation VISION Mevushal VISION Muscat of Alexandria &amp; Viognier VISION

Vision Services Vision Services &amp; &amp; Vision Therapy Vision Therapy February 2, 2007

Vision Our National Church partners .. Vision Our National Network partners Vision Getting

Scalability and Replication Marco Serafini COMPSCI 532 Lecture 13 Scalability 2 Scalability

Resource Management Marco Serafini COMPSCI 532 Lecture 17 What Are the Functions of an OS?

TensorFlow Marco Serafini COMPSCI 532 Lecture 20 Motivations DistBelief: Previous

File Systems and Storage Marco Serafini COMPSCI 532 Lecture 14 2 Why GFS? Store the

Graph Processing Marco Serafini COMPSCI 532 Lecture 9 Graph Analytics Marco Serafini 2 2

Analytical Query Processing Marco Serafini COMPSCI 532 Lecture 7 Announcement Midterm date

Harnessing the Power of the Oceans SPEAKERS Panelists H.E. Nani Chrougha, Minister of Fisheries

Dynamics of axion strings and Implications for axion dark matter Toyokazu Sekiguchi (RESCEU,

Numerical Coarsening using Discontinuous Shape Functions Jiong Chen 1 , Hujun Bao 1 , Tianyu Wang

Depth 4 lower bounds for elementary symmetric polynomials Nutan Limaye CSE, IITB Joint work

Computer Vision Exercise Session 4 (Discussions) Institute of Visual Computing SIFT feature

Image S(tching Ali Farhadi CSE 576 Several

CS 730/830: Intro AI Unsuperv. Learning asst 11 posted Wheeler Ruml (UNH) Lecture 23, CS 730

Lecture: RANSAC and feature detectors 08-Oct-2019 Juan Carlos Niebles and Ranjay Krishna

Anthemius of Tralles and Isidorus of Miletus, Hagia Sophia, Istanbul, Turkey 532-537 Anthemius of

Branding Presentation VISION Mevushal VISION Muscat of Alexandria & Viognier VISION

Vision Services Vision Services & & Vision Therapy Vision Therapy February 2, 2007