1/22
Consistent Multi-View Reconstruction from Epipolar Geometries with Outliers
Daniel Martinec and Tom´ aˇ s Pajdla Center for Machine Perception Department of Cybernetics, Faculty Electrical Engineering Czech Technical University, Prague
Consistent Multi-View Reconstruction from Epipolar Geometries with - - PowerPoint PPT Presentation
1/22 Consistent Multi-View Reconstruction from Epipolar Geometries with Outliers Daniel Martinec and Tom a s Pajdla Center for Machine Perception Department of Cybernetics, Faculty Electrical Engineering Czech Technical University,
1/22
Daniel Martinec and Tom´ aˇ s Pajdla Center for Machine Perception Department of Cybernetics, Faculty Electrical Engineering Czech Technical University, Prague
2/22
What is it about. . .
Epipolar geometries between image pairs avialable (Matas & al BMVC’02)
2/22
What is it about. . .
Epipolar geometries between image pairs avialable (Matas & al BMVC’2002)
2/22
What is it about. . .
2/22
What is it about. . .
Goal: projective reconstruction consistent with all images constructed from pairwise epipolar geometries
2/22
What is it about. . .
3/22
Problems
x1
1
x2
1
3/22
Problems
x1
1
x2
1
(may satisfy the epipolar geometry)
4/22
Contribution
consistent projective reconstruction from pairwise epipolar geometries
Martinec & Pajdla PCV’2002
4/22
Contribution
consistent projective reconstruction from pairwise epipolar geometries
5/22
Outline
INTRODUCTION
PART I: Perspective Cameras & Occlusions
PART II: Outlier Detection
6/22
Previous Work
[1] C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: A factorization method. In IJCV(9)2, 1992.
6/22
Previous Work
[1] C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: A factorization method. In IJCV(9)2, 1992. [2] D. Jacobs. Linear fitting with missing data: Applications to structure from motion and to characterizing intensity images. In CVPR, 1997.
6/22
Previous Work
[1] C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: A factorization method. In IJCV(9)2, 1992. [2] D. Jacobs. Linear fitting with missing data: Applications to structure from motion and to characterizing intensity images. In CVPR, 1997. [3] P. Sturm and B. Triggs. A factorization based algorithm for multi-image projective structure and motion. In ECCV, 1996.
6/22
Previous Work
[1] C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: A factorization method. In IJCV(9)2, 1992. [2] D. Jacobs. Linear fitting with missing data: Applications to structure from motion and to characterizing intensity images. In CVPR, 1997. [3] P. Sturm and B. Triggs. A factorization based algorithm for multi-image projective structure and motion. In ECCV, 1996. [4] D. Q. Huynh and A. Heyden. Outlier detection in video sequences under affine
7/22
Review of ECCV’2002
8/22
Problem Formulation & Solution
x1
1
x2
1
X1
8/22
Problem Formulation & Solution
x1
1
x2
1
X1
Perspective camera projection: λi
p xi p
= Pi
3×4
Xp
8/22
Problem Formulation & Solution
x1
1
x2
1
X1
Perspective camera projection: λi
p xi p
= Pi
3×4
Xp
λ1
1x1 1
λ1
2x1 2
. . . λ1
nx1 n
× λ2
2x2 2
× . . . ... . . . λm
1 xm 1
× . . . λm
n xm n
rescaled measurement matrix = P1 P2 . . . Pm
motion [X1X2 . . . Xn]
structure
8/22
Problem Formulation & Solution
x1
1
x2
1
X1
Perspective camera projection: λi
p xi p
= Pi
3×4
Xp
λ1
1x1 1
λ1
2x1 2
. . . λ1
nx1 n
× λ2
2x2 2
× . . . ... . . . λm
1 xm 1
× . . . λm
n xm n
rescaled measurement matrix = P1 P2 . . . Pm
motion [X1X2 . . . Xn]
structure
j using, e.g., the epipolar geometry.
Repeat 1 and 2 until R is complete.
9/22
Estimation of λi
p (Sturm & Triggs)
uses the epipolar geometry
✂✁ ☎✄ ✆ ✝ ✞ ✁ ✞ ✄9/22
Estimation of λi
p (Sturm & Triggs)
uses the epipolar geometry
✂✁ ☎✄ ✆ ✝ ✞ ✁ ✞ ✄λ1x1 = τe + λ2x2
9/22
Estimation of λi
p (Sturm & Triggs)
uses the epipolar geometry
✂✁ ☎✄ ✆ ✝ ✞ ✁ ✞ ✄λ1x1 = τe + λ2x2
λi
p = (eij ∧ xi
p) · (Fij xj p)
eij ∧ xi
p 2
λj
p
c
10/22
Filling of ×
p = 1)
10/22
Filling of ×
p = 1)
for perspective cameras (various λi
p)
10/22
Filling of ×
p = 1)
for perspective cameras (various λi
p)
(rank R = 4 because R = P
X
)
10/22
Filling of ×
p = 1)
for perspective cameras (various λi
p)
(rank R = 4 because R = P
X
)
a 4-tuple of “LI” columns Bk = ? x1
1
λ1
2x1 2
λ1
3x1 3
λ1
4x1 4
λ2
1x2 1
λ2
2x2 2
λ2
3x2 3
λ2
4x2 4
. . . λm
1 xm 1
λm
2 xm 2
λm
3 xm 3
× =
? x1
1
? y1
1
? w1
1
λ1
2x1 2
λ1
2y1 2
λ1
2w1 2
λ1
3x1 3
λ1
3y1 3
λ1
3w1 3
λ1
4x1 4
λ1
4y1 4
λ1
4w1 4
λ2
1x2 1
λ2
1y2 1
λ2
1w2 1
λ2
2x2 2
λ2
2y2 2
λ2
2w2 2
λ2
3x2 3
λ2
3y2 3
λ2
3w2 3
λ2
4x2 4
λ2
4y2 4
λ2
4w2 4
. . .
λm
1 xm 1
λm
1 ym 1
λm
1 wm 1
λm
2 xm 2
λm
2 ym 2
λm
2 wm 2
λm
3 xm 3
λm
3 ym 3
λm
3 wm 3
× × ×
linear hull of all possible fillings Bk = Span(
1
y1
1
w1
1
λ1
2x1 2
λ1
2y1 2
λ1
2w1 2
λ1
3x1 3
λ1
3y1 3
λ1
3w1 3
4x1 4
λ1
4y1 4
λ1
4w1 4
λ2
1x2 1
λ2
1y2 1
λ2
1w2 1
λ2
2x2 2
λ2
2y2 2
λ2
2w2 2
λ2
3x2 3
λ2
3y2 3
λ2
3w2 3
λ2
4x2 4
λ2
4y2 4
λ2
4w2 4
. . .
λm
1 xm 1
λm
1 ym 1
λm
1 wm 1
λm
2 xm 2
λm
2 ym 2
λm
2 wm 2
λm
3 xm 3
λm
3 ym 3
λm
3 wm 3
1 1 1
)
10/22
Filling of ×
p = 1)
for perspective cameras (various λi
p)
(rank R = 4 because R = P
X
)
a 4-tuple of “LI” columns Bk = ? x1
1
λ1
2x1 2
λ1
3x1 3
λ1
4x1 4
λ2
1x2 1
λ2
2x2 2
λ2
3x2 3
λ2
4x2 4
. . . λm
1 xm 1
λm
2 xm 2
λm
3 xm 3
× =
? x1
1
? y1
1
? w1
1
λ1
2x1 2
λ1
2y1 2
λ1
2w1 2
λ1
3x1 3
λ1
3y1 3
λ1
3w1 3
λ1
4x1 4
λ1
4y1 4
λ1
4w1 4
λ2
1x2 1
λ2
1y2 1
λ2
1w2 1
λ2
2x2 2
λ2
2y2 2
λ2
2w2 2
λ2
3x2 3
λ2
3y2 3
λ2
3w2 3
λ2
4x2 4
λ2
4y2 4
λ2
4w2 4
. . .
λm
1 xm 1
λm
1 ym 1
λm
1 wm 1
λm
2 xm 2
λm
2 ym 2
λm
2 wm 2
λm
3 xm 3
λm
3 ym 3
λm
3 wm 3
× × ×
linear hull of all possible fillings Bk = Span(
1
y1
1
w1
1
λ1
2x1 2
λ1
2y1 2
λ1
2w1 2
λ1
3x1 3
λ1
3y1 3
λ1
3w1 3
4x1 4
λ1
4y1 4
λ1
4w1 4
λ2
1x2 1
λ2
1y2 1
λ2
1w2 1
λ2
2x2 2
λ2
2y2 2
λ2
2w2 2
λ2
3x2 3
λ2
3y2 3
λ2
3w2 3
λ2
4x2 4
λ2
4y2 4
λ2
4w2 4
. . .
λm
1 xm 1
λm
1 ym 1
λm
1 wm 1
λm
2 xm 2
λm
2 ym 2
λm
2 wm 2
λm
3 xm 3
λm
3 ym 3
λm
3 wm 3
1 1 1
)
→ constraint on R
→ fill R
11/22
Wide Base-Line Stereo
LM = lin. method, BA = bundle adjustment
Scene House 10 images [2952×2003] Point detection manual, 203 points in space Depth estimation central image No. 1 Amount of missing data 47.83 % LM Mean error per image point [pxl] 3.91 LM + BA 1.44 10 images 203 points
”•” λi
p ←
− F (75.7 %), ”◦” λi
p ←
− B (24.3 %), ” ” occlusion
12/22
Dense Sequence (Hannover)
Scene Dinosaur (Oxford) 36 images [720×576] Point detection Harris’ operator, 4983 points in space Depth estimation sequence Amount of missing data 90.84 % LM Mean error per image point [pxl] 1.76 LM + BA 0.64 36 images 4983 points
”•” λi
p ←
− F (100.0 %), ” ” occlusion
13/22
Review of PCV’2002
14/22
Correspondences from the Epipolar Geometry
some outliers satisfy the epipolar geometry epipolar geometry from a dominant plane & outliers
15/22
Problem Formulation & Related Work
Perspective camera projection: λi
p xi p
= Pi
3×4
Xp
xi
p . . . outliers
λ1
1x1 1
λ1
2x1 2
. . . λ1
nx1 n
× λ2
2x2 2
× . . . ... . . . λm
1 xm 1
× . . . λm
n xm n
rescaled measurement matrix = P1 P2 . . . Pm
motion [X1X2 . . . Xn]
structure
16/22
Outlier Detection — Main Idea
Pairwise epipolar geometry − → not many outliers
16/22
Outlier Detection — Main Idea
Pairwise epipolar geometry − → not many outliers Assumption One consistent structure & random independent outliers
16/22
Outlier Detection — Main Idea
Pairwise epipolar geometry − → not many outliers Assumption One consistent structure & random independent outliers Main Idea Minimal configurations of points in triples of images are sufficient to validate inliers reliably.
16/22
Outlier Detection — Main Idea
Pairwise epipolar geometry − → not many outliers Assumption One consistent structure & random independent outliers Main Idea Minimal configurations of points in triples of images are sufficient to validate inliers reliably. = ⇒
→ T .
→ validate points not used for estimation T as inliers. Repeat 1 and 2 until there is enough inliers.
16/22
Outlier Detection — Main Idea
Pairwise epipolar geometry − → not many outliers Assumption One consistent structure & random independent outliers Main Idea Minimal configurations of points in triples of images are sufficient to validate inliers reliably. = ⇒
→ T .
→ validate points not used for estimation T as inliers. Repeat 1 and 2 until there is enough inliers. − → estimate reconstruction using our method for occlusions [ECCV’2002] (xi
p → ×)
17/22
Consistent Multi-View Reconstruction — Algorithm
17/22
Consistent Multi-View Reconstruction — Algorithm
conflicting ones.
17/22
Consistent Multi-View Reconstruction — Algorithm
conflicting ones.
17/22
Consistent Multi-View Reconstruction — Algorithm
conflicting ones.
17/22
Consistent Multi-View Reconstruction — Algorithm
conflicting ones.
17/22
Consistent Multi-View Reconstruction — Algorithm
conflicting ones.
In each column, p, of R: (a) Random triple of image points, P − → Xp (b) If repr. error small − → inliers Repeat (a) and (b) until the column is sufficiently sampled
17/22
Consistent Multi-View Reconstruction — Algorithm
conflicting ones.
In each column, p, of R: (a) Random triple of image points, P − → Xp (b) If repr. error small − → inliers Repeat (a) and (b) until the column is sufficiently sampled
18/22
Experiments on WBS
18/22
Experiments on WBS
Scene Valbonne all pairs (Oxford) 14 images [768 × 512] Outliers 396 (28.14 % of 1407 image points)
14 / 0
271 / 32 / 105 of 376 Mean / maximal reprojection error 0.45 / 3.66 pxl (from inliers) 14 images 376 correspondences
19/22
20/22
Some Other WBS Experitments. . .
Scene Movi-house (CMP) 14 images [512 × 512] Outliers 207 (44.90 % of 461 image points)
9 / 0
67 / 33 / 34 of 101 Mean / maximal reprojection error 0.75 / 5.27 pxl (from inliers) 9 images 101 correspondences
21/22
Some Other WBS Experitments. . .
Scene Shelf (CMP) 12 images [1200 × 1600] Outliers 414 (6.46 % of 6411 image points)
12 / 0
1839 / 72 / 114 of 1953 Mean / maximal reprojection error 0.51 / 4.90 pxl (from inliers) 12 images 1953 correspondences
22/22
Conclusion
camera
a 4-tuple of “LI” columns Bk = ? x1
1
λ1
2x1 2
λ1
3x1 3
λ1
4x1 4
λ2
1x2 1
λ2
2x2 2
λ2
3x2 3
λ2
4x2 4
. . . λm
1 xm 1
λm
2 xm 2
λm
3 xm 3
× =
? x1
1
? y1
1
? w1
1
λ1
2x1 2
λ1
2y1 2
λ1
2w1 2
λ1
3x1 3
λ1
3y1 3
λ1
3w1 3
λ1
4x1 4
λ1
4y1 4
λ1
4w1 4
λ2
1x2 1
λ2
1y2 1
λ2
1w2 1
λ2
2x2 2
λ2
2y2 2
λ2
2w2 2
λ2
3x2 3
λ2
3y2 3
λ2
3w2 3
λ2
4x2 4
λ2
4y2 4
λ2
4w2 4
. . .
λm
1 xm 1
λm
1 ym 1
λm
1 wm 1
λm
2 xm 2
λm
2 ym 2
λm
2 wm 2
λm
3 xm 3
λm
3 ym 3
λm
3 wm 3
× × ×
linear hull of all possible fillings Bk = Span(
1
y1
1
w1
1
λ1
2x1 2
λ1
2y1 2
λ1
2w1 2
λ1
3x1 3
λ1
3y1 3
λ1
3w1 3
4x1 4
λ1
4y1 4
λ1
4w1 4
λ2
1x2 1
λ2
1y2 1
λ2
1w2 1
λ2
2x2 2
λ2
2y2 2
λ2
2w2 2
λ2
3x2 3
λ2
3y2 3
λ2
3w2 3
λ2
4x2 4
λ2
4y2 4
λ2
4w2 4
. . .
λm
1 xm 1
λm
1 ym 1
λm
1 wm 1
λm
2 xm 2
λm
2 ym 2
λm
2 wm 2
λm
3 xm 3
λm
3 ym 3
λm
3 wm 3
1 1 1
)