SLIDE 1 Light Field Vision for Transparent Object Categorization and Segmentation 光场视觉在透明物体分类和分割中的应用
Yichao Xu 徐轶超
xuyichao.cn
SLIDE 2
Just a reminder – Last day P4A-04
1
SLIDE 3 About me
- A: Hometown in Zhejiang
- Jiaxing
- B: Undergraduate in Beijing
- BESTI
- C: Master 1 in Anhui
- USTC, Hefei
- D: Master 2-3 in Shanghai
- SINAP
, CAS
- E: PhD in Fukuoka, Japan
- Kyushu University
2
SLIDE 4 Outline
- Introduction of Light Field Vision
- Transcat: Transparent Object Categorization
- Transcut: Transparent Object Segmentation
3
SLIDE 5
Scene
Light field
4 Light field describes all the light rays in the space
SLIDE 6
Sensors for visual perception
5 Cameras with CCD and CMOS sensors
SLIDE 7
Only a few light rays can be captured
Scene Image
Regular camera sensing
6
SLIDE 8
Each light ray can be represented by L(s, t, u, v)
Scene
Light field parameterization
7
Position (s, t) Angle (u, v)
4D light field
SLIDE 9
u 𝑡 Light field camera can capture richer information
Scene Viewpoint plane Sensor plane
Light field sensing
8
SLIDE 10
u 𝑡 Regular camera can only sample sub light field space
Scene Viewpoint plane Sensor plane
Light field sampling in phase space
9 u 𝑡
𝑡u phase space
SLIDE 11
u 𝑡 Light field camera can capture richer information
Scene Viewpoint plane Sensor plane
Light field sampling in phase space
10 u 𝑡
𝑡u phase space
SLIDE 12
Computational Photography
11 Multi-focus Multi-view Light Field is widely used for Image-based Rendering
SLIDE 13
12
Simultaneously record positional and angular information of ray
Obtain rich information with single-shot
Light field cameras
Stanford Raytrix Viewplus Lytro PiCam
SLIDE 14
Light field vision
13 Capture To solve computer vision problems
SLIDE 15
Computer vision makes our life better
14 Help us know more Free our hands
SLIDE 16
Visual recognition makes it possible
15 Visual recognition is important in these applications France Prešeren, Poet
SLIDE 17
Advantage of light field vision
16 Regular Computer vision Light Field Vision
Redundant information makes it easier to understand the 3D world
SLIDE 18 Light field vision applications
- Surveillance - Accurately detect desired foreground
- Depth estimation - Accurate and consistent
- Salience detection - Accurate in challenge scenes
17
[A.Shimada et al., IPSJ CVA 2013] LF method Conventional LF method Conventional [S. Wanner et al., PAMI2014] [N. Li et al., CVPR2014] LF method Conventional GT
SLIDE 19 Light Field Vision Application
- - transparent object recognition
18
SLIDE 20
Transparent object recognition
19 Which type is the object? Where is the object?
SLIDE 21
Challenge of the target object
20
Appearance of transparent objects drastically varies with background
SLIDE 22
Transparent object causes distortion
Regular computer vision methods cannot understand whether the image is distorted or not without prior knowledge Different objects produce different image of the same scene 21
SLIDE 23
Know light field from background
22 [Ben-Ezra and Nayar, ICCV2003] Known motion, Manually tagged feature [G. Wetzstein et al, ICCV2011] Known background Transparent object
SLIDE 24
Features from Light Field for Transparent Object Recognition
23
SLIDE 25
Distortion modeled by light field vision
Background distortion changes with viewpoint Background distortion is modeled as the correspondences between the viewpoints 24
SLIDE 26
Modeled distortion is independent of background textures
Background invariant distortion
25
SLIDE 27
∆u ∆v
Light Field Distortion (LFD) feature
26
SLIDE 28
LFD feature visualization
27 2D vectors on different viewpoints 24x2D feature vector for each pixel
∆u ∆v
SLIDE 29
u u 𝑡 𝑡 Rays from background are linear distributed
Background Viewpoint plane Sensor plane 𝑡u phase space
Light Field Linearity (LF-linearity)
28
SLIDE 30
Transparent object
𝑡
Sensor plane Background Viewpoint plane
u u 𝑡
𝑡u phase space
Light Field Linearity (LF-linearity)
Rays from transparent object are not linear distributed 29
SLIDE 31
Extract LF-linearity
30
∆𝑣 𝑡 Disparity Euclidean Distance Hyper-plane ∆𝑣 𝑡
SLIDE 32
LF-linearity visualization
31 Central view LF-linearity
SLIDE 33
Light Field Consistency (LF-consistency)
( , ) view s t (0,0) view
forward matching backward matching
Good consistency Poor consistency 32 LF-consistency is used for detecting the depth discontinuity
forward matching backward matching
(0,0) view ( , ) view s t
SLIDE 34
Occlusion in light field
Occlusion is caused by depth discontinuity 33 Occlusion detector
SLIDE 35
Occlusion detectors
34
SLIDE 36 Detect occlusion point
35 X = 1 The detected occlusion point is from θ = 0
1 1 1 1 1 1 1 1 1 1 1
= 0.7
SLIDE 37
Detected occlusion visualization
36 Central view Occlusion response
SLIDE 38 Feature and descriptor
- LFD Feature (光场扭曲特征)
- 2x24 Dimensional vector
- Describe the distortion pattern
- LF-linearity(光场线性度)
- A metric to describe how much is the distortion
- Occlusion detector (遮挡检测)
- Describe the probability of
a point to be in the occlusion
- Occlusion in which direction
37
SLIDE 39 Outline
- Introduction of Light Field Vision
- Transcat: Transparent Object Categorization
- Transcut: Transparent Object Segmentation
38
SLIDE 40
TransCat: Transparent Object Categorization
39 Which object?
SLIDE 41 Training pipeline
Extracting the LFD feature Training based on Bag of features Training based on Bag of features Estimate background by LF-linearity
40
Filtering the background Extracting the LFD feature
Non-linear Non-linear Non-linear Non-linear Linear Linear Linear Linear
SLIDE 42 Extracting the LFD feature Filtering the background Training based on Bag of features
Training pipeline
41
Estimate relative differences by
SLIDE 43
Representative LFD Feature space Extracting the LFD feature Filtering the background by LF-linearity Training based on Bag of features
Training pipeline
42
SLIDE 44 Representative LFD
Categorization based
Testing for transparent objects categorization
43
result
SLIDE 45
Experimental setting
18 objects 10 backgrounds 44
Background scenes can be dynamic!
SLIDE 46 Categorization result
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 A B C D E F G H I J K L M N O P Q R
Recognition ratio Object
Average categorization accuracy: 84% 45
Evaluation by leave-one-out cross-validation
SLIDE 47 Analysis
46
0.2 0.4 0.6 0.8 1 30 35 40 45 50 Recognition ratio Camera position [cm] 0.2 0.4 0.6 0.8 1 50 100 150 200 250 Recognition ratio Background position [cm] 0.2 0.4 0.6 0.8 Recognition ratio Lighting angle [degree] 0.2 0.4 0.6 0.8 0.1 0.2 0.3 Recognition ratio Noise standard deviation
SLIDE 48 Analysis
47
0.2 0.4 0.6 0.8 1
5 10 Recognition ratio Rotation along x-axis [degree] 0.2 0.4 0.6 0.8 10 20 30 40 Recognition ratio Rotation along y-axis [degree] 0.2 0.4 0.6 0.8 10 20 30 40 Recognition ratio Rotation along z-axis [degree]
Overall Symmetric objects Asymmetric objects
SLIDE 49
Results for real scene
48
SLIDE 50 Outline
- Introduction of Light Field Vision
- Transcat: Transparent Object Categorization
- Transcut: Transparent Object Segmentation
49
SLIDE 51
Transcut: Transparent Object Segmentation
50
SLIDE 52 Properties of different components
Transparent object segmentation formulated as labeling problem 51
Trans Obj Background Good LF-linearity Occlusion Extracted by occlusion detector Transparent Object Poor LF-linearity exclude the occlusion
SLIDE 53
Regional term
Central view of input light field image
Background penalty Foreground penalty
large penalty assigns to pixels that have poor LF-linearity exclude the occlusion area large penalty assigns to pixels with poor LF-linearity in the occlusion or pixels with good LF-linearity 52
SLIDE 54 Boundary term
Central view of input light field image
q3
… …
p q2 q4 q1
… …
If is from 𝑃𝑞 θ = 0,
Detected
𝑃𝑞 53
SLIDE 55
Energy minimization
54 Central view of input light field image Regional term Boundary term
Graph Cut
SLIDE 56 Experiments
Object 1 Object 2 Object 3 Object 4 Object 5 Object 6 Object 7 Scene 1 Scene 2 Scene 3 Scene 4 Scene 5 Scene 6 Scene 7
55 Background scenes can be dynamic!
SLIDE 57 Comparison with related work
56
Finding glass
McHenry et al., CVPR2005
6 features from appearance LF-linearity thresholding Single feature from LF
SLIDE 58 Visual comparison
57
Images from the central viewpoint Results from Finding glass Results from LF-linearity thresholding Results from TransCut Ground Truth
SLIDE 59
Quantitative comparison
58
TP FN FP
F-measure Recall Precision Th=1 0.19 0.84 0.11 Th=2 0.34 0.78 0.22 Th=3 0.42 0.76 0.29 Th=4 0.45 0.73 0.33 Th=5 0.48 0.70 0.37 Th=6 0.49 0.67 0.39 Th=7 0.50 0.65 0.41 Th=8 0.50 0.63 0.43 Th=9 0.50 0.61 0.44 F-measure Recall Precision Finding glass 0.30 0.82 0.19 LF-linearity thresholding 0.50 0.65 0.41 Proposed 0.85 0.96 0.77
SLIDE 60 Summary
- Light field vision can get more
information for solving vision problems
- Full space sampling vs. sub-space
sampling
- Transparent object categorization
- Transparent object segmentation
59
SLIDE 61 Open issues
- Develop robust feature descriptors
- Distance invariant
- Rotation invariant
- Apply to other objects
- Specular objects
- Recover the undistorted background
- Reconstruct 3D shape of transparent objects
- Natural scenes
60
SLIDE 62 Publications
- Y. Xu, K. Maeno, H. Nagahara, and R. Taniguchi, “Mobile camera
array calibration for light field acquisition,” in International Conference on Quality Control by Artificial Vision (QCAV), pp. 283–290, 2013.
- Y. Xu, K. Maeno, H. Nagahara, and R. Taniguchi, “Camera array
calibration for light field acquisition,” Frontiers of Computer Science, 2015, 9(5), pp. 691-702.
- Y. Xu, K. Maeno, H. Nagahara, A. Shimada, and R. Taniguchi,
“Light field distortion feature for transparent object classification,” Computer Vision and Image Understanding, Vol. 139, pp. 122-135, 2015.
- Y. Xu, H. Nagahara, A. Shimada, and R. Taniguchi, "TransCut:
Transparent Object Segmentation from a Light-Field Image", ICCV 2015, Santiago, Chile
61
SLIDE 63
Thank you!
62