郭振华
清华大学深圳研究生院 zhenhua.guo@sz.tsinghua.edu.cn
Robust texture image representation by scale selective local binary - - PowerPoint PPT Presentation
Robust texture image representation by scale selective local binary patterns (TIP2016) zhenhua.guo@sz.tsinghua.edu.cn 2001
郭振华
清华大学深圳研究生院 zhenhua.guo@sz.tsinghua.edu.cn
地办学机构。同一学校,同一品牌,秉承同一文化传统。
兼职教师40余人。
2
http://www.sz.tsinghua.edu.cn
、先进制造、海洋科学与技术、社会科学与管理七个学部。
新能源、物流、海洋等学科。
3
4
6
草地(纹理) 树林(纹理) 楼房(纹理) 天空(纹理)
7
Desert vs Mountain Normal vs Abnormal Defect Detection
8
9
Structural approach: a set of texels in some regular or repeated pattern
10
grass leaves What/Where are the texels?
11
12
13
with Local Binary Pattern. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7), pp. 971-987, 2002.
14
15
16
17
18
19
Central pixel and its P circularly and evenly spaced neighbours with radius R. (a) (b) (c) (d) (a) A 33 sample block; (b) the local differences; (c) the sign and (d) magnitude components.
24
21
Original Image LDSMT CLBP Map Center Gray Level CLBP_C CLBP_S S CLBP_M M Local Difference CLBP Histogram Classifier
Histogram of CLBP_S of a sample. Histogram of CLBP_M of the sample. Histogram of CLBP_S_M. Histogram of CLBP_S/M.
Zhenhua Guo, Lei Zhang, David Zhang, A Completed Modeling of Local Binary Pattern Operator for Texture Classification, IEEE Transactions on Image Processing, vol. 19, no. 6, pp. 1657-1663, 2010.
23
24
Detect a Harris or Laplacian Region->Normalize the region->Feature Extraction
25
Estimating local scale or extracting local fractal feature
26
Global fractal feature
27
Polar transform
28
Scale shift matching
29
discriminant information for texture classification.
dominant patterns does not change.
Transactions on Image Processing 18(5), pp. 1107-1118, 2009.
Gaussian filter;
kept;
set;
30
1
, =1 , 1< L, "*" is the convolution operator
i l l
f l s s g l
1 2
( )=max( ( ), ( ), ..., ( ))
i L
f s s s CLBP _ S / C CLBP _ S / C CLBP _ S / C CLBP _ S / C
H k H k H k H k ( ) ( )= ( )+
i
f CLBP _ S / C T T CLBP _ S / C CLBP _ S / C
H k H k H k N
Gaussian filter;
1;
31
1
, =1 , 1< L, "*" is the convolution operator
l l
I l s s g l
1 2
( )=max( ( ), ( ), ..., ( ))
L
s s s I CLBP _ S / C CLBP _ S / C CLBP _ S / C CLBP _ S / C
DPH k DPH k DPH k DPH k
32
. . .
1 : Original Image I
s
Convolved by 2D Gaussian function g
Image Scale Space
2 I
s
I L
s
0 05 . 0 1 . 0 08 . 0 03 . 0 02 .
Feature Size: K Feature frequency of
ri P ,R T CLBP _ M / CDP Feature frequency
DP Feature Size: K
0 14 . 0 01 . 0 02 . 0 05 . 0 0 . 0 06 . 0 07 . 0 06 . 0 05 . 0 0 .
Feature frequency of
ri P ,R T CLBP _ M / CDP Feature frequency
DP
0 15 . 0 01 . 0 02 . 0 01 . 0 04 .
Feature frequency of
ri P ,R T CLBP _ M / CDP Feature frequency
DP
Feature Scale Space
0 15 . 0 1 . 0 08 . 0 05 . 0 04 .
Max
0 14 . 0 02 . 0 07 . 0 05 . 0 02 . 0 09 . 0 02 . 0 04 . 0 03 . 0 02 . 0 13 . 0 01 . 0 07 . 0 01 . 0 01 .
Output feature for image I
Feature Size: 2 K
Max
33
5 10 15 20 25 30 35 40 45 50 0.01 0.02 0.03 0.04 0.05 0.06 Pattern Index Frequency(%) 5 10 15 20 25 30 35 40 45 50 0.01 0.02 0.03 0.04 0.05 0.06 Pattern Index Frequency(%) 5 10 15 20 25 30 35 40 45 50 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 Pattern Index Frequency(%) 5 10 15 20 25 30 35 40 45 50 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 Pattern Index Frequency(%)0.35 0.17
34
35
1 2 3 4 5 6 7 8 9 1.7 1.75 1.8 1.85 1.9 1.95 2 2.05 2.1 Image Scale Average Feature Scale
Scale parameter of KTH-TIPS.
, 1,2,...,9 90
z
I I IS z
FS AFS z
K k ScaleIndex FS
K k I I
1
) (
36
Texture Database Name Imaging property Image Size Number
classes Number
samples CUReT The images are captured under different illumination and viewing directions. Fixed, 200*200 61 92 KTH- TIPS It extends CUReT by imaging new samples of ten of the CUReT textures at a subset of the viewing and lighting angles used in CUReT but also over a range of scales. Varied, 196*201 10 81 UIUC Textures are acquired under significant scale and viewpoint changes, arbitrary rotations, and uncontrolled illumination conditions, even including textures with non-rigid deformation. Fixed, 640*480 25 40 UMD It has been designed in a similar way as UIUC, while the image resolution is 4 times of UIUC. Fixed, 1280* 960 25 40 A LOT It is systematically collected with varied viewing angles, illumination angles, and illumination colors for each material. Varied, 1536* 891 250 100
37
38
39
40
41
42
( ), 1,2,...,
k k
H sqrt H k K
43
1
T T y c c c c
2 y c c c
err h H
,1 ,2 ,
, ,...,
c c c c n
H h h h
Transactions on Pattern Analysis and Machine Intelligence 27(5), pp. 684–698, 2005.
Method CUReT (46) KTH-TIPS (40) UIUC (20) UMD (20) ALOT (50) SRP [ICCV2011] (SVM) 99.37 99.29 98.56 99.30
98.52 97.71 96.27 99.13
98.46 94.8 92.0
98.6 98.5 98.8
98.42 95.6 WMFS [TIP2013] (SVM)
98.68 96.94 PLS [CVPR2014](SVM)
96.57 98.99 93.35 PFS [IVC2014](SVM)
97.92 99.38 97.5 LEP [TIP2013] (Shift Matching NNC)
99.29
99.25
98.4 98.4
84.93 86.99 60.73 89.87 78.38 LBPSRI [TIP2012] (NNC) 85.00 89.73 70.05 91.71 71.29 LBP [TPAMI2002] (NNC) 80.63 82.67 55.26 88.23 63.33 CLBP [TIP2010] (NNC) 97.40 97.19 93.26 98.00 93.28 SSLBP (NNC) 98.55 97.80 97.02 98.84 96.69 SSLBP (NSC) 99.51 99.39 99.36 99.46 99.71
44
Method Scale Space Building Pattern/ Patch Processing Histogramming Classification (NNC) RP [TPAMI2012]
SR·C·ST·Ip C·ST·(C·Tn) VZ_Patch [TPAMI2009]
C·ST·(C·Tn) SSLBP (L-1)·Sg· Ip 2·L·P·Ip 2·L·Ip 4·K·(C·Tn)
45 Here L=4 is the size of scale space, P=24 is the number of neighbours for LBP, Ip denotes the number of pixels per image sample, Sg denotes the size of Gaussian smooth kernel, K=600 is the number of selected dominant
15, C denotes the number of classes, ST represents the number of clustered textons per class, here C·ST≈4·K. Tn is the number of training samples per class.
CUReT KTH-TIPS UIUC UMD ALOT Feature extraction of MFS (ICCV2009) (Unit: Second) 0.09 0.08 0.62 2.60 2.67 Feature extraction of VZ_MR8 (ICCV2005) (Unit: Second) 1.03 0.93 9.98 37.35 40.11 Feature extraction of VZ_Patch (TPAMI2009) (Unit: Second) 12.44 11.28 96.97 309.51 346.64 Feature extraction of the proposed scheme (Unit: Second) 0.24 0.23 1.80 7.63 8.46 Matching (NNC) (Unit: Millisecond) 177.47 25.09 30.78 31.1 297.42 Matching (NSC) (Unit: Millisecond) 2.25 0.64 0.90 0.92 5.05
46
7 7
Image size \#Training Sample 20 15 10 5 1280*960 99.46+0.46 99.31+0.51 98.81+0.70 96.41+1.28 640*480 99.71+0.26 99.48+0.37 98.77+0.68 95.68+1.45 320*240 99.38+0.45 98.80+0.69 97.41+1.06 92.74+1.84
47
CUReT KTH-TIPS UMD UIUC CUReT 100% 80.33% 73% 74.54% KTH-TIPS
76.12% 80.16% UMD
87.87% UIUC
48
Percentage of identical dominant patterns between different training sets.
49
50
51
For any inquiry, please contact with zhenhua.guo@sz.tsinghua.edu.cn