MACHINE VISION GROUP
Motion and Activity Analysis with Spatiotemporal Local Binary Patterns Matti Pietikäinen and Guoying Zhao
{mkp,gyzhao}@ee.oulu.fi Machine Vision Group University of Oulu, Finland http://www.ee.oulu.fi/mvg/
MACHINE VISION GROUP
Motion and Activity Analysis with Spatiotemporal Local Binary - - PDF document
Motion and Activity Analysis with Spatiotemporal Local Binary Patterns Matti Pietikinen and Guoying Zhao {mkp,gyzhao}@ee.oulu.fi Machine Vision Group University of Oulu, Finland http://www.ee.oulu.fi/mvg/ MACHINE VISION GROUP Contents 1.
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
70 51 70 62 83 65 78 47 80
13
8
10 1 1 1 1
1*1 + 1*2 + 1*4 + 1*8 + 0*16 + 0*32 + 0*64 + 0*128 = 15
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
riu2 / VARP,R
riu2
riu2 0 1 2 3 4 5 6 7 ... P+1
MACHINE VISION GROUP
N
riu2 + LBP8,3 riu2 + LBP8,5 riu2
n=1
MACHINE VISION GROUP
B-1
b=0
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
Sampling in volume Thresholding Multiply Pattern
MACHINE VISION GROUP
2 4 6 8 10 12 14 16 5 10 x 10
4
P: Number of Neighboring Points Length of Feature Vector Concatenated LBP VLBP
MACHINE VISION GROUP
3 2 1
1 3 2 1
X T Y
1 2 3
1 2 3 X Y
1 2 3
1 X T
1 2 3
1 Y T
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
5 10 15 20 25 30 0.2 0.4
100 200 300 400 500 600 700 800 0.05 0.1 0.15 0.2
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
Low resolution No eye detection Translation, in-plane and out-of- plane rotation, scale Illumination change Robust with respect to errors in face alignment
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
(a) Volume of utterance sequence (b) Image in XY plane (147x81) (c) Image in XT plane (147x38) in y =40 (d) Image in TY plane (38x81) in x = 70 Overlapping blocks (1 x 3, overlap size = 10).
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 20 40 60 80 100 Phrases index Recognition results (%) 1x5x3 block volumes 1x5x3 block volumes (features just from XY plane) 1x5x1 block volumes
MACHINE VISION GROUP
Features Normalization Results (%) [Arsic 2006] MRPCA Y 81.25 [Arsic 2006] MI MRPCA Y 87.5 [Gurban 2005] Temporal Derivatives Features Y 80 91(a&v, 10 dB SNR level) Ours Blocks: 3x6x2 N 92.71
8,8,8,1,1,1
LBP TOP
MACHINE VISION GROUP
MACHINE VISION GROUP
XY, XT and YT slices
time scales
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
These phrases were most difficult to recognize because they are quite similar in the latter part containing the same word ”you”. The selected slices are mainly in the first and second part of the phrase, The phrases ”excuse me” and ”I am sorry”are different throughout the whole utterance, and the selected features also come from the whole pronunciation.
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
Child Adult M-Age Elderly
ID
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
Silhouette representation LBP feature extraction HMM modeling MHI MEI Silhouette representation LBP feature extraction HMM modeling MHI MEI
MACHINE VISION GROUP
w w w w
1 2 3 4
w w w w
1 2 3 4
MACHINE VISION GROUP
) , min( ) | (
i
i t
h h q s h P
a23 a 11 a 22 a 33 a 12
MACHINE VISION GROUP
MACHINE VISION GROUP
Ref. Act. Seq. Res. Our method 10 90 97.8% Wang and Suter 2007 10 90 97.8% Boiman and Irani 2006 9 81 97.5% Niebles et al 2007 9 83 72.8% Ali et al. 2007 9 81 92.6% Scovanner et al. 2007 10 92 82.6% MHI 99% MEI 90% MHI + MEI 100% 8,2 4,1
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
yt xt
MACHINE VISION GROUP
Feature histogram of a bounding volume
MACHINE VISION GROUP
1.00 1.00 1.00 .78 .22 1.00 1.00 1.00 .11 .11 .78 1.00 1.00 1.00 1.00 1.00 .78 .22 1.00 1.00 1.00 .11 .11 .78 1.00 1.00
Bend Jack Jump Pjump Run Side Skip Walk Wave1 Wave2 Bend Jack Jump Pjump Run Side Skip Walk Wave1 Wave2 MACHINE VISION GROUP
.980 .020 .855 .145 .032 ,108 .860 .977 .020 .003 .01 .987 .003 .033 .967 .980 .020 .855 .145 .032 ,108 .860 .977 .020 .003 .01 .987 .003 .033 .967 Box Clap Wave Jog Run Walk Clap Wave Jog Run Walk Box
MACHINE VISION GROUP
Feature histogram of the whole volume xt xy yt
j i h
MACHINE VISION GROUP
B F S B F S
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
L i i i
XY XT YT (a)
MACHINE VISION GROUP
MACHINE VISION GROUP
XY XT YT (a)
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
[6] G. Doretto, A. Chiuso, Y. N. Wu and S. Soatto, Dynamic Texture Segmentation, ICCV, 2003 [7] A. Ghoreyshi and R. Vidal, Segmenting Dynamic Textures with Ising Descriptors, ARX Models and Level Sets, ECCV, 2006
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
and depends less on texture properties than parametric approaches
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
human motions. (http://www.texturesynthesis.com/links.htm and DynTex database, which provides dynamic texture samples for learning and synthesizing.)
frames from not only space but also time domain, thus can reduce discontinuities in synthesis. (http://www.ee.oulu.fi/~guoyimo/download/)
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP
MACHINE VISION GROUP