Motion and Activity Analysis with Spatiotemporal Local Binary - PDF document

Motion and Activity Analysis with Spatiotemporal Local Binary Patterns Matti Pietikäinen and Guoying Zhao {mkp,gyzhao}@ee.oulu.fi Machine Vision Group University of Oulu, Finland http://www.ee.oulu.fi/mvg/ MACHINE VISION GROUP Contents 1. Introduction to LBP operators in spatial domain 2. Motion analysis with spatiotemporal LBPs 3. Summary MACHINE VISION GROUP

Dynamic textures (R Nelson & R Polana: IUW, 1992; M Szummer & R Picard: ICIP, 1995; G Doretto et al., IJCV, 2003) MACHINE VISION GROUP Local Binary Pattern and Contrast operators Ojala T, Pietikäinen M & Harwood D (1996) A comparative study of texture measures with classification based on feature distributions. Pattern Recognition 29:51-59. An example of computing LBP and C in a 3x3 neighborhood: example thresholded weights 6 5 2 1 0 0 1 2 4 Important properties: 7 6 1 1 128 8 0 • LBP is invariant to any 9 8 7 64 32 16 1 1 1 monotonic gray level change Pattern = 11110001 • computational simplicity LBP = 1 + 16 +32 + 64 + 128 = 241 C = (6+7+8+9+7)/5 - (5+2+1)/3 = 4.7 MACHINE VISION GROUP

Multiscale LBP Ojala T, Pietikäinen M & Mäenpää T (2002) Multiresolution gray-scale and rotation invariant texture classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7):971-987. - arbitrary circular neighborhoods - uniform patterns - multiple scales - rotation invariance - gray scale variance as contrast measure MACHINE VISION GROUP -19 0 51 -23 -5 0 0 47 65 -8 0 0 1 62 70 70 10 8 1 1 80 78 13 1 83 1. Sample 2. Difference 3. Threshold 1*1 + 1*2 + 1*4 + 1*8 + 0*16 + 0*32 + 0*64 + 0*128 = 15 4. Multiply by powers of two and sum MACHINE VISION GROUP

„Uniform‟ patterns ‘Uniform’ patterns (P=8) U=0 U=2 Examples of „nonuniform‟ patterns (P=8) U=4 U=6 U=8 MACHINE VISION GROUP Uniform patterns • Bit patterns with 0 or 2 transitions 0 → 1 or 1 → 0 when the pattern is considered circular • All non-uniform patterns assigned to a single bin • 58 uniform patterns in case of 8 sampling points MACHINE VISION GROUP

Texture primitives (“micro - textons”) detected by the uniform patterns of LBP MACHINE VISION GROUP Estimation of empirical feature distributions Input image (region) is scanned with the chosen operator(s), pixel by pixel, and operator outputs are accumulated into a discrete histogram LBP P,R riu2 P+1 0 1 2 3 4 5 6 7 ... LBP P,R riu2 Joint histogram of two operators B-1 0 1 2 3 4 5 6 7 ... LBP P,R riu2 / VAR P,R VAR P,R MACHINE VISION GROUP

Multiscale analysis Information provided by N operators can be combined simply by summing up operatorwise similarity scores into an aggregate similarity score: N L N = L n e.g. LBP 8,1 riu2 + LBP 8,3 riu2 + LBP 8,5 riu2 n=1 Effectively, the above assumes that distributions of individual operators are independent MACHINE VISION GROUP Nonparametric classification principle Sample S is assigned to the class of model M that maximizes B-1 L(S,M) = S b ln M b b=0 Many other dissimilarity measures can be used (chi square, histogram intersection, Kullback-Leibler divergence, Jeffrey ’ s divergence, etc.) Nonparametric: no assumptions about underlying feature distributions are made!! MACHINE VISION GROUP

Face analysis using local binary patterns • Face recognition is one of the major challenges in computer vision • We proposed (ECCV 2004, PAMI 2006) a face descriptor based on LBP‟s • Our method has already been adopted by many leading scientists and groups • Computationally very simple, excellent results in face recognition and authentication, face detection, facial expression recognition, gender classification MACHINE VISION GROUP Face description with LBP Ahonen T, Hadid A & Pietikäinen M (2006) Face description with local binary patterns: application to face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(12):2037-2041. (an early version published at ECCV 2004) A facial description for face recognition: MACHINE VISION GROUP

Dynamic texture recognition Zhao G & Pietikäinen M (2007) Dynamic texture recognition using local binary  Determine the emotional patterns with an application to facial expressions. IEEE Transactions on Pattern state ofthe face Analysis and Machine Intelligence 29(6):915-928. (parts of this were earlier presented at ECCV 2006 Workshop on Dynamical Vision and ICPR 2006) MACHINE VISION GROUP Dynamic texture • Dynamic Textures (DT): Temporal texture • Textures with motion • An extension of texture to the temporal domain • Encompass the class of video sequences that exhibit some stationary properties in time  Lots of dynamic textures in real world  Description and recognition of DT is needed MACHINE VISION GROUP

V olume L ocal B inary P atterns ( VLBP ) Sampling in volume Thresholding Multiply Pattern MACHINE VISION GROUP LBP from T hree O rthogonal P lanes ( LBP-TOP ) Length of Feature Vector 4 x 10 10 Concatenated LBP VLBP 5 0 0 2 4 6 8 10 12 14 16 P: Number of Neighboring Points MACHINE VISION GROUP

3 2 1 0 Y -1 -2 -3 1 3 2 0 1 0 -1 -2 -1 -3 T X 3 1 1 2 1 T 0 0 T Y 0 -1 -2 -1 -1 -3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3 Y X -3 -3 -2 -1 0 1 2 3 X MACHINE VISION GROUP LBP-TOP MACHINE VISION GROUP

DynTex database • Our methods outperformed the state-of-the-art in experiments with DynTex and MIT dynamic texture databases MACHINE VISION GROUP MACHINE VISION GROUP

Results of LBP from three planes 0.2 0.15 0.4 0.1 0.2 0.05 0 0 0 100 200 300 400 500 600 700 800 5 10 15 20 25 30 LBP XY XZ YZ Con weighted 8,8,8,1,1,1 riu2 88.57 84.57 86.29 93.14 93.43[2,1,1] 8,8,8,1,1,1 u2 92.86 88.86 89.43 94.57 96.29[4,1,1] 8,8,8,1,1,1 Basic 95.14 90.86 90 95.43 97.14[5,1,2] 8,8,8,3,3,3 Basic 90 91.17 94.86 95.71 96.57[1,1,4] 8,8,8,3,3,1 Basic 89.71 91.14 92.57 94.57 95.71[2,1,8] MACHINE VISION GROUP Facial expression recognition Zhao G & Pietikäinen M (2007) Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(6):915-928.  Determine the emotional state of the face • Regardless of the identity of the face MACHINE VISION GROUP

Facial Expression Recognition Mug Shot Dynamic Information [Feng, 2005][Shan, 2005] [Bartlett, 2003][Littlewort,2004] Action Units Prototypic Emotional Expressions [Cohen,2003] [Tian, 2001][Lien, 1998] [Yeasin, 2004] [Bartlett,1999][Donato,1999] [Aleksic,2005] [Cohn,1999] Psychological studies [Bassili 1979], have demonstrated that humans do a better job in recognizing expressions from dynamic images as opposed to the mug shot. MACHINE VISION GROUP (a) Non-overlapping blocks(9 x 8) (b) Overlapping blocks (4 x 3, overlap size = 10) (a) Block volumes (b) LBP features (c) Concatenated features for one block volume from three orthogonal planes with the appearance and motion MACHINE VISION GROUP

Database Cohn-Kanade database : • 97 subjects • 374 sequences • Age from 18 to 30 years • Sixty-five percent were female, 15 percent were African-American, and three percent were Asian or Latino. MACHINE VISION GROUP Angry Happiness Disgust Sadness Surprise Fear MACHINE VISION GROUP

Comparison with different approaches People Sequence Class Dynamic Measure Recognition Num Num Num Rate (%) [Shan,2005] 96 320 7(6) N 10 fold 88.4(92.1) [Bartlett, 2003] 90 313 7 N 10 fold 86.9 [Littlewort, 90 313 7 N leave-one- 93.8 2004] subject- out [Tian, 2004] 97 375 6 N ------- 93.8 [Yeasin, 2004] 97 ------ 6 Y five fold 90.9 [Cohen, 2003] 90 284 6 Y ------- 93.66 Ours 97 374 6 Y two fold 95.19 Ours 97 374 6 Y 10 fold 96.26 MACHINE VISION GROUP Demo for facial expression recognition  Low resolution  No eye detection  Translation, in-plane and out-of- plane rotation, scale  Illumination change  Robust with respect to errors in face alignment MACHINE VISION GROUP

Example images in different illuminations Visible light (VL) : 0.38-0.75 μ m Near Infrared (NIR) : 0.7 μ m-1.1 μ m Taini M, Zhao G, Li SZ & Pietik ä inen M (2008) Facial expression recognition from near-infrared video sequences. Proc. 19th International Conference on MACHINE VISION GROUP Pattern Recognition (ICPR), 4 p. On-line facial expression recognition from NIR videos • NIR web camera allows expression recognition in near darkness. • Image resolution 320 × 240 pixels. • 15 frames used for recognition. • Distance between the camera and subject around one meter. Start sequences Middle sequences End sequences MACHINE VISION GROUP

Motion and Activity Analysis with Spatiotemporal Local Binary - PDF document

Motion and Activity Analysis with Spatiotemporal Local Binary Patterns Matti Pietikinen and Guoying Zhao {mkp,gyzhao}@ee.oulu.fi Machine Vision Group University of Oulu, Finland http://www.ee.oulu.fi/mvg/ MACHINE VISION GROUP Contents 1.

Visual Motion Motion illusions Uses for motion cues Optic flow Motion blindness

Spatiotemporal Regulation of ERK by Spatiotemporal Regulation of ERK by Dual- -specificity

Forces and Motion Click on the topic to go to that section Motion Motion Graphs of Motion

Forces and Motion Click on the topic to go to that section Motion Motion Graphs of Motion

Motion Estimation for Video Coding Motion-Compensated Prediction Bit Allocation Motion

Outline Outline Motion & Inverse Motion Motion & Inverse Motion Time

Learning to Synthesize Motion Blur CVPR 2019 Tim Brooks and Jon Barron Research Motion During

Motion in Photography Freeze Motion / Blur Motion Objective The student will create freeze

Motion Aftereffects Without Motion: Engaging the Human Motion Perception System With Still

An Overview of Models and Methods for Spatiotemporal Data Analysis Jim Zidek- U British

Motion Capture Specialized Motion Capture N. Alberto Borghese Laboratory of Human Motion

Simple Harmonic Motion (SHM) Slide 2 / 67 SHM and Uniform Circular Motion There is a deep

Simple Harmonic Motion Slide 2 / 70 SHM and Circular Motion There is a deep connection between

Computer Vision by Learning: Motion in Action Jan van Gemert, UvA 2 Motion and perceptual

Forces and Motion Click on the topic to go to that section Motion Graphs of Motion Forces

Class 38: Energy and Simple Harmonic Motion General equation of motion for simple harmonic motion

Motion Capture Sistemi a marker passivi N. Alberto Borghese Laboratory of Human Motion Analysis

Play Well Lab Pixifox Animation Vincent Elzenga kick-off meeting Arjan van Panhuis SWKGroep

PENZ conference 2016 WHY GO CONTEMPORARY? Dr Sarah-Kate Millar, Margot Bowes, Michelle Cleaver

Neutrino Mass Ordering: Hints and Challenges Eligio Lisi (INFN, Bari, Italy) Solvay Workshop,

Dynamics of Social Networks and Collective Motion in Sheep Hamed Haddadi Structure & Motion

Evaluation of pairwise calibration techniques for range cameras and their ability to detect a

When, Why, and How Do Deficits Matter? Daniel Shaviro, NYU Law School St. Francis College,

IMPROVING FELONY CASEFLOW National Center for State Courts Judicial Branch of Arizona in

Motion and Activity Analysis with Spatiotemporal Local Binary - PDF document

Motion and Activity Analysis with Spatiotemporal Local Binary Patterns Matti Pietikinen and Guoying Zhao {mkp,gyzhao}@ee.oulu.fi Machine Vision Group University of Oulu, Finland http://www.ee.oulu.fi/mvg/ MACHINE VISION GROUP Contents 1.

Visual Motion Motion illusions Uses for motion cues Optic flow Motion blindness

Spatiotemporal Regulation of ERK by Spatiotemporal Regulation of ERK by Dual- -specificity

Forces and Motion Click on the topic to go to that section Motion Motion Graphs of Motion

Forces and Motion Click on the topic to go to that section Motion Motion Graphs of Motion

Motion Estimation for Video Coding Motion-Compensated Prediction Bit Allocation Motion

Outline Outline Motion &amp; Inverse Motion Motion &amp; Inverse Motion Time

Learning to Synthesize Motion Blur CVPR 2019 Tim Brooks and Jon Barron Research Motion During

Motion in Photography Freeze Motion / Blur Motion Objective The student will create freeze

Motion Aftereffects Without Motion: Engaging the Human Motion Perception System With Still

An Overview of Models and Methods for Spatiotemporal Data Analysis Jim Zidek- U British

Motion Capture Specialized Motion Capture N. Alberto Borghese Laboratory of Human Motion

Simple Harmonic Motion (SHM) Slide 2 / 67 SHM and Uniform Circular Motion There is a deep

Simple Harmonic Motion Slide 2 / 70 SHM and Circular Motion There is a deep connection between

Computer Vision by Learning: Motion in Action Jan van Gemert, UvA 2 Motion and perceptual

Forces and Motion Click on the topic to go to that section Motion Graphs of Motion Forces

Class 38: Energy and Simple Harmonic Motion General equation of motion for simple harmonic motion

Motion Capture Sistemi a marker passivi N. Alberto Borghese Laboratory of Human Motion Analysis

Play Well Lab Pixifox Animation Vincent Elzenga kick-off meeting Arjan van Panhuis SWKGroep

PENZ conference 2016 WHY GO CONTEMPORARY? Dr Sarah-Kate Millar, Margot Bowes, Michelle Cleaver

Neutrino Mass Ordering: Hints and Challenges Eligio Lisi (INFN, Bari, Italy) Solvay Workshop,

Dynamics of Social Networks and Collective Motion in Sheep Hamed Haddadi Structure &amp; Motion

Evaluation of pairwise calibration techniques for range cameras and their ability to detect a

When, Why, and How Do Deficits Matter? Daniel Shaviro, NYU Law School St. Francis College,

IMPROVING FELONY CASEFLOW National Center for State Courts Judicial Branch of Arizona in

Outline Outline Motion & Inverse Motion Motion & Inverse Motion Time

Dynamics of Social Networks and Collective Motion in Sheep Hamed Haddadi Structure & Motion