http://pitt.edu/~emotion
Criteria and metrics for thresholded AU detection
Jeff Girard and Jeff Cohn, University of Pittsburgh
BeFIT Workshop, ICCV 2011
University of Pittsburgh – Affect Analysis Group
November 13, 2011
Facial Action Coding System
Novel Classifier Training
Data Collection → Groundtruth Coding (subset) → Classifier Training (subset) → Automatic Coding
Strengths:
+Classifier trained on same database
Limitations:

Naïve Classifier Implementation
Data Collection → Classifier from other Database → Automatic Coding
Strengths:
+Requires no ground truth coding
+Requires no classifier training
Limitations:
Data Collection → Classifier from other Database → Threshold Analysis (subset) → Automatic Coding
Strengths:
+Requires no new classifier training
+Threshold optimized for current database
Limitations:
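The threshold analysis step might look roughly like the sketch below: sweep candidate thresholds over a borrowed classifier's decision values on the ground-truth-coded subset, and keep the threshold that scores best on a chosen metric (F1 here). The function names, the candidate grid, and the sample data are hypothetical and do not come from the slides.

```python
def f1_at_threshold(decision_values, truth, threshold):
    """F1 of the predictions obtained by thresholding the decision values."""
    tp = fp = fn = 0
    for value, label in zip(decision_values, truth):
        predicted = value > threshold
        if predicted and label:
            tp += 1
        elif predicted:
            fp += 1
        elif label:
            fn += 1
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def best_threshold(decision_values, truth, candidates, score=f1_at_threshold):
    """Return the candidate with the highest score (the first one wins ties)."""
    return max(candidates, key=lambda th: score(decision_values, truth, th))


# Hypothetical decision values and ground-truth labels for a coded subset.
values = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
labels = [0, 0, 0, 1, 0, 1, 1, 1]
candidates = [0.05, 0.15, 0.25, 0.35, 0.45, 0.55, 0.65, 0.75, 0.85, 0.95]

best = best_threshold(values, labels, candidates)
print(best, round(f1_at_threshold(values, labels, best), 3))
```

Passing a different `score` function (for example one that returns Accuracy or Kappa) would select the threshold under that criterion instead.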
AU 4, AU 10, AU 12, AU 14
Figure: SVM_12. SVM Decision Value by Frame Number.
Figure: SVM_12 with Threshold, and the resulting Thresholded Prediction.
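As a minimal sketch of how a Thresholded Prediction can be derived from per-frame SVM decision values: each frame is labeled 1 (AU present) when its value exceeds the threshold, and 0 otherwise. The function name, the strict `>` comparison, and the sample values are assumptions made for illustration.

```python
def threshold_predictions(decision_values, threshold):
    """Label a frame 1 when its decision value exceeds the threshold, else 0."""
    return [1 if value > threshold else 0 for value in decision_values]


# Hypothetical per-frame decision values, for illustration only.
svm_12 = [0.10, 0.35, 0.62, 0.80, 0.55, 0.20]
predicted = threshold_predictions(svm_12, threshold=0.5)
print(predicted)
```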
Figure: Thresholded Prediction compared with Groundtruth Labels.
Accuracy = 0.855, F1 = 0.756, Kappa = 0.656
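The three metrics can all be computed from the frame-level confusion counts. The sketch below assumes that Kappa refers to Cohen's kappa (the slides do not spell this out) and uses made-up labels; the function names are hypothetical.

```python
def confusion_counts(predicted, truth):
    """Return (tp, fp, fn, tn) for two equal-length binary label sequences."""
    tp = sum(1 for p, t in zip(predicted, truth) if p == 1 and t == 1)
    fp = sum(1 for p, t in zip(predicted, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(predicted, truth) if p == 0 and t == 1)
    tn = sum(1 for p, t in zip(predicted, truth) if p == 0 and t == 0)
    return tp, fp, fn, tn


def accuracy(tp, fp, fn, tn):
    return (tp + tn) / (tp + fp + fn + tn)


def f1_score(tp, fp, fn, tn):
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def cohen_kappa(tp, fp, fn, tn):
    """Observed agreement corrected for the agreement expected by chance."""
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    return (observed - expected) / (1 - expected) if expected != 1 else 0.0


# Hypothetical thresholded predictions and ground-truth labels.
predicted = [1, 1, 0, 0, 1, 0, 0, 0]
truth = [1, 0, 0, 0, 1, 1, 0, 0]
counts = confusion_counts(predicted, truth)
print(accuracy(*counts), f1_score(*counts), cohen_kappa(*counts))
```

Accuracy counts true negatives as successes, whereas F1 ignores them and Kappa discounts the agreement expected by chance, so the three metrics can rank the same predictions differently.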
Figure: AU_10 Threshold Training. Score on Performance Metric (Accuracy, F1, Kappa) by Threshold Value.
Figure: AU_4 Threshold Training. Score on Performance Metric (Accuracy, F1, Kappa) by Threshold Value.
Figure: AU_14 Threshold Training. Score on Performance Metric (Accuracy, F1, Kappa) by Threshold Value.
Figure: AU_12 Threshold Training. Score on Performance Metric (Accuracy, F1, Kappa) by Threshold Value.
Figure: Score on Performance Metric (Accuracy, F1, Kappa) for Naïve Classifier and Threshold Analysis.
Significance labels in the figure: p < .0001, p < .002, p < .0001
Figure: Overall F1 for Naïve Implementation, Threshold Analysis, and FERA Winner*.
Figure: Percent Increase in Performance (Accuracy, F1, Kappa) for AU_4, AU_10, AU_12, and AU_14.
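Percent increase in performance is a relative change between two scores. The sketch below assumes the baseline is the score before threshold analysis and the improved score is the one after it, which the slides imply but do not state; the function name and the numbers are hypothetical.

```python
def percent_increase(baseline, improved):
    """Relative change from baseline to improved, expressed in percent."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (improved - baseline) / baseline * 100.0


# Hypothetical scores for one AU on one metric.
print(round(percent_increase(0.50, 0.60), 1))
```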
Figure: Thresholded Prediction, Thresholded Prediction (with smoothing), and Groundtruth Labels.
Thresholded Prediction: Accuracy = 0.855, F1 = 0.756, Kappa = 0.656
Thresholded Prediction (with smoothing): Accuracy = 0.896, F1 = 0.826, Kappa = 0.754
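The slides do not say which smoothing method was applied. One plausible choice, assumed here purely for illustration, is a sliding-window majority vote over the binary predictions, which removes isolated single-frame flips. The function name and window size are hypothetical.

```python
def majority_smooth(labels, window=3):
    """Replace each label with the majority label in a window centered on it.

    Windows are truncated at the sequence edges; ties resolve to 0.
    """
    half = window // 2
    smoothed = []
    for i in range(len(labels)):
        segment = labels[max(0, i - half):min(len(labels), i + half + 1)]
        smoothed.append(1 if 2 * sum(segment) > len(segment) else 0)
    return smoothed


# Hypothetical thresholded predictions with isolated single-frame flips.
noisy = [0, 0, 1, 0, 0, 1, 1, 0, 1, 1, 0, 0]
print(majority_smooth(noisy, window=3))
```

A wider window suppresses longer spurious runs but can also erase brief genuine AU events, so the window size would need to be chosen with the expected event duration in mind.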
Figure: Score on Performance Metric (Accuracy, F1, Kappa) for Training Set, Naïve Implementation, and Threshold Analysis.
Figure: Score on Performance Metric (Accuracy, F1, Kappa) for thresholds chosen by Zero, maxAc, maxF1, maxKa, and EER.
The threshold that maximized Accuracy performed poorly on F1 and Kappa. The thresholds that maximized F1, maximized Kappa, or were set at the equal error rate (EER) performed best on all metrics.
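For the EER criterion, a minimal sketch is to pick the candidate threshold at which the false positive rate and the false negative rate are closest to equal. The function names, the candidate grid, and the data are hypothetical, and this assumes EER is used in the standard equal-error-rate sense.

```python
def error_rates(decision_values, truth, threshold):
    """Return (false positive rate, false negative rate) at a threshold."""
    fp = fn = positives = negatives = 0
    for value, label in zip(decision_values, truth):
        predicted = value > threshold
        if label:
            positives += 1
            if not predicted:
                fn += 1
        else:
            negatives += 1
            if predicted:
                fp += 1
    fpr = fp / negatives if negatives else 0.0
    fnr = fn / positives if positives else 0.0
    return fpr, fnr


def eer_threshold(decision_values, truth, candidates):
    """Return the candidate minimizing |FPR - FNR| (the first one wins ties)."""
    def gap(threshold):
        fpr, fnr = error_rates(decision_values, truth, threshold)
        return abs(fpr - fnr)
    return min(candidates, key=gap)


# Hypothetical decision values and ground-truth labels.
values = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
labels = [0, 0, 0, 1, 0, 1, 1, 1]
candidates = [0.05, 0.15, 0.25, 0.35, 0.45, 0.55, 0.65, 0.75, 0.85, 0.95]

eer = eer_threshold(values, labels, candidates)
print(eer, error_rates(values, labels, eer))
```

Unlike an accuracy-maximizing threshold, this criterion weighs misses and false alarms as rates within each class, so it does not favor the majority class when AU occurrence is unbalanced.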