COST-SENSITIVE MEASURES OF INSTANCE HARDNESS
Carlos Melo Ricardo Prudêncio Centro de Informática – UFPE Recife-Brazil
C OST -S ENSITIVE M EASURES OF I NSTANCE H ARDNESS Carlos Melo - - PowerPoint PPT Presentation
C OST -S ENSITIVE M EASURES OF I NSTANCE H ARDNESS Carlos Melo Ricardo Prudncio Centro de Informtica UFPE Recife-Brazil I NTRODUCTION Instance hardness Which instances are more difficult in a dataset? Motivation Data
Carlos Melo Ricardo Prudêncio Centro de Informática – UFPE Recife-Brazil
Instance hardness Which instances are more difficult in a dataset? Motivation Data cleaning, ensemble methods,... Aspects considered in our work Misclassification costs Decision thresholds choice methods
Context A Cost of FN = Cost of FP Context B Cost of FN > Cost of FP
Instance hardness depends on the observed context (misclassification costs) and how to deal with it (decision threshold choice method)
Framework to define cost curves and
Questions: Given a context and an algorithm, how
hard is an instance?
How hard is an instance in general? Which algorithm is the best for each
instance?
Different curves for different decision
Loss Instance x
Instances can be either positive (y = 0) or
Learned model m is a scoring function s = m(x) is high for negative instances Decision Threshold (t)
y = 1, if s > t (i.e., x is negative) 0, otherwise (i.e., x is positive) s y y 0.92 1 1 0.71 1 0 0.54 1 0 0.36 0 0 0.21 0 1 t = 0.5 ^ ^
Cost model :
1
Positive instances Negative instances
p
n
Threshold is set equal to the cost proportion t = T(c) = c
s y y 0.92 1 1 0.71 1 0 0.54 1 0 0.36 0 0 0.21 0 1 c = 0.4 Higher cost for false positives t = 0.4 ^
1
QI
0.54
n
x
Instance cost curves (positive instances)
n
1
QI
2 0 2
s
s
2 2
2s
Threshold is equal to a desired
t = T(c) = R-1(c)
s y y 0.92 1 1 0.71 0 0 0.54 0 0 0.36 0 0 0.21 0 1 R = 0.80 (80% of positive predictions) ^
1
0.4 0.6
x R(0.36) = 0.40
R(0.54) = 0.60
Instance cost curves (positive instances)
n
1
2
R(s)
2
x m1(x) y x1 0.92 1 x2 0.71 1 x3 0.34 0 x4 0.31 1 x5 0.23 1 x6 0.20 1 x7 0.15 0 x8 0.13 0 x9 0.11 1 x10 0.05 0
IHRD = (0.7)2 IHSD = (0 - 0.34)2 = (0.34)2
Well calibrated score but poor rank
Average cost curves and instances hardness
| | 1
L j j
| | 1
L j j x
Ensemble instance cost curves for the positive instances Score-Driven Rate-Driven x5 x6 x7 x8 x9 x10
Score-Driven Rate-Driven Positive Class Negative Class
Instance hardness measures and cost curves
Other threshold choice methods Probabilistic methods (rate-uniform and score-
uniform), rate-fixed and score-fixed.
Future work Integrate instance hardness into classification
methods (ensemble learning)
Empirical and meta-learning studies