SLIDE 21 2.1 Weakness of l-diversity
Simplified 2-diversity: to
generate a data set such that each individual is linked to “cancer” with probability at most 1/2
None Q None q2 None None Yes Yes
Cancer
q2 Q Q Q
QI D
None q2 None q2 None None Yes Yes
Cancer
q2 q2 q1 q1
QI D
Knowledge 1 I also know that there are two q1 values and four q2 values in the table. Knowledge 3 The anonymization algorithm tries to minimize the generalization steps for 2-diversity Knowledge 4 I will think in the following way.
None q2 None q2 None None Yes Yes
Cancer
q2 q2 q1 q1
QI D
None q1 None q2 None None Yes Yes
Cancer
q2 q1 q2 q2
QI D
None q2 None q2 None None Yes Yes
Cancer
q2 q1 q2 q1
QI D
I deduce that the
- riginal table MUST be
- Poss. 1.
This person o MUST suffer From Cancer. This attack is called
Minimality Attack.
That is, P(o is linked to Cancer | Knowledge) = 1
Problem: to generate a data set which satisfies the
following. for each individual o, P(o is linked to Cancer | Knowledge) < = 1/l I also know Peter with QID = (q1) Knowledge 2
m-confidentiality (where m = l)