PRETSA: Event Log Sanitization for Privacy-aware Process Discovery
Stephan A. Fahrenkrog-Petersen, Han van der Aa & Matthias Weidlich
PRETSA: Event Log Sanitization for Privacy-aware Process Discovery - - PowerPoint PPT Presentation
PRETSA: Event Log Sanitization for Privacy-aware Process Discovery Stephan A. Fahrenkrog-Petersen, Han van der Aa & Matthias Weidlich Motivation hu-berlin.de/pda 2 Related Work [Sweeney et al., 2002] Process Mining [Monreale et
Stephan A. Fahrenkrog-Petersen, Han van der Aa & Matthias Weidlich
hu-berlin.de/pda
2
hu-berlin.de/pda
3
Information System Individual Event Data Sanitized Event Data Process Mining Artifact
Privatized Process Mining Process Mining Event Log Sanitization
Process Mining Artifact
Data Contribution Data Extraction
[Mannhardt et al., 2019] [Monreale et al., 2014] [Sweeney et al., 2002]
hu-berlin.de/pda
—> Illegal e.g. in Germany
4
hu-berlin.de/pda
5
hu-berlin.de/pda
6
hu-berlin.de/pda
7
hu-berlin.de/pda
PREfix-Tree based event log SAnitization
PRETSA Event Log Event Log with k-anonymity & t-closeness Process Discovery Process Model with Performance Data
8
hu-berlin.de/pda
9
Sequence variant # σ1 create po,update po,receive gd,check in,pay in 10 σ2 create po,update po,receive gd,check in,reject in 5 σ3 create po,receive gd,update po,check in,pay in 7 σ4 create po,receive gd,update po,check in,reject in 5 σ5 create po,receive gd,update po,update po,check in,pay in 1
hu-berlin.de/pda
prefix tree from an event log
an equivalence class
10
Root create_po (28) update_po (15) receive_gd (13) receive_gd (15) check_in (15) pay_in (10) reject_in (5) update_po (13) update_po (1) check_in (12) pay_in (7) reject_in (5) check_in (1) pay_in (1)
hu-berlin.de/pda
11
Root create_po (28) update_po (15) receive_gd (13) receive_gd (15) check_in (15) pay_in (10) reject_in (5) update_po (13) update_po (1) check_in (12) pay_in (7) reject_in (5) check_in (1) pay_in (1)
k=8
violation is found
hu-berlin.de/pda
12
Root create_po (28) update_po (15) receive_gd (13) receive_gd (15) check_in (15) pay_in (15) update_po (13) update_po (1) check_in (12) pay_in (7) reject_in (5) check_in (1) pay_in (1)
k=8
branch with violation
most similar branch
hu-berlin.de/pda
13
Root create_po (28) update_po (15) receive_gd (13) receive_gd (15) check_in (15) pay_in (15) update_po (13) check_in (13) pay_in (13) k=8
hu-berlin.de/pda
14
hu-berlin.de/pda
15
hu-berlin.de/pda
16
Sequence variant # σ1 create po,update po,receive gd,check in,pay in 10 σ2 create po,update po,receive gd,check in,reject in 5 σ3 create po,receive gd,update po,check in,pay in 7 σ4 create po,receive gd,update po,check in,reject in 5 σ5 create po,receive gd,update po,update po,check in,pay in 1
hu-berlin.de/pda
17
hu-berlin.de/pda
18
hu-berlin.de/pda
19
hu-berlin.de/pda
…ensures privacy (k-anonymity & t-closeness) for event logs …uses a prefix tree representation of the event log …provides event logs with high utility for process discovery …is available on GitHub under MIT license: github.com/samadeusfp/PRETSA Questions? Reach out to fahrenks@hu-berlin.de
20