Explainable Machine Learning Models for Structured Data
Dr Georgiana Ifrim
georgiana.ifrim@insight-centre.org
(joint work with Severin Gsponer, Thach Le Nguyen, Iulia Ilie)
30 July 2018
Explainable Machine Learning Models for Structured Data Dr - - PowerPoint PPT Presentation
Explainable Machine Learning Models for Structured Data Dr Georgiana Ifrim georgiana.ifrim@insight-centre.org (joint work with Severin Gsponer, Thach Le Nguyen, Iulia Ilie) 30 July 2018 Overview Structured Data Symbolic Sequences (e.g.,
georgiana.ifrim@insight-centre.org
30 July 2018
Insight Centre for Data Analytics Slide 2
Insight Centre for Data Analytics Slide 3
Value Data points 290.507 AGGGCATCATGGAGCTGTCCAG 679.305 ATCACAATTTTGCCGAGAGCGA 1998.715 GTACACCCCGTTCGGCGGCCCA 447.803 CCTTTAGCCCATCGTTGGCCAA
Byte sequence Class Data points +1 C7 01 24 04 5F 0E EA DC 00 E9 D6 4A 00 0C 66 89 +1 74 13 BA EF 01 00 06 68 95 14 88 B7 00 0F 0E EA
08 F9 C8 1A 80 C1 8B 48 40 00 89 51 10 B8 04 00
B8 00 00 00 00 50 E8 D8 00 00 00 83 C4 04 53 FF Assembly code
Insight Centre for Data Analytics Slide 4
Insight Centre for Data Analytics Slide 5
[Source: http://www.darpa.mil/program/explainable-artificial-intelligence ]
Insight Centre for Data Analytics Slide 6
Insight Centre for Data Analytics Slide 7
Score Sequence 290.5 AGTCCACAAGGCTAGGATAGCTATCCGGATCGA 315.1 TATCCTGCAGTACAAGTCCGTAATTCACAATCCA 805.6 AGTCCGCTAGGCTAGGATAGCTAGCCCGATCGA 799.7 AGCCAAGACCTGAAATAGGCTCCTGAGATACAG ??? CGGGTCGTATCCGCACTGAATATCTAGGCTTACG Goal is to learn a mapping: f : S → R Weight k-mer 796.6 TAGGCT 402,5 CACAA
TCCG
Insight Centre for Data Analytics Slide 8
Algorithm 1 Coordinate Descent with Gauss Southwell Selection
1: Set β(0) = 0 2: while termination condition not met do 3:
Calculate objective function L(β(t))
4:
Find coordinate jt with maximum gradient value
5:
Find optimal step size ηjt
6:
Update β(t) = β(t−1) − ηjt
∂L ∂βjt (β(t−1))ejt
7:
Add corresponding feature to feature set
8: end while
How do we find coordinate jt efficiently?
Key Ideas Bound gradient of k-mer using only information about its sub-k-mers. Example Given: sp = ”ACT” Calculate bound: µ(sp) s1 = ”ACTC” -> gradient(s1) ≤ µ(sp) s2 = ”AACT” -> gradient(s2) ≤ µ(sp) s3 = ”TACTG” -> gradient(s3) ≤ µ(sp)
Insight Centre for Data Analytics Slide 9
Insight Centre for Data Analytics Slide 10 0 1 0 0 1 0 0 1 Classifier M 0.1 0.3 0.2 0.4 a1b1b1c1 a1c1c1d1 a2b2b2b2 a2c2c2d2 anbnbncn ancndndn
...
SAX/SFA SEQL SEQL SEQL F1 F2 Fn
... ...
a1b1b1 a1c1c1
Insight Centre for Data Analytics Slide 11
Top-3 models:
6 7 8 9 10 CD mtSS−SEQL+LR FCN COTE WEASEL ResNet mtSFA−SEQL+LR mtSS−SEQL ST mtSAX−SEQL+LR BOSS
Insight Centre for Data Analytics Slide 12
Hand at rest Hand moving above holster Hand moving down to grasp gun shoulder level Steady pointing Hand moving to
Gun time series annotation Point time series annotation
10 20 30 40 50 60 70 80 90
Hand at rest Hand moving to shoulder level Steady pointing
Insight Centre for Data Analytics Slide 13
Point (top) and Gun (bottom) Salient Region for Classification Decision Github code for our work: https://github.com/heerme?tab=repositories
Coefficients Subsequences 0.065 84 cbaab 0.062 47 db 0.062 23 ddddb 0.062 00 da 0.059 72 bbbbbbbbbbcdddd −0.053 72 aaaaaabbbb −0.054 39 bbbbaaaaaa −0.054 58 bbbcddddd
Insight Centre for Data Analytics Slide 14
All-Subsequence Learning and Symbolic Representations in Time and Frequency Domains, DMKD18, 2018.
2018
Subsequence Space, ECML-PKDD, 2017.
Space, ICDE, 2017.
scale identification and categorization of protein sequences using structured logistic regression, PloS one 9 (1), 2014.
predictor space, KDD, 2011.
grams, KDD, 2008.
15 Insight Centre for Data Analytics