The Mythos of Model Interpretability
Zachary C. Lipton https://arxiv.org/abs/1606.03490
The Mythos of Model Interpretability Zachary C. Lipton - - PowerPoint PPT Presentation
The Mythos of Model Interpretability Zachary C. Lipton https://arxiv.org/abs/1606.03490 Outline What is interpretability ? What are its desiderata? What model properties confer interpretability? Caveats, pitfalls, and takeaways
Zachary C. Lipton https://arxiv.org/abs/1606.03490
This model is {interpretable, explainable, intelligible, transparent, understandable}
Evaluation Metric
Evaluation Metric Interpretation
Evaluation Metric Interpretation
It seems either:
We hope to refine the discourse on interpretability In dialogue with the literature, we create a taxonomy
when it’s uncertain?
same mistakes as humans?
with the model?
the natural world
in triage) that assigns lower risk to asthma patients
the wild
non-stationary, noisy
depend on weak setup
to make a *decision*
to be a feature
may simply be valuable for the extra bits it carries
to fall into two categories
model works
some (potentially post-hoc) explanation
is simplicity
linear models, rules
person can *run* it
understanding individual components of a model
model or the nodes of a decision tree
behavior algorithm (but maybe not output)
convex optimizations, generalization bounds
A h y e s , s
e t h i n g c
i s h a p p e n i n g i n n
e 7 5 , 3 4 5 , 1 6 7 … m a y b e i t s e e s a c a t ? T r y j i g g l i n g t h e i n p u t s ?
explanations (absent transparency), we might train a (possibly separate) model to generate explanations
as interpretations
(Image: Karpathy et al 2015)
might be impossible to describe succinctly, local explanations are potentially useful. (Image: Wang et al 2016)
that look similar to the model
to explain treatments (Image: Mikolov et al 2014)
deep learning
Acknowledgments: Zachary C. Lipton was supported by the Division of Biomedical Informatics at UCSD, via training grant (T15LM011271) from the NIH/NLM. Thanks to Charles Elkan, Julian McAuley, David Kale, Maggie Makar, Been Kim, Lihong Li, Rich Caruana, Daniel Fried, Jack Berkowitz, & Sepp Hochreiter
References: The Mythos of Model Interpretability (ICML Workshop on Human Interpretability 2016) - ZC Lipton http://arxiv.org/abs/1511.03677