Stakeholders in Explainable AI
Alun Preece, Dan Harborne (Cardiff), Dave Braines, Richard Tomsett (IBM UK), Supriyo Chakraborty (IBM US)
https://arxiv.org/abs/1810.00184
Explainability in GOFAI
Explainability in AI is not a new problem. In the last ‘AI summer’ (the expert systems boom, a.k.a. ‘Good Old Fashioned AI’) it was acknowledged that explainability was needed both by developers (to debug their systems) and by end-users (to trust system outputs).
It was also realized that these require different forms of explanation, framed by developers’ vs end-users’ conceptual models. The problem wasn’t solved before funding dried up!
‘Interpretability’ is now preferred over ‘explainability’. For our purposes, an explanation is a message intended to convey the cause and reason for a system output; an interpretation is the understanding gained by the recipient of the message. We can still observe different motivations, e.g., developers need to verify that their systems actually work.
Again, different forms of explanation are required, due to the differing conceptual models of the recipients….
A useful distinction (Lipton 2017):
Transparency: insight into the workings of the model itself (e.g., a deep neural network model; decision tree; rule base)
Post hoc interpretability: explanation of the model’s workings ‘after the fact’ (e.g., visualization of salient features, explanation by similar training examples)
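As a minimal illustration of the post hoc side of this distinction, here is a toy gradient-times-input saliency score for a linear model. The weights and input values are invented for illustration; real salient-feature methods such as deep Taylor decomposition operate on deep networks, but the idea of scoring each input feature’s contribution is the same.

```python
# Toy sketch (assumed example, not from the talk): post hoc saliency
# for a linear score f(x) = w . x. The input gradient of f is w, so a
# simple gradient-times-input saliency per feature is |w_i * x_i|.

def saliency(weights, x):
    """Gradient-times-input saliency scores for a linear model."""
    return [abs(w * xi) for w, xi in zip(weights, x)]

weights = [0.8, -0.1, 0.05]   # learned weights (illustrative)
x = [1.0, 2.0, 10.0]          # one input example (illustrative)

scores = saliency(weights, x)
# The most salient feature is the one with the largest |w_i * x_i|.
top_feature = max(range(len(scores)), key=scores.__getitem__)
```

A visualization of these per-feature scores over an image input is exactly what a saliency map shows.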
WHI workshop at ICML 2018 https://arxiv.org/abs/1806.07552
Developers are chiefly concerned with building AI applications. Theorists are chiefly concerned with understanding and advancing AI. Ethicists are chiefly concerned with fairness, accountability, transparency and related societal aspects of AI. Users are chiefly concerned with using AI systems. The first three of these communities are well-represented in the AI interpretability literature. The fourth will ultimately determine how long summer lasts.
Verification is about ‘building the system right’. Validation is about ‘building the right system’.
It is hard to envisage verification without (some) transparency. Post hoc explanations are valuable for validation.
Known knowns: things we train an AI system to know
Known unknowns: things we train an AI system to predict
Unknown knowns: things we know the AI system doesn’t know (i.e., things outside its bounds)
Unknown unknowns: the key area of concern for all communities!
Verification tends to be used to define the space of knowns. Validation is essential to define the space of unknowns.
Methods that derive at least in part from internal states of an AI system, e.g., deep Taylor decomposition, Google feature visualization. Caveats: the methods are non-axiomatic; their outputs speak mainly to the developer and theorist rather than the user and ethicist communities; and visualizations that fail to highlight meaningful features of the input may make recipients less inclined to trust the system…
Traffic congestion classifier: ‘congested’ image explanation by deep Taylor decomposition
Important to differentiate post hoc from transparent explanations, e.g., LIME presents local approximations, not the model’s actual workings, to recipients. Explanation features favoured by subject-matter experts include…
‘Congested’: explanation by example via influence functions
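The LIME-style ‘local approximation’ mentioned above can be sketched from scratch: sample perturbations around one input, query the black box, weight the samples by proximity, and fit a weighted linear model whose coefficients act as the local explanation. This is not the actual LIME library; the black-box function, sampling scale, and proximity kernel below are all invented for illustration.

```python
# Hedged sketch of a LIME-style local surrogate (illustrative only).
import numpy as np

rng = np.random.default_rng(0)

def black_box(X):
    # Assumed opaque model: nonlinear in feature 0, linear in feature 1.
    return X[:, 0] ** 2 + X[:, 1]

def local_surrogate(x, n_samples=500, scale=0.1):
    # Perturb around x and get black-box outputs.
    X = x + rng.normal(0.0, scale, size=(n_samples, x.size))
    y = black_box(X)
    # Proximity kernel: closer samples get higher weight.
    w = np.exp(-np.sum((X - x) ** 2, axis=1) / (2 * scale ** 2))
    # Weighted least squares via the sqrt-weight trick (with intercept).
    A = np.hstack([X, np.ones((n_samples, 1))]) * np.sqrt(w)[:, None]
    b = y * np.sqrt(w)
    coef, *_ = np.linalg.lstsq(A, b, rcond=None)
    return coef[:-1]  # per-feature local slopes = the explanation

x = np.array([1.0, 1.0])
slopes = local_surrogate(x)
# Near x = (1, 1) the true local gradient is (2*x0, 1) = (2, 1),
# so the surrogate's slopes should land close to those values.
```

The surrogate is only faithful locally, which is precisely the caveat the slide raises: recipients see the approximation, not the model.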
Composite explanation object that can be unpacked per a recipient’s requirements.
Layer 1 — Traceability: transparency-based bindings to internal states of the system showing the system ‘did the thing right’ [main stakeholders: developers and theorists]
Layer 2 — Justification: post-hoc semantic relationships between input and output features showing the system ‘did the right thing’ [main stakeholders: developers and users]
Layer 3 — Assurance: post-hoc representations with explicit reference to policy/ontology to give confidence that the system ‘does the right thing’ [main stakeholders: users and ethicists]
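The three-layer composite object might be sketched as a simple data structure that each stakeholder unpacks differently. The stakeholder-to-layer mapping follows the slide’s ‘main stakeholders’ annotations, but the class, field names, and payload strings are purely illustrative, not any real API.

```python
# Illustrative sketch of the composite explanation object (assumed
# names; the layer contents here are placeholder strings).
from dataclasses import dataclass

@dataclass
class CompositeExplanation:
    traceability: str   # Layer 1: bindings to internal system states
    justification: str  # Layer 2: input/output semantic relationships
    assurance: str      # Layer 3: explicit reference to policy/ontology

    # Which layers each stakeholder typically unpacks (per the slide).
    LAYERS_FOR = {
        "developer": ("traceability", "justification"),
        "theorist": ("traceability",),
        "user": ("justification", "assurance"),
        "ethicist": ("assurance",),
    }

    def unpack(self, stakeholder):
        """Return only the layers relevant to the given stakeholder."""
        return {name: getattr(self, name)
                for name in self.LAYERS_FOR[stakeholder]}

exp = CompositeExplanation(
    traceability="saliency map over input-layer features",
    justification="semantic annotation of salient features",
    assurance="counterfactual examples referencing policy",
)
# exp.unpack("ethicist") returns only the assurance layer.
```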
Layer 1 (traceability): saliency map visualisation of input layer features for classification ‘gorilla’
Layer 2 (justification): ‘right for the right reasons’ semantic annotation of salient gorilla features
Layer 3 (assurance): counterfactual examples showing that images…
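A Layer 3 counterfactual can be illustrated with a toy search for the smallest input change that flips a black-box decision, i.e., what would have to differ for the system to answer otherwise. The decision rule, feature names, and step size below are invented for illustration.

```python
# Hypothetical sketch of a counterfactual explanation (assumed example).

def classify(income, debt):
    # Assumed opaque decision rule: approve when income - debt >= 50.
    return "approve" if income - debt >= 50 else "deny"

def counterfactual(income, debt, step=1, max_steps=1000):
    """Greedily raise income until the black-box decision flips."""
    original = classify(income, debt)
    for k in range(1, max_steps + 1):
        if classify(income + k * step, debt) != original:
            return income + k * step  # smallest tried income that flips
    return None  # no counterfactual found within the search budget

flipped_income = counterfactual(income=60, debt=30)
# The counterfactual reads: "had income been 80 instead of 60,
# the decision would have been 'approve' rather than 'deny'."
```

Because the counterfactual references inputs a recipient understands, rather than internal states, it serves the assurance needs of users and ethicists.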
Explanation in AI is an old problem: ‘debuggability’ shouldn’t be conflated with ‘trustability’. We can identify four communities with different stakes in explanation: developers, theorists, ethicists, and users. Developers & theorists tend to focus on verification (knowns); users & ethicists are more interested in validation (unknowns). Can the needs of multiple stakeholders be addressed via ‘joined-up’, composite explanations? Despite a large and growing literature on explanation/interpretation in AI, the voice of users is under-represented. As in the 1980s, it will be the users that determine whether AI thrives!
This research was sponsored by the U.S. Army Research Laboratory and the UK Ministry of Defence under Agreement Number W911NF-16-3-0001. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the U.S. Army Research Laboratory, the U.S. Government, the UK Ministry of Defence or the UK Government. The U.S. and UK Governments are authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation hereon.