Robust and Stable Black Box Explanations - Hima Lakkaraju - PowerPoint PPT Presentation



SLIDE 1

Robust and Stable Black Box Explanations

Hima Lakkaraju (Harvard University), Nino Arsov (Macedonian Academy of Arts & Sciences), Osbert Bastani (University of Pennsylvania)
SLIDE 2

Motivation

§ ML models are increasingly proprietary and complex, and are therefore not interpretable
§ Several post hoc explanation techniques proposed in recent literature

§ E.g., LIME, SHAP, MUSE, Anchors, MAPLE

SLIDE 3

Motivation

§ However, post hoc explanations have been shown to be unstable and unreliable

§ Small perturbations to the input can substantially change the explanations; running the same algorithm multiple times results in different explanations (Ghorbani et al.)
§ High-fidelity explanations can have very different covariates than the black box (Lakkaraju & Bastani)
§ They are also not robust to distribution shifts

SLIDE 4

Why can explanations be unstable?

§ Distribution P(x1, x2) where x1 and x2 are perfectly correlated
§ Black box f*(x1, x2) = I(x1 ≥ 0)
§ Explanation e(x1, x2) = I(x2 ≥ 0)
§ e has perfect fidelity, but is completely different from f*!

§ If P(x1, x2) shifts, e may no longer have high fidelity
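This failure mode is easy to reproduce. Below is a minimal sketch (synthetic data, NumPy) of the example on this slide: the explanation e = I(x2 ≥ 0) matches the black box f* = I(x1 ≥ 0) perfectly while x1 and x2 are perfectly correlated, then breaks once x2 shifts.

```python
import numpy as np

rng = np.random.default_rng(0)

# Perfectly correlated covariates: x2 is an exact copy of x1.
x1 = rng.normal(size=10_000)
x2 = x1.copy()

def blackbox(a, b):
    return (a >= 0).astype(int)       # f*(x1, x2) = I(x1 >= 0)

def explanation(a, b):
    return (b >= 0).astype(int)       # e(x1, x2) = I(x2 >= 0)

# On the original distribution the explanation has perfect fidelity.
fidelity = np.mean(blackbox(x1, x2) == explanation(x1, x2))

# Shift x2 only: the correlation breaks and fidelity collapses.
x2_shifted = x2 - 1.0
fid_shift = np.mean(blackbox(x1, x2_shifted) == explanation(x1, x2_shifted))
print(fidelity, fid_shift)  # 1.0 vs. roughly 0.66
```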

SLIDE 5

Why do we care?

§ Domain experts rely on explanations to validate properties of the black box model

§ Check if model uses spurious or sensitive attributes [Caruana 2015, Bastani 2017, Rudin 2019]

§ Poor explanations may mislead experts into drawing incorrect conclusions

SLIDE 6

Our Contributions: ROPE

§ We propose ROPE (RObust Post hoc Explanations)

§ Framework for generating stable and robust explanations
§ It is flexible, e.g., it can be instantiated for local vs. global explanations as well as linear vs. rule-based explanations
§ First approach to generating explanations robust to distribution shifts
§ Our experiments show that ROPE significantly improves robustness on real-world distribution shifts

SLIDE 7

Robust Learning Objective

§ ROPE ensures robustness via a minimax objective
§ The maximum in the objective is over possible distribution shifts P'(x) = P(x - δ)
§ This ensures the explanation e has high fidelity for all distributions P'(x)


[Equation annotations: standard supervised learning loss for P'(x); worst-case taken over distribution shifts]
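The objective itself did not survive the transcript. A plausible LaTeX reconstruction, consistent with the notation on this slide (explanation e, black box f*, shift δ ∈ Δ) and offered only as a sketch:

```latex
% Reconstruction (sketch): minimize the worst-case disagreement with the
% black box f^* over all shifts \delta \in \Delta; \ell is a fidelity loss,
% e.g., the 0/1 disagreement.
\min_{e} \; \max_{\delta \in \Delta} \;
  \mathbb{E}_{x \sim \mathcal{P}}
  \Big[ \ell\big( e(x + \delta),\, f^*(x + \delta) \big) \Big]
```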

SLIDE 8

Robust Learning Objective


§ We can upper bound the objective as follows:
§ Thus, we can approximate the explanation e as follows:

SLIDE 9

Class of Distribution Shifts

§ Key question: How to choose Δ?

§ Determines the distributions P' to which the explanation e is robust

§ Our choice

§ The L0 constraint induces sparsity, i.e., only a few covariates are perturbed
§ The L∞ constraint bounds the magnitude of the perturbation, i.e., covariates do not change too much
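Concretely, the class described by the two bullets above can be written as follows (a reconstruction; the bound names k and b are illustrative, not taken from the slide):

```latex
% Sparse, bounded perturbations: at most k covariates move, each by at most b.
\Delta = \{\, \delta \in \mathbb{R}^d \;:\; \|\delta\|_0 \le k,\;
              \|\delta\|_\infty \le b \,\}
```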

SLIDE 10

Robust Linear Explanations

§ Use adversarial training, i.e., approximate stochastic gradient descent on the objective
§ The worst-case shift δ* can be approximated using a linear program
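The adversarial-training loop can be sketched as below. This is not the paper's method verbatim: the LP-based inner maximization is replaced here by a search over a user-supplied set of candidate shifts, and all names are illustrative.

```python
import numpy as np

def fit_robust_linear_explanation(X, blackbox, deltas, lr=0.1, epochs=50):
    """Adversarial-training sketch: each step picks the candidate shift that
    currently hurts fidelity most, then takes a gradient step on the logistic
    loss at the shifted points (a stand-in for ROPE's LP-based inner max)."""
    n, d = X.shape
    w, b = np.zeros(d), 0.0

    def loss_at(delta):
        Xs = X + delta
        y = blackbox(Xs)                          # labels come from the black box
        p = 1.0 / (1.0 + np.exp(-(Xs @ w + b)))   # explanation's probability
        nll = np.mean(-y * np.log(p + 1e-9) - (1 - y) * np.log(1 - p + 1e-9))
        return nll, Xs, y, p

    for _ in range(epochs):
        # Inner max: worst candidate shift for the current (w, b).
        worst = max(deltas, key=lambda delta: loss_at(delta)[0])
        # Outer min: one gradient step against that shift.
        _, Xs, y, p = loss_at(worst)
        g = p - y
        w -= lr * (Xs.T @ g) / n
        b -= lr * np.mean(g)
    return w, b
```

For a black box like I(x1 ≥ 0), training against shifts of either sign concentrates the learned weight on the first covariate rather than on any correlated one.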



SLIDE 11

Robust Rule Based Explanations

§ Approximate the objective using sampling
§ Adjust the learning algorithm to handle a maximum over a finite set

§ For rule lists and decision sets, only count a point (x, f*(x)) as correct if e(x) = f*(x + δ) for all of the possible perturbations δ


[Figure: distribution over shifts δ ∈ Δ]
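The counting rule on this slide can be sketched in a few lines (names illustrative; `sampled_deltas` plays the role of the sampled shifts δ ∈ Δ):

```python
import numpy as np

def robust_accuracy(X, explanation, blackbox, sampled_deltas):
    """Count a point as correct only if the explanation's prediction matches
    the black box at *every* sampled perturbation of that point."""
    correct = np.ones(len(X), dtype=bool)
    for delta in sampled_deltas:
        correct &= explanation(X) == blackbox(X + delta)
    return correct.mean()
```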

SLIDE 12

Experimental Evaluation

§ Real-world distribution shifts
§ Approach

§ Generate explanation on one distribution (e.g., first court)
§ Evaluate fidelity on shifted distribution (e.g., second court)


Dataset    | # of Cases                 | Attributes                                                  | Outcomes
Bail       | 31K defendants (2 courts)  | Criminal History, Demographic Attributes, Current Offenses  | Bail (Yes/No)
Healthcare | 22K patients (2 hospitals) | Symptoms, Demographic Attributes, Current & Past Conditions | Diabetes (Yes/No)
Academic   | 19K students (2 schools)   | Grades, Absence Rates, Suspensions, Tardiness Scores        | Graduated High School on Time (Yes/No)
SLIDE 13

Experimental Evaluation

§ Baselines

§ LIME, SHAP, MUSE
§ All state-of-the-art post hoc explanation tools

§ Instantiations of ROPE

§ Linear models (comparison to LIME and SHAP)
§ Decision sets (comparison to MUSE)
§ Focus on global explanations

SLIDE 14

Robustness to Real Distribution Shifts

§ Report fidelity on both original and shifted distributions, as well as percentage drop in fidelity
§ ROPE is substantially more robust without sacrificing fidelity on original distribution
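The two quantities reported here can be sketched as follows (fidelity as agreement with the black box, and the percentage drop between the original and shifted distributions; the function names are illustrative, not from the paper):

```python
def fidelity(expl_preds, bb_preds):
    """Fraction of points on which the explanation agrees with the black box."""
    return sum(e == b for e, b in zip(expl_preds, bb_preds)) / len(bb_preds)

def pct_drop(fid_original, fid_shifted):
    """Percentage drop in fidelity from the original to the shifted distribution."""
    return 100.0 * (fid_original - fid_shifted) / fid_original
```

For example, an explanation whose fidelity falls from 0.90 on the original distribution to 0.72 after the shift has a 20% drop.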

SLIDE 15

Percentage Drop in Fidelity vs. Size of Distribution Shift

§ Use synthetic data and vary size of shift
§ Report percentage drop in fidelity

SLIDE 16

Structural Match with the Black Box

§ Choose “black box” from the same model class as explanation (e.g., linear or decision set)
§ Report match between explanation and black box
§ ROPE explanations match black box substantially better

SLIDE 17

Conclusions

§ We have proposed the first framework for generating stable and robust explanations
§ Our approach significantly improves explanation robustness to real-world distribution shifts
