The What-If Tool (WIT)
Interactive Probing of Machine Learning Models
James Wexler, Mahima Pushkarna, Tolga Bolukbasi, Martin Wattenberg, Fernanda Viegas, and Jimbo Wilson
Presented on Nov 19 by Patrick Huber
Problem & Objective
Problem: machine learning models are black boxes, yet they make crucial decisions in the real world, so understanding their behavior is important.
Objective: probe a trained model interactively to understand
○ when it performs well/poorly
○ how a change in the input is reflected in the output (diversity)
Solution: the What-If Tool (WIT), an interactive, code-free interface for probing models.
Black-Box: the model can only be probed through its inputs and outputs; its internals are inaccessible. This is the approach WIT takes.
White-Box: the model's internals (weights, gradients, architecture) can be inspected directly.
○ Proof-of-concept
○ Workshops
Need 1: Test multiple hypotheses with minimal code
Need 2: Use visualizations as a medium for model understanding
Need 3: Test hypotheticals without having access to the inner workings of a model
○ “How would increasing the value of X affect a model’s prediction scores?”
○ “What would need to change in the data point for a different outcome?”
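The second question is what a counterfactual search answers: find the closest datapoint in the loaded dataset that the model scores differently. A minimal sketch of the idea, assuming numeric feature tuples, L1 distance, and a generic `predict` function (all names here are illustrative, not WIT's API):

```python
def nearest_counterfactual(point, dataset, predict):
    """Return the datapoint closest to `point` (L1 distance)
    whose predicted label differs from `point`'s prediction."""
    label = predict(point)
    best, best_dist = None, float("inf")
    for other in dataset:
        if predict(other) == label:
            continue  # same outcome: not a counterfactual
        dist = sum(abs(a - b) for a, b in zip(point, other))
        if dist < best_dist:
            best, best_dist = other, dist
    return best

# Toy black-box model: predict 1 if the feature sum exceeds 10.
predict = lambda x: int(sum(x) > 10)
data = [(2.0, 3.0), (6.0, 5.5), (9.0, 9.0)]
print(nearest_counterfactual((4.0, 4.0), data, predict))  # -> (6.0, 5.5)
```

Note that the search only ever calls `predict`, which is exactly the black-box constraint of Need 3.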
Need 4: Conduct exploratory intersectional analysis of model performance
Need 5: Evaluate potential performance improvements for multiple models
Built using TensorBoard, a code-free and installation-free visualization framework
https://pair-code.github.io/what-if-tool/iris.html
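Besides the TensorBoard dashboard, the tool also ships as a notebook widget via the `witwidget` package. A rough sketch of hooking an arbitrary black-box model into it through a custom prediction function (the dict-style examples and the toy scorer are assumptions for illustration; the accepted example formats and exact signatures are per the package's own docs):

```python
from witwidget.notebook.visualization import WitConfigBuilder, WitWidget

# Toy datapoints as JSON-style dicts (WIT commonly takes
# tf.train.Example protos; dicts are used here for brevity).
examples = [{"age": 39, "hours_per_week": 40},
            {"age": 52, "hours_per_week": 60}]

# WIT only needs a prediction function: a list of examples in,
# a list of per-class score lists out. The model stays a black box.
def predict_fn(batch):
    scores = []
    for ex in batch:
        p = min(1.0, ex["hours_per_week"] / 80.0)  # stand-in "model"
        scores.append([1.0 - p, p])
    return scores

config = WitConfigBuilder(examples).set_custom_predict_fn(predict_fn)
WitWidget(config, height=720)  # renders the tool inline in a notebook
```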
○ Solve a prediction task: classify individuals as high or low income
○ Train 2 models
■ Multi-layer neural network
■ Simple linear classifier
Customizable Analysis
Feature Analysis: Dataset Summary
○ Edit data points
○ Identify counterfactuals
○ Observe partial dependencies
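Observing a partial dependence boils down to re-scoring copies of a single datapoint while one feature sweeps a range and all other features stay fixed. A minimal sketch, with a hypothetical `predict` scoring function standing in for the model:

```python
def partial_dependence(point, index, values, predict):
    """Score the model on copies of `point` with feature `index`
    swept over `values`, holding all other features fixed."""
    curve = []
    for v in values:
        modified = list(point)
        modified[index] = v
        curve.append((v, predict(modified)))
    return curve

# Toy model: score grows with feature 0; feature 1 is held fixed.
predict = lambda x: 0.1 * x[0] + 0.5 * x[1]
for v, score in partial_dependence([2.0, 1.0], 0, [0, 2, 4], predict):
    print(v, round(score, 2))
```

Plotting the returned (value, score) pairs gives the partial-dependence curve shown in the tool.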
○ ROC
○ Confusion Matrix
○ Cost Ratio
Compare models
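All three views rest on one primitive: pick a score threshold, count the four confusion-matrix cells, and, for the cost ratio, weight false positives against false negatives when choosing the threshold. A minimal sketch with illustrative function names (not WIT's API):

```python
def confusion(scores, labels, threshold):
    """Count (TP, FP, TN, FN) for a score cutoff. Each threshold also
    yields one ROC point: (FPR, TPR) = (fp/(fp+tn), tp/(tp+fn))."""
    tp = fp = tn = fn = 0
    for s, y in zip(scores, labels):
        pred = s >= threshold
        if pred and y:
            tp += 1
        elif pred and not y:
            fp += 1
        elif not pred and not y:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn

def best_threshold(scores, labels, fp_cost=1.0, fn_cost=1.0):
    """Pick the cutoff minimizing cost-weighted errors; the
    fp_cost/fn_cost ratio is the 'cost ratio' knob."""
    candidates = sorted(set(scores)) + [1.1]  # 1.1 = predict all negative

    def cost(t):
        tp, fp, tn, fn = confusion(scores, labels, t)
        return fp * fp_cost + fn * fn_cost

    return min(candidates, key=cost)

scores = [0.1, 0.4, 0.6, 0.9]
labels = [0, 0, 1, 1]
print(confusion(scores, labels, 0.5))               # -> (2, 0, 2, 0)
print(best_threshold(scores, labels, fp_cost=4.0))  # -> 0.6
```

Raising `fp_cost` pushes the chosen threshold upward, which is exactly the behavior the cost-ratio slider exposes interactively.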
○ Tabular Data:
■ # Features: 10-100
■ # Datapoints: ~100,000
○ Image Data:
■ Pixel dimensions: 78x64
■ # Datapoints: 2,000
○ As seen before, occlusion is already a problem even at smaller data sizes
User studies:
○ 2 studies in a large software company
○ 1 study in a university environment
Observed uses:
○ Uncover bugs
○ Explore the data
○ Find partial dependencies
○ User data & machine learning models
○ Inference of the model on the data
○ Dataset- and datapoint-level results of ML models
○ A better understanding of model capabilities and of possible adversarial attacks
○ 3 common tasks derived from user studies
○ Extension of an out-of-the-box visualization tool
○ Machine learning models are black boxes
○ They make crucial decisions in the real world
○ Understanding them is important
Strengths:
+ Versatile tool
+ Many useful real-world applications
+ Greatly reduces workload compared to creating one's own visualizations
Weaknesses: