It Takes Two to Tango: Towards Theory of AIs Mind It Takes Two to - - PowerPoint PPT Presentation
It Takes Two to Tango: Towards Theory of AIs Mind It Takes Two to - - PowerPoint PPT Presentation
It Takes Two to Tango: Towards Theory of AIs Mind It Takes Two to Tango: Towards Theory of AIs Mind * Arjun Deshraj Prithvijit Viraj Prabhu Devi Parikh * * * Chandrasekaran Yadav Chattopadhyay Georgia Tech Georgia Tech
It Takes Two to Tango: Towards Theory of AI’s Mind
It Takes Two to Tango: Towards Theory of AI’s Mind
Viraj Prabhu
Georgia Tech
Devi Parikh
Georgia Tech
Arjun Chandrasekaran
Georgia Tech
Deshraj Yadav
Georgia Tech
Prithvijit Chattopadhyay
Georgia Tech
* * * *
Slide: 2
Outline
Slide: 2
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Outline
Slide: 2
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Theory of Mind (ToM)
Slide: 3
Theory of Mind (ToM)
- Ability to attribute mental states to others
Slide: 3
Theory of Mind (ToM)
- Ability to attribute mental states to others
Humans attribute mental states to fellow humans
Slide: 3
Theory of Mind (ToM)
- Ability to attribute mental states to others
Humans attribute mental states to fellow humans
Slide: 3
Theory of Mind (ToM)
- Ability to attribute mental states to others
Humans attribute mental states to fellow humans Make reasonable inferences about their behavior
Slide: 3
Theory of Mind (ToM)
- Ability to attribute mental states to others
Humans attribute mental states to fellow humans Make reasonable inferences about their behavior
- Crucial for collaborative team-performance
Slide: 3
Motivation
Slide: 4
Motivation
- Traditional AI Research
Slide: 4
Motivation
- Traditional AI Research
AI more accurate AI more human-like
Slide: 4
Motivation
- Traditional AI Research
AI more accurate AI more human-like
Slide: 4
Motivation
Slide: 5
Motivation
- As AI progresses rise of collaborative work with AI agents
Slide: 5
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri
Slide: 5
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri Cortana
Slide: 5
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri Cortana Google Assistant
Slide: 5
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri Cortana Google Assistant
Slide: 5
Personal Assistants
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri Cortana Google Assistant IBM Watson
Slide: 5
Personal Assistants
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri Cortana Google Assistant IBM Watson
Slide: 5
Self-driving Cars Personal Assistants
Motivation
- As AI progresses rise of collaborative work with AI agents
Siri Cortana Google Assistant IBM Watson
Slide: 5
Self-driving Cars Personal Assistants Sensitive Applications
Motivation
Slide: 6 Slide Credits: Devi Parikh
Motivation
- Aiding visually impaired users
Slide: 6 Slide Credits: Devi Parikh
Motivation
- Aiding visually impaired users
Slide: 6 Peter just uploaded a picture from his vacation in Hawaii Slide Credits: Devi Parikh
Motivation
- Aiding visually impaired users
Slide: 6 Peter just uploaded a picture from his vacation in Hawaii Great, is he at the beach? Slide Credits: Devi Parikh
Motivation
- Aiding visually impaired users
Slide: 6 Peter just uploaded a picture from his vacation in Hawaii Great, is he at the beach? No, on a mountain Slide Credits: Devi Parikh
Motivation
- Aiding visually impaired users
Slide: 6 Peter just uploaded a picture from his vacation in Hawaii Great, is he at the beach? No, on a mountain ………… Slide Credits: Devi Parikh
Motivation
Slide: 7
Motivation
- Human-AI teams in healthcare
Slide: 7
Motivation
- Human-AI teams in healthcare
Slide: 7
Motivation
- Human-AI teams in healthcare
Slide: 7
Motivation
- Human-AI teams in healthcare
Slide: 7
Report-1
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Report-1
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Report-1 Report-2
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Diagnosis-2 Report-1 Report-2
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Diagnosis-2 Which instance to trust? Report-1 Report-2
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Diagnosis-2 Which instance to trust? Report-1 Report-2
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Diagnosis-2 Which instance to trust?
- Critical for human to have a sense of AI’s
Report-1 Report-2
Motivation
- Human-AI teams in healthcare
Slide: 7
Diagnosis-1 Diagnosis-2 Which instance to trust?
- Critical for human to have a sense of AI’s
Failure Modes Behavior
Report-1 Report-2
Theory of AI’s Mind (ToAIM)
Slide: 8
Theory of AI’s Mind (ToAIM)
- For human-AI teams to be effective, humans must also develop a
theory of AI’s mind
Slide: 8
Theory of AI’s Mind (ToAIM)
- For human-AI teams to be effective, humans must also develop a
theory of AI’s mind
Slide: 8
Theory of AI’s Mind (ToAIM)
- For human-AI teams to be effective, humans must also develop a
theory of AI’s mind
Slide: 8
Theory of AI’s Mind (ToAIM)
- For human-AI teams to be effective, humans must also develop a
theory of AI’s mind
Slide: 8
Theory of AI’s Mind (ToAIM)
- For human-AI teams to be effective, humans must also develop a
theory of AI’s mind
- Predict success, failure and responses
- Approximate a neural network!
Slide: 8
Outline
Slide: 9
Outline
Slide: 9
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Outline
Slide: 9
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
AI Agent
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
What is the child doing?
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
What is the child doing?
Vicki Vision Language
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
What is the child doing?
Vicki
playing baseball
Vision Language
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
What is the child doing?
Vicki
playing baseball
Vision Language Dataset
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
What is the child doing?
Vicki
playing baseball
Vision Language 248349 QI-pairs in train-split Dataset
Slide: 10
AI Agent
- Vicki : A VQA Model
Initial Scope: Visual Question Answering (VQA)
What is the child doing?
Vicki
playing baseball
Vision Language 248349 QI-pairs in train-split Vicki can answer
- nly from the
top-1k answers Dataset
Slide: 10
AI Agent
Slide: 11
AI Agent
- VQA model by Lu, Yang et al. NIPS 2016
Jiasen Lu Jianwei Yang
Slide: 11
AI Agent
- VQA model by Lu, Yang et al. NIPS 2016
- Hierarchical co-Attention model
Jiasen Lu Jianwei Yang
Slide: 11
AI Agent
- VQA model by Lu, Yang et al. NIPS 2016
- Hierarchical co-Attention model
Jiasen Lu Jianwei Yang
Slide: 11
Vicki’s Quirks
Slide: 12
Vicki’s Quirks
- Imperfect vision
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Vicki
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Vicki Q1, I1 A1
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Vicki Q1, I1 Q2, I2 A1 A2
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Vicki Q1, I1 Q2, I2 A1 A2
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Vicki Q1, I1 Q2, I2 Qn, In A1 A2 An
Slide: 12
Vicki’s Quirks
- Imperfect vision
- Limited capability to understand language
- Can’t reason about common-sense
- Limited vocabulary
- Doesn’t understand question-image relevance
- Heavily influenced by dataset biases
Vicki Q1, I1 Q2, I2 Qn, In A1 A2 An Helps us pick on Vicki’s quirks
Slide: 12
Vicki’s Quirks
Slide: 13
Vicki’s Quirks
What color is the grass? Blue
Slide: 13
Vicki’s Quirks
What color is the grass? Blue What are the people doing? Eating
Slide: 13
Vicki’s Quirks
What color is the grass? Blue What are the people doing? Eating How many people are there? 4 What is the man holding? Fire Hydrant
Slide: 13
ToAIM
Slide: 14
ToAIM
- To study/evaluate ToAIM Large-scale experiments on MTurk
Slide: 14
ToAIM
- To study/evaluate ToAIM Large-scale experiments on MTurk
Subjects on AMT Vicki
Slide: 14
ToAIM
- To study/evaluate ToAIM Large-scale experiments on MTurk
Task Interface Subjects on AMT Vicki
Slide: 14
ToAIM
- To study/evaluate ToAIM Large-scale experiments on MTurk
Task Interface Subjects on AMT Vicki Failure Prediction Knowledge Prediction
Slide: 14
ToAIM
Slide: 15
ToAIM
- Failure Prediction
Slide: 15
ToAIM
- Failure Prediction
How many people are there?
Slide: 15
ToAIM
- Failure Prediction
How many people are there? Subject thinks Vicki will answer correctly
Slide: 15
ToAIM
- Failure Prediction
How many people are there? Subject thinks Vicki will answer correctly Correctly
Slide: 15
ToAIM
- Failure Prediction
Slide: 16
ToAIM
Slide: 17
ToAIM
- Knowledge Prediction
Slide: 17
ToAIM
- Knowledge Prediction
How many people are there?
Slide: 17
ToAIM
- Knowledge Prediction
How many people are there? Subject thinks Vicki will answer 4 4
Slide: 17
ToAIM
- Knowledge Prediction
How many people are there? Subject thinks Vicki will answer 4
4
4
Slide: 17
ToAIM
- Knowledge Prediction
Slide: 18
ToAIM
Slide: 19
ToAIM
- We evaluate the role of
Slide: 19
ToAIM
- We evaluate the role of
Training Explanation Modalities
Slide: 19
ToAIM
Slide: 20
ToAIM
- Training via Instant Feedback
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there?
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there? FP
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there? FP
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there? Feedback FP
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there? Feedback 4
4
FP KP
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there? Feedback 4
4 3
FP KP
Slide: 20
ToAIM
- Training via Instant Feedback
How many people are there? Feedback 4
4 3 3
FP KP
Slide: 20
ToAIM
Slide: 21
ToAIM
- Explanation Modalities
Slide: 21
ToAIM
- Explanation Modalities
Slide: 21
Vicki
What is the child doing?
ToAIM
- Explanation Modalities
Slide: 21
- Access to internal states of a model for a prediction
Vicki
What is the child doing? playing baseball Access to internal states of a model
ToAIM
- Explanation Modalities
Slide: 22
ToAIM
- Explanation Modalities
Slide: 22
QI-Attention
- Explicit question-image attention maps from HieCoAtt
ToAIM
- Explanation Modalities
Slide: 22
QI-Attention
- Explicit question-image attention maps from HieCoAtt
Which words in the question to listen to?
ToAIM
- Explanation Modalities
Slide: 22
QI-Attention
- Explicit question-image attention maps from HieCoAtt
Which words in the question to listen to? Which regions in the image are important?
ToAIM
- Explanation Modalities
Slide: 23
ToAIM
- Explanation Modalities
Slide: 23
Grad-CAM
- Grad-CAM: implicit attention mechanism
ToAIM
- Explanation Modalities
Slide: 23
Grad-CAM
- Grad-CAM: implicit attention mechanism
Attention visualization corresponding to Vicki’s most confident answer
ToAIM
- Explanation Modalities
Slide: 24
ToAIM
- Explanation Modalities
Slide: 24
How many people are there?
- Vicki’s confidence in top-5 answers
Top-5 answer confidence
ToAIM
- Explanation Modalities
Slide: 24
How many people are there?
- Vicki’s confidence in top-5 answers
Vicki’s confidence in the top-5 answers without revealing the answers Top-5 answer confidence
ToAIM
Slide: 25
ToAIM
- Training + Explanation Modalities
Slide: 25
ToAIM
- Training + Explanation Modalities
How many people are there?
+
How many people are there? Slide: 25
ToAIM
- Training + Explanation Modalities
How many people are there?
+
How many people are there? Slide: 25
ToAIM
- Training + Explanation Modalities
How many people are there?
+
How many people are there? Slide: 25
ToAIM
- Training + Explanation Modalities
Feedback
How many people are there?
+
How many people are there? Slide: 25
ToAIM
- Training + Explanation Modalities
Feedback 4
How many people are there?
+
How many people are there? Slide: 25
ToAIM
- Training + Explanation Modalities
Feedback 4
4
How many people are there?
+
How many people are there?
3
Slide: 25
ToAIM
- Training + Explanation Modalities
Feedback 4
4 3
How many people are there?
+
How many people are there?
3
Slide: 25
Experimental Results
Slide: 26
Experimental Results
Failure Prediction
Slide: 26
Experimental Results
IF+QI-Att IF+Top-5 IF+GCAM IF FP No Train Accuracy
Failure Prediction
Slide: 26 100 90 80 70 60 50 40 30 20 10
Experimental Results
IF+QI-Att IF+Top-5 IF+GCAM IF FP No Train Accuracy
Failure Prediction
Training Helps Slide: 26 100 90 80 70 60 50 40 30 20 10
Experimental Results
IF+QI-Att IF+Top-5 IF+GCAM IF FP No Train Accuracy
Failure Prediction
Training Helps
- Exp. Mod. don’t help
Slide: 26 100 90 80 70 60 50 40 30 20 10
Experimental Results
Slide: 27
Experimental Results
Knowledge Prediction
Slide: 27
Experimental Results
IF+QI-Att IF+Top-5 IF+GCAM IF KP No Train Accuracy
Knowledge Prediction
Slide: 27 100 75 50 25
Experimental Results
IF+QI-Att IF+Top-5 IF+GCAM IF KP No Train Accuracy
Knowledge Prediction
Training Helps Slide: 27 100 75 50 25
Experimental Results
IF+QI-Att IF+Top-5 IF+GCAM IF KP No Train Accuracy
Knowledge Prediction
Training Helps
- Exp. Mod. don’t help
Slide: 27 100 75 50 25
Outline
Slide: 28
Outline
Slide: 28
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Outline
Slide: 28
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Take-away messages
Slide: 29
Take-away messages
- Advocate a research agenda towards ToAIM
Slide: 29
Take-away messages
- Advocate a research agenda towards ToAIM
- Lay people successfully form ToM with a few (50) examples
Slide: 29
Take-away messages
- Advocate a research agenda towards ToAIM
- Lay people successfully form ToM with a few (50) examples
- Existing explanation modalities don’t help in predicting
what AI will do
Slide: 29
Take-away messages
- Advocate a research agenda towards ToAIM
- Lay people successfully form ToM with a few (50) examples
- Existing explanation modalities don’t help in predicting
what AI will do
- Novel evaluation protocol for explanation modalities
Slide: 29
Take-away messages
- Advocate a research agenda towards ToAIM
- Lay people successfully form ToM with a few (50) examples
- Existing explanation modalities don’t help in predicting
what AI will do
- Novel evaluation protocol for explanation modalities
Slide: 29
ArXiv
Slide: 30 Slide Credits: Devi Parikh
Outline
Slide: 31
Outline
Slide: 31
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Outline
Slide: 31
Theory of AI’s mind (ToAIM): Motivation Theory of AI’s mind (ToAIM): Experimental Setup and Results Theory of AI’s mind (ToAIM): Take-away messages Theory of AI’s mind (ToAIM): Ongoing work - Human-AI Games
Ongoing Work
Slide: 32
Ongoing Work
- Evaluate human-AI collaborative performance
Slide: 32
Ongoing Work
- Evaluate human-AI collaborative performance
- Goal-driven tasks (cooperative human-AI games)
Slide: 32
Ongoing Work
- Evaluate human-AI collaborative performance
- Goal-driven tasks (cooperative human-AI games)
Visual 20 Questions GuessWhich
Slide: 32
Ongoing Work
- Evaluate human-AI collaborative performance
- Goal-driven tasks (cooperative human-AI games)
Visual 20 Questions GuessWhich
Slide: 32
GuessWhich
Slide: 33
GuessWhich
- Players: 2 Agents - Questioner - and Answerer -
Slide: 33
Q A
GuessWhich
- Players: 2 Agents - Questioner - and Answerer -
Slide: 33
- Parameters:
Q A
GuessWhich
- Players: 2 Agents - Questioner - and Answerer -
Slide: 33
- Parameters:
- Fixed pool of images
Q A
GuessWhich
- Players: 2 Agents - Questioner - and Answerer -
Slide: 33
- Parameters:
- Fixed pool of images
- Fixed number of rounds of dialog (10)
Q A
GuessWhich
- Players: 2 Agents - Questioner - and Answerer -
Slide: 33
- Parameters:
- Fixed pool of images
- Fixed number of rounds of dialog (10)
- In our implementation, Human (Questioner) and AI (Answerer)
Q A
GuessWhich
Slide: 34
GuessWhich
Slide: 34
A selects an image from
GuessWhich
Slide: 34
A selects an image from
GuessWhich
Slide: 34
A selects an image from
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
A andQ are given a caption to get a idea about
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
A andQ are given a caption to get a idea about
makes a guess about
Q
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
Q asks questions so as to locate A andQ are given a caption to get a idea about
makes a guess about
Q
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
Q asks questions so as to locate
from
A andQ are given a caption to get a idea about
makes a guess about
Q
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
Q asks questions so as to locate
from
A answersQ’s questions according to A andQ are given a caption to get a idea about
makes a guess about
Q
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
Q asks questions so as to locate
from
A answersQ’s questions according to
makes a guess about
Q A andQ are given a caption to get a idea about
makes a guess about
Q
GuessWhich
Q
Slide: 34
A selects an image from
unknown to
Q asks questions so as to locate
from
A answersQ’s questions according to
makes a guess about
Q
after every round of dialog
A andQ are given a caption to get a idea about
makes a guess about
Q
GuessWhich
Slide: 35
GuessWhich
Slide: 35
- Human-AI Team: Human (Questioner) and AI (Answerer)
GuessWhich
Slide: 35
- Human-AI Team: Human (Questioner) and AI (Answerer)
- Image retrieval setting. Metrics: mean-rank & mean-reciprocal
rank
GuessWhich
Slide: 35
- Human-AI Team: Human (Questioner) and AI (Answerer)
- Image retrieval setting. Metrics: mean-rank & mean-reciprocal
rank
- Our AI Agent Visual Conversation Agents
GuessWhich
Slide: 35
- Human-AI Team: Human (Questioner) and AI (Answerer)
- Image retrieval setting. Metrics: mean-rank & mean-reciprocal
rank
- Our AI Agent Visual Conversation Agents
- Das & Kottur et al. ICCV 2017
Abhishek Das Satwik Kottur
GuessWhich
Slide: 35
- Human-AI Team: Human (Questioner) and AI (Answerer)
- Image retrieval setting. Metrics: mean-rank & mean-reciprocal
rank
- Our AI Agent Visual Conversation Agents
- Das & Kottur et al. ICCV 2017
Abhishek Das Questioner Answerer Satwik Kottur
Slide: 37
Slide: 37
Slide: 38
That’s all folks! Questions?
Slide: 38
- Interface videos: https://deshraj.github.io/TOAIM/
- Interfaces: https://github.com/deshraj/TOAIM/tree/master/Assets/Interfaces
- Guess-Which: http://gw.cloudcv.org/