Visual Turing Test: defining a challenge
Mateusz Malinowski


Visual Turing Test challenge

The task involves:
• Object detection
• Spatial reasoning (in front, inside, left, right, on)
• Natural language understanding

• Ask about the content of the image
  ‣ How many sofas? 3
  ‣ Where is the lamp? on the table, close to tv
  ‣ What is behind the largest table? tv
  ‣ What is the color of the walls? purple

M. Malinowski | Question Answering

Roadmap

Pipeline: question x → semantic parsing (parameters θ) → logical form z → evaluation (world w) → answer y

  z ∼ p_θ(z | x)
  y = ⟦z⟧_w

Example: "state with the largest area" is parsed into a DCS tree (state, area, argmax) and evaluates to Alaska.

Descriptions and their logical forms:
‣ monitor to the left of the mugs
  λx.∃y. monitor(x) ∧ left-rel(x, y) ∧ mug(y)
‣ mug to the left of the other mug
  λx.∃y. mug(x) ∧ left-rel(x, y) ∧ mug(y)
‣ objects on the table
  λx.∃y. object(x) ∧ on-rel(x, y) ∧ table(y)
‣ two blue cups are placed near to the computer screen
  λx. blue(x) ∧ cup(x) ∧ comp.(x) ∧ screen(x)

References:
‣ Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World (J. Krishnamurthy et al., TACL 2013)
‣ Learning Dependency-Based Compositional Semantics (P. Liang et al., ACL 2011)

Some ideas?

M. Malinowski | Grounding
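The evaluation step y = ⟦z⟧_w can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the toy world, the object ids, and the helper name `denotation` are made up here and are not taken from either cited paper.

```python
# Toy world w: object ids grouped by category, plus a left-of relation.
# All ids and facts are hypothetical and chosen only for illustration.
OBJECTS = {1, 2, 3}
WORLD = {
    "monitor": {2},
    "mug": {1, 3},
    "left-rel": {(2, 1), (2, 3), (3, 1)},
}


def denotation(cat_x, rel, cat_y, world=WORLD, objects=OBJECTS):
    """Evaluate  λx.∃y. cat_x(x) ∧ rel(x, y) ∧ cat_y(y)  against the world."""
    return {
        x
        for x in objects
        if x in world[cat_x]
        and any((x, y) in world[rel] and y in world[cat_y] for y in objects)
    }


# "monitor to the left of the mugs"
monitors_left_of_mugs = denotation("monitor", "left-rel", "mug")
# "mug to the left of the other mug"
mugs_left_of_mugs = denotation("mug", "left-rel", "mug")
```

The existential quantifier becomes `any(...)` over candidate objects, and the lambda abstraction becomes the set comprehension over x.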

Two dimensions of language understanding

[Figure: precision vs. recall plot with four labeled points: Old AI, Google, Percy's work, Our dream]

Semantic parser: The Big Picture

What is the most populous city in California? → System (with Database) → Los Angeles

Two kinds of supervision:

• Expensive: logical forms
  [Zelle & Mooney, 1996; Zettlemoyer & Collins, 2005; Wong & Mooney, 2007; Kwiatkowski et al., 2010]
  ‣ What is the most populous city in California?
    ⇒ argmax(λx. city(x) ∧ loc(x, CA), λx. pop.(x))
  ‣ How many states border Oregon?
    ⇒ count(λx. state(x) ∧ border(x, OR))
  ‣ · · ·

• Cheap: answers
  [Clarke et al., 2010; this work]
  ‣ What is the most populous city in California?
    ⇒ Los Angeles
  ‣ How many states border Oregon?
    ⇒ 3
  ‣ · · ·

The probabilistic framework

• Semantic parsing: p(z | x, θ)
• Interpretation: p(y | z, w)

Example: x = "capital of California?", z = DCS tree (capital, CA), w = database, y = Sacramento, θ = parameters

Objective:
  max_θ Σ_z p(y | z, w) p(z | x, θ)

Learning:
‣ enumerate/score DCS trees → k-best list (tree1, tree2, tree3, tree4, tree5)
‣ numerical optimization (L-BFGS) → parameters θ, e.g. (0.2, −1.3, ..., 0.7)
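The objective above can be made concrete with a small Python sketch. It assumes a log-linear form for p(z | x, θ) over a k-best list and a deterministic interpretation, so p(y | z, w) is 1 when the candidate's denotation equals the answer and 0 otherwise. The feature vectors, denotations, and function names are hypothetical, and no optimizer is included; the slide names L-BFGS for that step.

```python
import math


def parse_probs(features, theta):
    """Log-linear p(z | x, θ) over a k-best list of candidate parses."""
    scores = [sum(t * f for t, f in zip(theta, phi)) for phi in features]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def log_likelihood(features, denotations, answer, theta):
    """log Σ_z p(y | z, w) p(z | x, θ), with p(y | z, w) = 1 iff ⟦z⟧_w == y."""
    probs = parse_probs(features, theta)
    mass = sum(p for p, d in zip(probs, denotations) if d == answer)
    return math.log(mass) if mass > 0 else float("-inf")


# Hypothetical k-best list for x = "capital of California?":
# one feature vector and one denotation per candidate tree.
features = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
denotations = ["Sacramento", "Los Angeles", "Sacramento"]

uniform = log_likelihood(features, denotations, "Sacramento", theta=(0.0, 0.0))
tuned = log_likelihood(features, denotations, "Sacramento", theta=(2.0, -1.0))
```

With θ = (0, 0) the three candidates are equally likely, so two thirds of the mass falls on trees that yield Sacramento. A θ that favors the first feature moves more mass onto those trees, which is what maximizing the objective rewards.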

Challenges of semantic parsing

Question: What is the most populous city in California?
Target answer: Los Angeles

Candidate logical forms:
‣ λx. city(x) ∧ loc(x, CA)
‣ λx. state(x) ∧ border(x, CA)
‣ argmax(λx. city(x) ∧ loc(x, CA), λx. population(x))

Challenges of semantic parsing

Words to Predicates (Lexical Semantics)

What is the most populous city in CA?

Lexical Triggers:
1. String match: CA ⇒ CA
2. Function words (20 words): most ⇒ argmax
3. Nouns/adjectives: city ⇒ city, state, river, population
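The three trigger rules can be sketched as a lookup from tokens to candidate predicates. This is a minimal illustration, not the lexicon of the cited system: the dictionaries hold only the entries visible on the slide, the part-of-speech tags are hand-assigned, and every name in the code is hypothetical.

```python
# Rule 1: string match against database constants.
ENTITIES = {"CA": "CA"}
# Rule 2: a small closed set of function words (only one of the ~20 shown here).
FUNCTION_WORDS = {"most": ["argmax"]}
# Rule 3: nouns and adjectives may trigger any open-class predicate.
OPEN_CLASS_PREDICATES = ["city", "state", "river", "population"]
OPEN_CLASS_TAGS = {"NOUN", "ADJ"}


def candidate_predicates(token, pos_tag):
    """Return the predicates a token may trigger under the three rules."""
    if token in ENTITIES:
        return [ENTITIES[token]]
    if token.lower() in FUNCTION_WORDS:
        return list(FUNCTION_WORDS[token.lower()])
    if pos_tag in OPEN_CLASS_TAGS:
        return list(OPEN_CLASS_PREDICATES)
    return []


# Hand-tagged question; the tags are assumptions, not tagger output.
tagged = [
    ("What", "PRON"), ("is", "VERB"), ("the", "DET"), ("most", "ADV"),
    ("populous", "ADJ"), ("city", "NOUN"), ("in", "ADP"), ("CA", "PROPN"),
    ("?", "PUNCT"),
]
lexicon = {token: candidate_predicates(token, tag) for token, tag in tagged}
```

Rule 3 is deliberately loose: "city" triggers every open-class predicate, not just `city`, so choosing the right one is left to the learned parser.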

Dependency-based compositional semantics

Solution: Mark-Execute

Superlatives: "most populous city in California"

[Figure: DCS tree rooted at city, with a population child marked c (argmax) at syntactic scope and a loc child pointing to CA]

Results

On Geo, 600 training examples, 280 test examples

System   Description                                    Test accuracy
zc05     CCG [Zettlemoyer & Collins, 2005]              79.3%
zc07     relaxed CCG [Zettlemoyer & Collins, 2007]      86.1%
kzgs10   CCG w/unification [Kwiatkowski et al., 2010]   88.9%
dcs      our system                                     88.6%
dcs+     our system                                     91.1%

[Table also has supervision columns: Lexicon, Logical forms]

Grounding problem

[Figure: the phrases "The mugs" and "A mug left of the monitor", each mapped to a set of image regions]

Question answering problem

How high is the highest point in the largest state? → 6.000 m

Pipeline: question Q → semantic parsing → logical form T → evaluation (universe W) → answer A

References:
‣ P. Liang, M. Jordan, D. Klein. Learning Dependency-Based Compositional Semantics. ACL'11.
‣ J. Berant, A. Chou, R. Frostig, and P. Liang. Semantic Parsing on Freebase from Question-Answer Pairs. EMNLP'13.

Question answering problem

What is in front of sofa in image 1? → table

Pipeline: question Q → semantic parsing → logical form T → evaluation (universe W) → answer A

Our knowledge base (produced by scene analysis):
  sofa (1, brown, image 1, X,Y,Z)
  table (1, brown, image 1, X,Y,Z)
  wall (1, white, image 1, X,Y,Z)
  bed (1, white, image 2, X,Y,Z)
  chair (1, brown, image 4, X,Y,Z)
  chair (2, brown, image 4, X,Y,Z)
  chair (1, brown, image 5, X,Y,Z)
  …
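A question such as "What is in front of sofa in image 1?" could be answered against facts of this shape roughly as follows. The sketch rests on several assumptions that the slide does not state: the coordinates are invented, Z is taken to be distance from the camera, and "in front of" is defined as a smaller Z within a lateral offset. The `Fact` type and `in_front_of` helper are hypothetical.

```python
from typing import NamedTuple


class Fact(NamedTuple):
    category: str
    instance: int
    color: str
    image: int
    x: float
    y: float
    z: float  # assumed to be the distance from the camera


# Hypothetical coordinates; the slide leaves X, Y, Z unspecified.
KB = [
    Fact("sofa", 1, "brown", 1, 0.0, 0.0, 3.0),
    Fact("table", 1, "brown", 1, 0.2, 0.0, 2.0),
    Fact("wall", 1, "white", 1, 0.0, 1.0, 4.0),
    Fact("bed", 1, "white", 2, 0.0, 0.0, 2.5),
]


def in_front_of(category, image, kb=KB, max_offset=1.0):
    """Categories of objects closer to the camera than the reference object."""
    refs = [f for f in kb if f.category == category and f.image == image]
    return sorted({
        f.category
        for ref in refs
        for f in kb
        if f.image == image
        and f is not ref
        and f.z < ref.z
        and abs(f.x - ref.x) <= max_offset
    })


answer = in_front_of("sofa", image=1)
```

The spatial predicate is computed from geometry at query time instead of being stored as a fact, so a different definition of "in front of" only changes the helper, not the knowledge base.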

Results

Columns: Environment d · Language z and predicted logical form ℓ · Predicted grounding · True grounding

‣ monitor to the left of the mugs
  λx.∃y. monitor(x) ∧ left-rel(x, y) ∧ mug(y)
  Predicted grounding: {(2, 1), (2, 3)}
  True grounding: {(2, 1), (2, 3)}
‣ mug to the left of the other mug
  λx.∃y. mug(x) ∧ left-rel(x, y) ∧ mug(y)
  Predicted grounding: {(3, 1)}
  True grounding: {(3, 1)}
‣ objects on the table
  λx.∃y. object(x) ∧ on-rel(x, y) ∧ table(y)
  Predicted grounding: {(1, 4), (2, 4), (3, 4)}
  True grounding: {(1, 4), (2, 4), (3, 4)}
‣ two blue cups are placed near to the computer screen
  λx. blue(x) ∧ cup(x) ∧ comp.(x) ∧ screen(x)
  Predicted grounding: {(1)}
  True grounding: {(1, 2), (3, 2)}

Denotation γ   0 rel.   1 rel.   other   total
LSP-CAT        0.94     0.45     0.20    0.51
LSP-F          0.89     0.81     0.20    0.70
LSP-W          0.89     0.77     0.16    0.67

Grounding g    0 rel.   1 rel.   other   total
LSP-CAT        0.94     0.37     0.00    0.42
LSP-F          0.89     0.80     0.00    0.65
LSP-W          0.89     0.70     0.00    0.59

% of data      23       56       21      100

(a) Results on the SCENE data set.

Current limitations

• Language
  ‣ At most 1 relation
  ‣ Doesn't model more complex phenomena (negations, superlatives, …)
• Vision
  ‣ Dataset is restricted
  ‣ No uncertainty

Example sentences:
• A computer system is on the table
• There are items on the desk
• There are two cups on the table
• The computer is off


Our suggestions

• Language
  ‣ At most 1 relation
  ‣ Doesn't model more complex phenomena (negations, superlatives, …)
• Vision
  ‣ Dataset is restricted
  ‣ No uncertainty

Sentences:
• A computer system is on the table
• There are items on the desk
• There are two cups on the table
• The computer is off

Questions:
• What is the object in front of the photocopying machine attached to the wall?
• What is the object that is placed on the middle rack of the stand that is placed close to the wall?
• What is the time showing on the clock?
