On The Universality of Visual and Multimodal Representations - PowerPoint PPT Presentation



SLIDE 1

Youssef Tamaazousti | Ph.D. Defense

2018 | Tamaazousti Youssef

June 1st, 2018

On The Universality of Visual and Multimodal Representations

Jury Mathieu Cord Céline Hudelot Hervé Le Borgne Pablo Piantanida Philippe-Henri Gosselin Iasonas Kokkinos Florent Perronnin

SLIDE 2

AI today: high-performing systems in many tasks and domains

[Examples: monitoring, robotics, sport, transport, security, medical]

SLIDE 3

Learning-based AI

[Diagram: raw data → Representation Extractor (F) → Task-Solving model (G) → task]

  • Learning-based AI
  • Aims at performing tasks from raw data

SLIDE 4

Learning-based AI

[Diagram: raw data → Representation Extractor (F) → Task-Solving (G) → task]

  • Learning-based AI
  • Aims at performing tasks from raw data
  • Consists of a Representation-extractor (F) and a Task-solver (G)

SLIDE 5

Learning-based AI

  • Aims at performing tasks from raw data
  • Consists of a Representation-extractor (F) and a Task-solver (G)
  • Main characteristics:
    • F is learned from data
    • F and G are learned jointly
    • G can be omitted and F used with another G to solve another task: "transferability"

[Diagram: raw data → Representation Extractor (F) → Task-Solving (G) → task]
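The transferability idea can be sketched in a few lines: freeze the learned extractor F and train only a new task-solver G on a different task. Everything below (the random-projection F, the toy blobs, the logistic-regression head G) is a hypothetical stand-in for illustration, not the networks used in the thesis:

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen representation extractor F (stand-in for a pretrained CNN layer).
W_f = rng.normal(size=(4, 8))

def F(x):
    """Map raw data (n, 4) to an 8-D representation; F is never retrained."""
    return np.maximum(x @ W_f, 0.0)  # ReLU features

def train_G(X, y, lr=0.1, epochs=300):
    """New task-solver G: a logistic-regression head on top of the frozen F."""
    Z = F(X)
    w = np.zeros(Z.shape[1])
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(Z @ w)))  # sigmoid predictions
        w -= lr * Z.T @ (p - y) / len(y)    # gradient step on G only
    return w

# Another task: two toy Gaussian classes, solved by re-using the same F.
X = np.vstack([rng.normal(-1, 1, (50, 4)), rng.normal(1, 1, (50, 4))])
y = np.array([0] * 50 + [1] * 50)
w = train_G(X, y)
accuracy = float(np.mean(((F(X) @ w) > 0) == y))
```

Only `w` (the parameters of G) is updated; swapping in a different target task only requires training another small head on the same frozen features.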

SLIDE 6

  • Goal in the literature:
    • Learning a model (F and G) in order to excel at a given task

Learning-based AI

[Diagram: raw data → Representation Extractor (F) → Task-Solving (G) → task]

SLIDE 7

Challenge

  • Learning a universal model: a model that provides
    • a high-level representation of raw data of different natures (modalities, visual domains and semantic domains), and
    • high task-solving abilities for different tasks (recognition, detection, segmentation, etc.).

SLIDE 8

Motivation

  • Humans: able to perform an enormous variety of different tasks.
  • Machines: able to perform one task at a time ("expert models")

SLIDE 9

Motivation

  • Humans: able to perform an enormous variety of different tasks.
  • Machines: able to perform one task at a time ("expert models")

Humans develop a powerful internal representation in their infancy and re-use it later in life to solve many problems [Atkinson, OPP'00]

SLIDE 10

Motivation

  • Universality: recent growing interest in the AI community
  • Motivations of other works:
    ○ Same motivation as ours: "mimic" humans
      ■ [Bilen & Vedaldi, ArXiv'17]; [Rebuffi et al., NIPS'17]; [Nie et al., ArXiv'17]; [Rebuffi et al., CVPR'18]
    ○ Practical motivation: even to build an expert AI, it is always beneficial to have a good starting point (a universal model)
      ■ [Conneau et al., EACL'17]; [Conneau et al., EMNLP'17]; [Cer et al., ArXiv'18]; [Subramanian & Bengio, ICLR'18]
    ○ Build a "Swiss-army knife" that may be useful for general AI
      ■ [Kokkinos, CVPR'17]; [Wang et al., WACV'18]

SLIDE 11

General Problem Formulation

  • There are at least two different aspects from which to address the problem

[Diagram: raw data → Representation Extractor (F) → Task-Solving (G) → task]

SLIDE 12

General Problem Formulation

  • There are at least two different aspects from which to address the problem
    ○ Universal Task-Solving: make G able to handle the largest set of tasks → GENERAL AI
      [Kokkinos, CVPR'17]; [Wang et al., WACV'18]

[Diagram: raw data → Representation Extractor (F) → Task-Solving (G) → task]

SLIDE 13

General Problem Formulation

  • There are at least two different aspects from which to address the problem
    ○ Universal Task-Solving: make G able to handle the largest set of tasks → GENERAL AI
      [Kokkinos, CVPR'17]; [Wang et al., WACV'18]
    ○ Universal Representation-Extractor: make F able to handle the largest set of modalities, visual & semantic domains → UNIVERSAL REPRESENTATIONS
      [Bilen & Vedaldi, ArXiv'17]; [Rebuffi et al., NIPS'17]; [Nie et al., ArXiv'17]; [Rebuffi et al., CVPR'18]; [Conneau et al., EACL'17]; [Conneau et al., EMNLP'17]; [Cer et al., ArXiv'18]; [Subramanian & Bengio, ICLR'18]

[Diagram: raw data → Representation Extractor (F) → Task-Solving (G) → task]

SLIDE 14

Problem Formulation (1/4)

  • A priori, no representation is completely universal
  • Learned representations contain some level of universality
  • Our goal: increase the universality of the representation

SLIDE 15

Problem Formulation (2/4)

  • Learning algorithm: (deep) neural networks
  • Data: visual or multimodal (visual & textual)

SLIDE 16

Problem Formulation (3/4)

  • Learning strategy:
    ○ Supervised (performs better than semi-supervised and unsupervised approaches)
    ○ With many annotated data

SLIDE 17

Problem Formulation (4/4)

  • Evaluation scenario of universality, close to [Atkinson, OPP'00]: humans learn a visual representation of the world in their infancy and use it (as-is) later in life to solve different problems
    a. In a transfer-learning scheme: infancy = source-task; later = target-task
    b. As-is: without modifying the learned representation
    c. Different problems: a large set of Undetermined Target-Tasks (UTT)
  • Close to the real world: most tasks (in academia & industry) have few annotated data, because data are hard to collect & annotate
    d. UTT with few annotated data
    e. Aggregated performance on the set of UTT

SLIDE 18

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Features Learned with Explicit Supervision
    • Universality in Features Learned with Implicit Supervision
    • Universality via Multimodal Representations
  • Conclusions
  • Perspectives

SLIDE 19

S.O.T.A: Positioning

| Works | Univ. Aspect | Mod. | Eval. Scenario | Source-task | Goal |
|---|---|---|---|---|---|
| [Conneau et al., EACL'17]; [Conneau et al., EMNLP'17] | Representation | Textual | Transfer Learning | 1 domain - 1 task | Best tasks & algorithm |
| [Cer et al., ArXiv'17] | Representation | Textual | Transfer Learning | 1 domain - no annotation | Tricks to automatically get annotations |
| [Subramanian & Bengio, ICLR'18] | Representation | Textual | Transfer Learning | Multi-task | Learn many data with few parameters |
| [Kokkinos, CVPR'17]; [Wang et al., WACV'18] | Task-Solving | Visual | End2End | Multi-task | |
| [Bilen & Vedaldi, ArXiv'17]; [Rebuffi et al., NIPS'17] | Representation | Visual | Transfer Learning | Multi-domain - 1 task | |
| [Rebuffi et al., CVPR'18] | Representation | Visual | Fine-Tuning | Multi-domain - 1 task | |
| This Thesis | Representation | Visual & Multimodal | Transfer Learning | 1 domain - 1 task | Tricks to automatically get more annotations |

SLIDE 20

S.O.T.A: Positioning

(table identical to the previous slide)

SLIDE 21

S.O.T.A: Positioning

(table identical to the previous slides)

SLIDE 22

S.O.T.A: Positioning

(table identical to the previous slides, with the "Source-task" column renamed "SP Domain-Task")

SLIDE 23

S.O.T.A: Positioning

| Works | Univ. Aspect | Mod. | Eval. Scenario | SP Domain-Task | Approach |
|---|---|---|---|---|---|
| [Conneau et al., EACL'17]; [Conneau et al., EMNLP'17] | Representation | Textual | Transfer Learning | 1 domain - 1 task | Best task & algorithm |
| [Cer et al., ArXiv'17] | Representation | Textual | Transfer Learning | 1 domain - no annotation | Tricks to automatically get annotations |
| [Subramanian & Bengio, ICLR'18] | Representation | Textual | Transfer Learning | Multi-task | Best tasks & algorithm |
| [Kokkinos, CVPR'17]; [Wang et al., WACV'18] | Task-Solving | Visual | End2End | Multi-task | Get a better learning algorithm |
| [Bilen & Vedaldi, ArXiv'17]; [Rebuffi et al., NIPS'17] | Representation | Visual | Transfer Learning | Multi-domain - 1 task | Domain-specific scaling parameters |
| [Rebuffi et al., CVPR'18] | Representation | Visual | Fine-Tuning | Multi-domain - 1 task | |
| This Thesis | Representation | Visual & Multimodal | Transfer Learning | 1 domain - 1 task | Automatically get more annotations |

SLIDE 24

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Image Representations Learned w/ Explicit Supervision
    • Universality in Image Representations Learned w/ Implicit Supervision
    • Universality in Multimodal Representations Learned w/ Implicit Supervision
  • Conclusions
  • Perspectives

SLIDE 25

Evaluation of Universality

[Diagram: source task → Representation Extractor (trained on the source task) → extracted representations of the target task's training and test sets → train a simple task predictor on the training set, without modifying the representation → test → evaluate using standard metrics]

[Donahue et al., ICML'14], [Zeiler & Fergus, ECCV'14], [Agrawal et al., ECCV'14], [Oquab et al., CVPR'14], [Razavian et al., CVPRW'14], etc.
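This evaluation protocol can be sketched as follows; the random-projection extractor, the toy data, and the nearest-class-centroid predictor are illustrative assumptions (the experiments in the thesis use CNN features with simple predictors such as SVMs):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical frozen extractor learned on the source task (a fixed random
# projection + ReLU standing in for an internal CNN layer).
W = rng.normal(size=(4, 16))

def extract(x):
    return np.maximum(x @ W, 0.0)

# Toy target task with few annotated data: two classes of raw 4-D samples.
X_train = np.vstack([rng.normal(-1, 0.5, (10, 4)), rng.normal(1, 0.5, (10, 4))])
y_train = np.array([0] * 10 + [1] * 10)
X_test = np.vstack([rng.normal(-1, 0.5, (20, 4)), rng.normal(1, 0.5, (20, 4))])
y_test = np.array([0] * 20 + [1] * 20)

# Train a simple predictor on the extracted (and never modified)
# representation: here a nearest-class-centroid classifier.
Z_train, Z_test = extract(X_train), extract(X_test)
centroids = np.stack([Z_train[y_train == c].mean(axis=0) for c in (0, 1)])
pred = np.argmin(((Z_test[:, None, :] - centroids) ** 2).sum(-1), axis=1)

accuracy = float((pred == y_test).mean())  # standard metric on the target task
```

The representation itself stays fixed; only the cheap predictor on top is fit to each target task, which is what makes the aggregated target-task scores a probe of the representation's universality.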

SLIDE 26

Evaluation of Universality

[Illustration: heterogeneous per-task scores: 24s, 1.7m, 3/10, 8/10]

SLIDE 27

Evaluation of Universality

[Illustration: two methods' per-task scores: 24s, 1.7m, 3/10, 8/10 vs. 17s, 1.5m, 4/10, 9/10]

What is desirable for the evaluation:
  • Coherent aggregation

SLIDE 28

Evaluation of Universality

[Illustration: two methods' per-task scores: 24s, 1.7m, 3/10, 8/10 vs. 17s, 1.5m, 4/10 (+1), 9/10 (+1)]

What is desirable for the evaluation:
  • Coherent aggregation, merit bonus

SLIDE 29

Evaluation of Universality

[Illustration: two methods' per-task scores: 24s, 1.7m, 3/10, 8/10 vs. 17s, 1.5m, 4/10, 9/10]

What is desirable for the evaluation:
  • Coherent aggregation, merit bonus, penalty for damage

SLIDE 30

Evaluation of Universality

[Illustration: two methods' per-task scores: 24s, 1.7m, 3/10, 8/10 vs. 50s, 4.5m, 1/10, 1/10]

What is desirable for the evaluation:
  • Coherent aggregation, merit bonus, penalty for damage, robustness to outliers

SLIDE 31

Evaluation of Universality

[Illustration: two methods' per-task scores: 24s, 1.7m, 3/10, 8/10 vs. 17s, 1.5m, 4/10, 9/10]

What is desirable for the evaluation:
  • Coherent aggregation, merit bonus, penalty for damage, robustness to outliers, consistency over time

SLIDE 32

Evaluation of Universality

  • Average raw scores (Avg) [baseline]
  • Visual Decathlon Challenge (VDC) [Rebuffi et al., NIPS'17]
    ○ Average classification-error gain over a baseline
  • Borda Count (BC) [ours]
    ○ Based on order statistics
  • Average/Median Relative Gain (aRG / mRG) [ours]
    ○ Based on the relative gain compared to a reference
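A small numpy sketch of these aggregation families on toy per-task accuracies (the score values and method names below are made up, and the exact metric definitions in the thesis may differ):

```python
import numpy as np

# Toy per-task accuracies of 3 representations on 4 target tasks;
# "ref" is the reference representation the gains are computed against.
scores = {
    "ref":   np.array([60.0, 40.0, 80.0, 55.0]),
    "net_a": np.array([65.0, 42.0, 79.0, 60.0]),
    "net_b": np.array([62.0, 45.0, 81.0, 54.0]),
}

def relative_gain(s, ref):
    """Per-task gain relative to the reference, in percent."""
    return 100.0 * (s - ref) / ref

def aRG(s, ref):
    """Average relative gain over the target tasks."""
    return float(np.mean(relative_gain(s, ref)))

def mRG(s, ref):
    """Median relative gain (more robust to outlier tasks)."""
    return float(np.median(relative_gain(s, ref)))

def borda_count(all_scores):
    """Order statistics: on each task, a method earns (n_methods - 1 - rank)
    points, where rank 0 is the best method; points are summed over tasks."""
    names = list(all_scores)
    mat = np.stack([all_scores[n] for n in names])  # (methods, tasks)
    points = {n: 0 for n in names}
    for t in range(mat.shape[1]):
        order = np.argsort(-mat[:, t])              # best method first
        for rank, idx in enumerate(order):
            points[names[idx]] += len(names) - 1 - rank
    return points
```

Borda Count only uses rankings, so it is insensitive to the scale of each task's metric, while aRG/mRG quantify the size of the improvement relative to the reference.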

SLIDE 33

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Image Representations Learned w/ Explicit Supervision
    • Universality in Image Representations Learned w/ Implicit Supervision
    • Universality in Multimodal Representations Learned w/ Implicit Supervision
  • Conclusions
  • Perspectives

SLIDE 34

Starting Point: Semantic Features

SLIDE 35

Starting Point: Semantic Features


  • Implementation of [Ginsca et al., MM'15]
    ○ Independent classifiers (on top of an internal layer of a CNN)
    ○ Generic and specific classifiers

SLIDE 36

Starting Point: Semantic Features

  • Advantages for increasing universality:
    ○ Classes can be added without retraining the whole CNN
    ○ No capacity limit (can cover a large range of data)

SLIDE 37

Starting Point: Semantic Features


  • Increase universality by increasing capacity
SLIDE 38

Starting Point: Semantic Features


  • Increase universality by increasing capacity
  • Problems:
    ○ When N is large, statistical redundancy appears between neurons
      ■ Remedy: sparsity adapted to each sample image (e.g., K = 10 vs. K = 3)
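The per-image sparsity idea can be sketched as keeping only the K strongest classifier scores of each image, with K adaptable per image (the function name and the toy scores below are illustrative, not from the thesis):

```python
import numpy as np

def sparsify_top_k(scores, k):
    """Keep the k largest semantic-classifier scores of one image and
    zero out the rest; k can be adapted to each image's content."""
    out = np.zeros_like(scores)
    idx = np.argsort(-scores)[:k]  # indices of the k strongest concepts
    out[idx] = scores[idx]
    return out

# Semantic feature of one image: the output scores of N concept classifiers.
s = np.array([0.1, 0.9, 0.05, 0.7, 0.3])
sparse = sparsify_top_k(s, k=2)  # only the two strongest concepts survive
```

Zeroing redundant low responses keeps the dimensions that actually describe the image, which mitigates the redundancy that appears when N grows large.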

SLIDE 39

Starting Point: Semantic Features


  • Increase universality by increasing capacity
  • Problems:
    ○ When N is large, statistical redundancy appears between neurons
      ■ Remedy: sparsity adapted to each sample image (e.g., K = 10 vs. K = 3)
    ○ Generic classifiers are under-exploited (high intra-class variance ⇒ low output scores)
      ■ Remedy: boost the outputs of generic classifiers with the scores of their child nodes

SLIDE 40

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Image Representations Learned w/ Explicit Supervision
    • Universality in Image Representations Learned w/ Implicit Supervision
    • Universality in Multimodal Representations Learned w/ Implicit Supervision
  • Conclusions
  • Perspectives

SLIDE 41

Starting Point: Internal layers of CNN

SLIDE 42

Starting Point: Internal layers of CNN


  • Source-problem (SP)
  • Network trained on the SP
    ○ According to a learning strategy + architecture ⇒ a set of learned neurons

SLIDE 43

Proposed Approach: Step 1/4


  • 1. Source Problem Variation (SPV)

○ Automatic variation of raw data (pixels) and/or labels

SLIDE 44

Proposed Approach: Step 2/4


  • 1. Source Problem Variation (SPV)
  • 2. Train new neurons
    ○ One network on each new SP, with the same strategy & architecture

SLIDE 45

Proposed Approach: Step 3/4


  • 1. Source Problem Variation (SPV)
  • 2. Train new neurons
  • 3. Representation

○ Independent normalization

SLIDE 46

Proposed Approach: Step 4/4


  • 1. Source Problem Variation (SPV)
  • 2. Train new neurons
  • 3. Representation
    ○ Independent normalization
    ○ Combination (concatenation) + dimensionality reduction (FSFT)
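Steps 3-4 can be sketched as independent L2 normalization followed by concatenation; the FSFT dimensionality reduction is not reproduced here, and the feature sizes below are arbitrary placeholders:

```python
import numpy as np

def l2_normalize(f, eps=1e-12):
    """Normalize one network's representation independently of the others."""
    return f / (np.linalg.norm(f) + eps)

def combine(features):
    """Concatenate the independently normalized representations of the
    networks trained on the varied source problems."""
    return np.concatenate([l2_normalize(f) for f in features])

rng = np.random.default_rng(0)
f_specific = rng.normal(size=128)  # net trained on the original (specific) SP
f_generic = rng.normal(size=128)   # net trained on a grouped (generic) SP
u = combine([f_specific, f_generic])  # combined representation, dim 256
```

Normalizing each network independently before concatenation prevents one network's feature magnitudes from dominating the combined representation.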

SLIDE 47

Proposed Approach: Step 1/4


  • 1. Source Problem Variation (SPV)

○ Automatic variation of raw data (pixels) and/or labels

SLIDE 48

How to get new SPs?


  • Starting point
    • Images associated with labels
SLIDE 49

New SPs by Grouping-SPV


  • Getting generic labels
  • by random grouping
  • using clustering
  • using an external ontology (e.g., ImageNet, WordNet)
SLIDE 50

New SPs by Grouping-SPV


  • Re-annotation of images
    • according to the obtained generic labels
  • Generic classes contain:
    • more images per class
    • a more diverse set of images
SLIDE 51

Getting Generic Labels according Categorical-Levels


  • Human categorization follows three levels [Rosch, 1978] [Jolicoeur, 1984]
    ○ Concepts mostly known and used by humans:
      ■ Superordinate (vehicle)
      ■ Basic-level (car)
      ■ Subordinate (ford mustang)
  ⇒ Get generic labels according to categorical levels
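Grouping-SPV with categorical levels amounts to re-annotating each specific label at a more generic level; the tiny hierarchy below is a made-up illustration (the thesis derives such groupings from WordNet/ImageNet relations):

```python
# Toy hierarchy: subordinate -> (basic-level, superordinate). Illustrative only.
HIERARCHY = {
    "ford_mustang": ("car", "vehicle"),
    "labrador": ("dog", "animal"),
    "siamese_cat": ("cat", "animal"),
}

def regroup(labels, level):
    """Re-annotate specific labels at a generic categorical level:
    level 0 = basic-level, level 1 = superordinate."""
    return [HIERARCHY[lab][level] for lab in labels]

specific = ["ford_mustang", "labrador", "siamese_cat"]
basic = regroup(specific, 0)          # ['car', 'dog', 'cat']
superordinate = regroup(specific, 1)  # ['vehicle', 'animal', 'animal']
```

Each generic level yields a new source problem with fewer, larger and more diverse classes, on which a new network can be trained.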

SLIDE 52

Experimental Settings


  • Source-task: ILSVRC-half (subset of ImageNet)
  • 10 Target-datasets (classification, many domains, few data)
  • For each class: one-vs-rest SVM classifier
SLIDE 53

Comparison to S.O.T.A

SLIDE 54

Some insights…

  • Comparison to "baseline" universalizing methods
    ○ Reference: specific
    ○ Ours: specific + generic
    ○ Random: 2 specific nets with different initializations
    ○ Multi-task
    ○ Multi-label
    ○ Recursive: fine-tune generic on specific

SLIDE 55

Some insights…

SLIDE 56

Some insights…

  • Comparison of "grouping methods"
    ○ Cognitive knowledge (categorical levels) is useful!

[Chart comparing grouping methods: Random, Clustering, WordNet, Cat-Levels]

SLIDE 57

Deeper networks, More data


  • Comparison with deeper architectures
    ○ AlexNet [Krizhevsky et al., NIPS'12]
    ○ VGG-16 [Simonyan & Zisserman, ICLR'14]
    ○ DarkNet [Redmon et al., CVPR'16]: fully convolutional; base of YOLO
  • Deeper networks are not always more universal
  • Net-G > Net-S with DarkNet
SLIDE 58

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Image Representations Learned w/ Explicit Supervision
    • Universality in Image Representations Learned w/ Implicit Supervision
    • Universality in Multimodal Representations Learned w/ Implicit Supervision
  • Conclusions
  • Perspectives

SLIDE 59

Starting point: Multimodal Representations

  ○ [Wang & Lazebnik, CVPR'16] [Wang & Lazebnik, TPAMI'18]: two-branch networks
  ○ [Salvador et al., CVPR'17]: adding a semantic loss for regularization
  ○ [Zheng et al., ArXiv'17]: dual-path convolutional image-text embedding
  ○ [Engilberge et al., CVPR'18]: semantic-visual embedding with localization

SLIDE 60

Starting point: Multimodal Representations

  • Ranking objective
    ○ Bring positive pairs of data as close as possible
    ○ Push negative pairs of data as far apart as possible
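A minimal sketch of such a margin-based ranking objective on embedding vectors (the margin value and the use of cosine similarity are common choices, assumed here rather than taken from the thesis):

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def ranking_loss(img, txt_pos, txt_neg, margin=0.2):
    """Hinge ranking loss: zero once the matching (positive) pair is more
    similar than the non-matching (negative) pair by at least the margin."""
    return max(0.0, margin - cosine(img, txt_pos) + cosine(img, txt_neg))

img = np.array([1.0, 0.0])
good = ranking_loss(img, txt_pos=np.array([1.0, 0.0]), txt_neg=np.array([0.0, 1.0]))
bad = ranking_loss(img, txt_pos=np.array([0.0, 1.0]), txt_neg=np.array([1.0, 0.0]))
# good == 0.0 (well-ranked pair), bad == 1.2 (mis-ranked pair is penalized)
```

Minimizing this loss over many (image, matching text, non-matching text) triples pulls positive pairs together and pushes negative pairs apart in the shared embedding space.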

SLIDE 61

Proposed Approach: step 1/2

  • 1. Source Problem Variation (SPV)

SLIDE 62

Proposed Approach: step 1/2

  • 1. Source Problem Variation (SPV)
  • 2. Retrain new neurons
    ○ According to a multi-task objective (joint training)

SLIDE 63

New SPs by Grouping-SPV


  • Starting point
  • Complex images and textual descriptions
SLIDE 64

New SPs by Grouping-SPV


  • Getting generic labels
    • Hard to rely on an existing ontology (complex data)
    • Instead: clustering of visual & textual representations
SLIDE 65

Experimental Settings

  • Task: cross-modal retrieval
    • Image annotation
    • Text illustration
  • Metric: Recall (R@K)
  • Dataset: Flickr-30K
  • End-to-end scheme
  • Simple predictors (L2-normalized representations + k-NN)
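Recall@K with L2-normalized representations and nearest-neighbour retrieval can be sketched as below (the embeddings are toy data; each query's ground-truth item is assumed to share its index):

```python
import numpy as np

def recall_at_k(sim, k):
    """R@K: fraction of queries whose ground-truth match (same index)
    appears among the k most similar retrieved items."""
    topk = np.argsort(-sim, axis=1)[:, :k]
    return float(np.mean([i in topk[i] for i in range(sim.shape[0])]))

rng = np.random.default_rng(0)
img = rng.normal(size=(5, 8))
txt = img + 0.01 * rng.normal(size=(5, 8))  # paired text, barely perturbed
img /= np.linalg.norm(img, axis=1, keepdims=True)  # L2 normalization
txt /= np.linalg.norm(txt, axis=1, keepdims=True)
r1 = recall_at_k(img @ txt.T, k=1)  # cosine similarity = dot product here
```

The same similarity matrix transposed evaluates the opposite direction (text illustration instead of image annotation).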

SLIDE 66

Comparison to S.O.T.A

SLIDE 67

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Features Learned with Explicit Supervision
    • Universality in Features Learned with Implicit Supervision
    • Universality via Multimodal Representations
  • Conclusions
  • Perspectives

SLIDE 68

Conclusions

  • A unified framework to tackle the universality of representations
  • A new protocol to evaluate the increase of universality
    • Identified desirable properties
    • 3 new metrics
  • A new approach for learning more universal representations
    • Without additional data
    • Very low annotation cost
    • Relying on cognitive knowledge about human categorization
    • An efficient universalizing & dimensionality-reduction method (FSFT)
  • Extension of the universality question to the multimodal aspect

SLIDE 69

Publications

  • Journals (1 international)
    ○ Tamaazousti, Le Borgne, Popescu, Gadeski, Ginsca and Hudelot, Vision-Language Integration using Constrained Local Semantic Features, CVIU 2017
    ○ Tamaazousti, Le Borgne, Popescu, Gadeski, Ginsca and Hudelot, Déscripteur Sémantique Local Contraint Basé sur un RNC Diversifié, Traitement du Signal, 2017
  • Conferences (5 international)
    ○ Tamaazousti, Le Borgne and Hudelot, MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features, CVPR 2017 (poster)
    ○ Chami*, Tamaazousti*, Le Borgne, AMECON: Abstract Meta Concept Features for Text-Illustration, ICMR 2017 (oral)
    ○ Daher, Besançon, Ferret, Le Borgne, Daquo, and Tamaazousti, Supervised Learning of Entity Disambiguation Models by Negative Sample Selection, CICLing 2017
    ○ Daher, Besançon, Ferret, Le Borgne, Daquo, and Tamaazousti, Désambiguïsation d'entités nommées par apprentissage de modèles d'entités à large échelle, CORIA 2017
    ○ Tamaazousti, Le Borgne and Hudelot, Diverse Concept-Level Features for Multi-Object Classification, ICMR 2016 (oral)
    ○ Tamaazousti, Le Borgne and Popescu, Constrained Local Enhancement of Semantic Features by Content-Based Sparsity, ICMR 2016 (oral)
    ○ Tamaazousti, Le Borgne and Hudelot, Descripteurs à divers niveaux de concepts pour la classification d'images multi-objets, RFIA 2016
    ○ Tamaazousti, Le Borgne and Popescu, Agrégation de descripteurs sémantiques locaux contraints par parcimonie basée sur le contenu, RFIA 2016
  • Patents
    ○ Tamaazousti, Le Borgne and Hudelot, Procédé d'obtention d'un système de labellisation d'images, programme d'ordinateur et dispositif correspondant, système de labellisation d'images, filed INPI N° 1662013, Dec. 2016.

SLIDE 70

Outline

  • State-Of-The-Art (S.O.T.A)
  • Contributions
    • Evaluation of Universality
    • Universality in Features Learned with Explicit Supervision
    • Universality in Features Learned with Implicit Supervision
    • Universality via Multimodal Representations
  • Conclusions
  • Perspectives

SLIDE 71

Perspectives

  • Independent training of networks is costly in number of parameters ⇒ efficient parametrization
    • Decrease #parameters? Pruning [Mallya & Lazebnik, CVPR'18], knowledge distillation [Hinton, ArXiv'15], mapping from a master-net to the others [in manuscript]
    • Learn efficiently? Learning by growing capacity [Wang et al., CVPR'17]
  • Only 1 task among the target-tasks (classification or cross-modal retrieval) ⇒ evaluate on other tasks (detection, segmentation, VQA, etc.)
  • Multimodal representations built on top of fixed image & textual representations ⇒ learn them all together

SLIDE 72

Perspectives

  • In the 2nd technical contribution: Net-G+ < Net-G < Net-S
  • Idea: learn a Net-S+ on even more specific labels (poses, context, etc.)
  • Problem: no such annotations available

[Diagram: Net-G+, Net-G, Net-S]

SLIDE 73

Perspectives

SLIDE 74

Perspectives

[Diagram: Net-G+, Net-G, Net-S, Net-S+]

  • Proposal: BUCBAM
    • Splitting each category
    • A new level added to the ImageNet hierarchy
    • Under review at BMVC; patent filed
  • Results:
    • +5 (avg) compared to Net-S
    • With ensembling: +8 (avg)
  • Beyond grouping or splitting, the most interesting aspect seems to be SPV itself!
  • How to vary the SPs?

SLIDE 75

Commissariat à l’énergie atomique et aux énergies alternatives Institut List | CEA SACLAY NANO-INNOV | BAT. 861 – PC142 91191 Gif-sur-Yvette Cedex - FRANCE www-list.cea.fr Établissement public à caractère industriel et commercial | RCS Paris B 775 685 019

Thank you