Deep Affordance-Grounded Sensorimotor Object Recognition Authors: - PowerPoint PPT Presentation

May 28, 2023 •463 likes •697 views

Deep Affordance-Grounded Sensorimotor Object Recognition Authors: Spyridon Thermos, Georgios Presented By: Th. Papadopoulos, Petros Daras, Thomas Crosley Gerasimos Potamianos UT CS 381V Autumn 2017 Problem Integrate visual appearance

Deep Affordance-Grounded Sensorimotor Object Recognition Authors: Spyridon Thermos, Georgios Presented By: Th. Papadopoulos, Petros Daras, Thomas Crosley Gerasimos Potamianos UT CS 381V Autumn 2017
Problem ● Integrate visual appearance and visual affordance information ● Object + Affordance Classification Hit Using Hammer
Affordances : “the types of actions that humans typically perform when interacting with an object.” Sit Throw Workout https://www.youtube.com/watch?v=V4XW74W9t4o https://www.youtube.com/watch?v=7Qxu5cvW-ds https://www.youtube.com/watch?v=1xS864zYIo8
Related Work Simpler Methods Smaller Data ● Factorial Conditional ● Few objects [1, 2, 3] Random Fields and Binary ● Small number of affordances [1, 2, 3] SVMs [1] ● Ex: 6 objects and 3 affordances [1] ● Gaussian Processes [2] ● SVMs + Clustering [3] [1] [2] [3]
RGB-D Sensorimotor Dataset
RGB-D Sensorimotor Dataset http://sor3d.vcl.iti.gr/wp-content/uploads/2017/03/sor3d.mp4?_=1
RGB-D Sensorimotor Dataset
RGB-D Sensorimotor Dataset Original Input
RGB-D Sensorimotor Dataset Input Processing
RGB-D Sensorimotor Dataset Data Extraction
RGB-D Sensorimotor Dataset ● 14 Object Types ● 13 Affordances ● 54 Interactions ● 105 subjects ● 4 to 8 seconds ● 20,830 instances
Architectures ● Generalized Template-Matching (GTM) ● Model spatial correlations ● Appearance CNN for object detection
Architectures ● Generalized Spatio-Temporal (GST) ● Encode time-evolving procedures ● CNN+LSTM for affordance modeling
Long Short Term Memory Networks (LSTMs) LSTMs: recurrent architecture capable of learning long-term dependencies Image Source: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
LSTMs Core Idea: cell state updated and then passed on at each time step Image Source: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
LSTMs “Forget Gate” “Remember Gate” Image Source: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
LSTMs Image Source: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Fusion ● Given multiple sources of information ● At what point do we combine their features? Image Source: http://cs.stanford.edu/people/karpathy/deepvideo/
Fusion ● GST Architecture ● Combines ○ Appearance ○ Affordance ● (a) Late Fusion ● (b) Slow fusion
Architecture Slow Fusion Multi-Level Late Fusion Late Fusion Fusion at FC at conv
Results Single Stream (Best) Template Matching (Best) Spatio-Temporal
Open Problems ● Authors’ Thoughts ○ NN-Autoencoders for human-object interactions ○ “In-the-wild” object-affordance detection ● Others ○ Affordance identification for control tasks ○ Better temporal sampling schemes

Recommend

Its Not Open Data Unless it is Usable Data Mike Amundsen, API Academy CA / Layer7 @mamund

Its Not Open Data Unless it is Usable Data Mike Amundsen, API Academy CA / Layer7 @mamund affordance rejected affordance perceptible affordance false affordance hidden affordance Usability = Perceived Affordances I'll get back to

1.3k views • 101 slides

The Formalities of Affordance Antony Galton University of Exeter, UK Antony Galton The

ECAI 2010 Workshop on Spatio-Temporal Dynamics 16th August 2010 Lisbon, Portugal The Formalities of Affordance Antony Galton University of Exeter, UK Antony Galton The Formalities of Affordance Introduction: Affordance and Ecological

1.07k views • 88 slides

Reinforcement Learning of Reinforcement Learning of Affordance Cues Affordance Cues Final

Reinforcement Learning of Reinforcement Learning of Affordance Cues Affordance Cues Final Status of Work Final Status of Work Lucas Paletta & Gerald Fritz Lucas Paletta & Gerald Fritz Computational Perception Group Computational

640 views • 16 slides

Seeing the self in the www.hmi.unimore.it washing machine the Deep Affordance of 2.0 philosophy

Human Machine Interaction Group Seeing the self in the www.hmi.unimore.it washing machine the Deep Affordance of 2.0 philosophy in the Deep Affordance of 2.0 philosophy in the household appliance domain www.unimore.it Caterina Calefato

540 views • 15 slides

Response-based Learning for Grounded Grounded SMT Riezler, Machine Translation Simianer, Haas

Response- based Learning for Response-based Learning for Grounded Grounded SMT Riezler, Machine Translation Simianer, Haas Response- based Learning Stefan Riezler, Patrick Simianer, Carolin Haas Grounded SMT Algorithms Department of

440 views • 41 slides

Lear Learning M ning Multi ulti-Moda Modal l Grounded Lingu Grounded Linguistic istic

March 2020 Lear Learning M ning Multi ulti-Moda Modal l Grounded Lingu Grounded Linguistic istic Semantics by Playing I Spy Tong Gao Introduction Early work on grounded language learning enabled a machine to map from

540 views • 24 slides

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

4/13/2017 OOP Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object oriented Programming (Using C++) ht t p: / / www. com pgeom . com / ~pi yush/ t each/ 3330 Objects: State (fields), Behavior (member

635 views • 6 slides

Plan-based Control in an Plan-based Control in an Affordance-based Robot Control

Plan-based Control in an Plan-based Control in an Affordance-based Robot Control Affordance-based Robot Control Architecture Architecture Joachim Hertzberg, Christopher Lrken Institute for Informatics www.inf inf.uos. .uos.de/kbs/

345 views • 22 slides

Active Audition and Sensorimotor Integration for Sound Source Localization Mathieu Bernard 25

Active Audition and Sensorimotor Integration for Sound Source Localization Mathieu Bernard 25 novembre 2011 1/27 Mathieu Bernard - ISIR - BVS Active Audition and Sensorimotor Integration Introduction CIFRE thesis. Co-direction : Patrick

606 views • 27 slides

4. Perceptual Development Throughout the Lifespan 4.1 Sensorimotor Activities 4.2 Sensitive

4. Perceptual Development Throughout the Lifespan 4.1 Sensorimotor Activities 4.2 Sensitive Periods 4.3 Sensory Deprivation 4.4 Habituation 4.5 Sensory Acuity 4.1 Sensorimotor Activities Early perception Infants perceive with hands +

832 views • 20 slides

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition: Detection Alignment Recognition Face detection & alignment Face recognition Face detection & alignment Detection

1.2k views • 50 slides

Outline Introduction Definition History Features When should Grounded Theory be used? Types

Outline Introduction Definition History Features When should Grounded Theory be used? Types of Grounded Theory Process of Grounded Theory Similarities and differences with other qualitative method Data Analysis Introduction-Definition

606 views • 37 slides

TAKE TAKE GROUNDED GROUNDED DECISIONS DECISIONS Farm Modelling Statistic based, gamification

TAKE TAKE GROUNDED GROUNDED DECISIONS DECISIONS Farm Modelling Statistic based, gamification powered analytic tool for business planning, staff evaluation and training As you may know, many factors affect agricultural production around

258 views • 13 slides

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN Todays Class Object Detection The RCNN Object Detector (2014) The Fast RCNN Object Detector (2015) The Faster RCNN Object Detector

747 views • 29 slides

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the Ring! Friends SE01EP02 Recognition: Find the Ring! Instance

1.53k views • 91 slides

Supervised object recognition, unsupervised object recognition then Perceptual organization Bill

Supervised object recognition, unsupervised object recognition then Perceptual organization Bill Freeman, MIT 6.869 April 12, 2005 Readings Brief overview of classifiers in context of gender recognition:

1.48k views • 118 slides

SunyoungKim,PhD Last class Psychological design principles Recap. Psychological

Human-Computer Interaction 17. Design Design Principles (2) SunyoungKim,PhD Last class Psychological design principles Recap. Psychological principles 1. User sees what they expect to see. 2. Users have difficulty focusing on more

963 views • 57 slides

CS449/649: Human-Computer Interaction Winter 2018 Lecture VIII Anastasia Kuzminykh Create

CS449/649: Human-Computer Interaction Winter 2018 Lecture VIII Anastasia Kuzminykh Create Design Ideas Create Ideas Design Create Design Ideas Normans Affordances: Gibsons Affordances: - Offerings or action possibilities in the -

267 views • 25 slides

CSE 440: Introduction to HCI User Interface Design, Prototyping, and Evaluation Lecture 02:

CSE 440: Introduction to HCI User Interface Design, Prototyping, and Evaluation Lecture 02: James Fogarty Design of Everyday Things Daniel Epstein Brad Jacobson King Xia Tuesday/Thursday 10:30 to 11:50 MOR 234 Today Calendar Overview

1.42k views • 89 slides

5/5/2014 1 Peter 1:3-4, NIV "Praise be to God and Father of our Lord Jesus Christ. In his

5/5/2014 1 Peter 1:3-4, NIV "Praise be to God and Father of our Lord Jesus Christ. In his great mercy he has given us new birth into a living hope through the resurrection of Jesus Christ from the dead, and into an inheritance that can

332 views • 5 slides

Affordance Extraction and Inference based on Semantic Role Labeling Daniel Loureiro , Alpio

Affordance Extraction and Inference based on Semantic Role Labeling Daniel Loureiro , Alpio Jorge University of Porto Fact Extraction and Verification (FEVER) Workshop EMNLP 2018 uxdesign.cc Overview 1. Affordances What are they and why are

720 views • 37 slides

An introduction to Markov logic networks and their use in visual relational learning Willie Brink

An introduction to Markov logic networks and their use in visual relational learning Willie Brink Applied Mathematics, Stellenbosch University wbrink@sun.ac.za Thanks to Luc De Raedt and the DTAI research group at KU Leuven 1/20 Elephants are

220 views • 20 slides

Dreams Reoccurring Reoccurring Dreams The Craft of the Book in the Age of the W eb John Maxwell

Dreams Reoccurring Reoccurring Dreams The Craft of the Book in the Age of the W eb John Maxwell Simon Fraser University @jmaxsfu Haig Armen Emily Carr University of Art & Design @haigarmen Books in Browsers IV October 2013 Books

367 views • 34 slides

MetaMenu Adding more interactivity in context menu interactions Emman Kianga | Interaction

MetaMenu Adding more interactivity in context menu interactions Emman Kianga | Interaction Engineering WS 17/18 Motivation / Inspiration Modern computing applications are characterised by the use of context menus to make it easier to

560 views • 11 slides

Deep Affordance-Grounded Sensorimotor Object Recognition Authors: - PowerPoint PPT Presentation

Deep Affordance-Grounded Sensorimotor Object Recognition Authors: Spyridon Thermos, Georgios Presented By: Th. Papadopoulos, Petros Daras, Thomas Crosley Gerasimos Potamianos UT CS 381V Autumn 2017 Problem Integrate visual appearance

Its Not Open Data Unless it is Usable Data Mike Amundsen, API Academy CA / Layer7 @mamund

The Formalities of Affordance Antony Galton University of Exeter, UK Antony Galton The

Reinforcement Learning of Reinforcement Learning of Affordance Cues Affordance Cues Final

Seeing the self in the www.hmi.unimore.it washing machine the Deep Affordance of 2.0 philosophy

Response-based Learning for Grounded Grounded SMT Riezler, Machine Translation Simianer, Haas

Lear Learning M ning Multi ulti-Moda Modal l Grounded Lingu Grounded Linguistic istic

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Plan-based Control in an Plan-based Control in an Affordance-based Robot Control

Active Audition and Sensorimotor Integration for Sound Source Localization Mathieu Bernard 25

4. Perceptual Development Throughout the Lifespan 4.1 Sensorimotor Activities 4.2 Sensitive

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

Outline Introduction Definition History Features When should Grounded Theory be used? Types

TAKE TAKE GROUNDED GROUNDED DECISIONS DECISIONS Farm Modelling Statistic based, gamification

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Supervised object recognition, unsupervised object recognition then Perceptual organization Bill

SunyoungKim,PhD Last class Psychological design principles Recap. Psychological

CS449/649: Human-Computer Interaction Winter 2018 Lecture VIII Anastasia Kuzminykh Create

CSE 440: Introduction to HCI User Interface Design, Prototyping, and Evaluation Lecture 02:

5/5/2014 1 Peter 1:3-4, NIV "Praise be to God and Father of our Lord Jesus Christ. In his

Affordance Extraction and Inference based on Semantic Role Labeling Daniel Loureiro , Alpio

An introduction to Markov logic networks and their use in visual relational learning Willie Brink

Dreams Reoccurring Reoccurring Dreams The Craft of the Book in the Age of the W eb John Maxwell

MetaMenu Adding more interactivity in context menu interactions Emman Kianga | Interaction

Sambuz

Useful Links

Newsletter

Mail Us

Deep Affordance-Grounded Sensorimotor Object Recognition Authors: - PowerPoint PPT Presentation

Deep Affordance-Grounded Sensorimotor Object Recognition Authors: Spyridon Thermos, Georgios Presented By: Th. Papadopoulos, Petros Daras, Thomas Crosley Gerasimos Potamianos UT CS 381V Autumn 2017 Problem Integrate visual appearance

Its Not Open Data Unless it is Usable Data Mike Amundsen, API Academy CA / Layer7 @mamund

The Formalities of Affordance Antony Galton University of Exeter, UK Antony Galton The

Reinforcement Learning of Reinforcement Learning of Affordance Cues Affordance Cues Final

Seeing the self in the www.hmi.unimore.it washing machine the Deep Affordance of 2.0 philosophy

Response-based Learning for Grounded Grounded SMT Riezler, Machine Translation Simianer, Haas

Lear Learning M ning Multi ulti-Moda Modal l Grounded Lingu Grounded Linguistic istic

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Plan-based Control in an Plan-based Control in an Affordance-based Robot Control

Active Audition and Sensorimotor Integration for Sound Source Localization Mathieu Bernard 25

4. Perceptual Development Throughout the Lifespan 4.1 Sensorimotor Activities 4.2 Sensitive

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

Outline Introduction Definition History Features When should Grounded Theory be used? Types

TAKE TAKE GROUNDED GROUNDED DECISIONS DECISIONS Farm Modelling Statistic based, gamification

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Supervised object recognition, unsupervised object recognition then Perceptual organization Bill

SunyoungKim,PhD Last class Psychological design principles Recap. Psychological

CS449/649: Human-Computer Interaction Winter 2018 Lecture VIII Anastasia Kuzminykh Create

CSE 440: Introduction to HCI User Interface Design, Prototyping, and Evaluation Lecture 02:

5/5/2014 1 Peter 1:3-4, NIV &quot;Praise be to God and Father of our Lord Jesus Christ. In his

Affordance Extraction and Inference based on Semantic Role Labeling Daniel Loureiro , Alpio

An introduction to Markov logic networks and their use in visual relational learning Willie Brink

Dreams Reoccurring Reoccurring Dreams The Craft of the Book in the Age of the W eb John Maxwell

MetaMenu Adding more interactivity in context menu interactions Emman Kianga | Interaction

Sambuz

Useful Links

Newsletter

Mail Us

5/5/2014 1 Peter 1:3-4, NIV "Praise be to God and Father of our Lord Jesus Christ. In his