HICO: A Benchmark for Recognizing Human-Object Interactions in Images

Sep 03, 2022 •425 likes •619 views


HICO: A Benchmark for Recognizing Human-Object Interactions in Images
Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, and Jia Deng
ICCV 2015
Presented by Chia-Wen Cheng, Chia-Cheng Hsu
HICO
~47,000 labeled images in 600 human-object interaction (HOI) categories

Object - Verb         Label
sports ball - block   X
sports ball - carry   V
sports ball - hold    V
sports ball - sign    X
wine glass - fill     ?
apple - peel          ?
....
Human-Object Interaction Prediction
Horse-Ride
Horse-Sit on
Evaluate the best proposed model
Pipeline of the DNN Model
AlexNet (pretrained on ImageNet) -> feature vector -> one binary SVM per category
Weird Output Distribution
[Figure: histogram. x-axis: number of predicted labels per image; y-axis: % of test images]
A lot of test images are not predicted as belonging to any category.
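The histogram described above can be reproduced from per-category classifier scores. The following is a minimal sketch, assuming a label counts as predicted when its score exceeds a threshold of 0 (as with a binary SVM margin); the function name and the toy scores are hypothetical and not taken from the slides.

```python
from collections import Counter


def label_count_distribution(scores, threshold=0.0):
    """Return {k: percent of images with exactly k predicted labels}.

    scores[i][c] is the classifier score of image i for category c.
    A category counts as predicted when its score exceeds `threshold`
    (an assumption; the slides do not state the decision rule).
    """
    counts = Counter(sum(1 for s in row if s > threshold) for row in scores)
    total = len(scores)
    return {k: 100.0 * n / total for k, n in sorted(counts.items())}


# Toy example: 4 images, 3 categories (hypothetical scores).
scores = [
    [-1.2, -0.3, -0.8],  # no category predicted
    [0.5, -0.1, -0.9],   # one category predicted
    [-0.4, -0.2, -0.6],  # no category predicted
    [0.7, 0.2, -0.3],    # two categories predicted
]
dist = label_count_distribution(scores)
```

A large share at key 0 corresponds to the observation on the slide that many test images receive no label at all.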
Long Tail Distribution of Categories
Weighted Loss for Unbalanced Dataset
Binary classifier for Class 1: positive samples come from Class 1; negative samples come from Classes 2, 3, …, 600.
Total Loss = w_p * loss on positive samples + w_n * loss on negative samples
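The weighted total loss can be sketched in a few lines. The slides do not say which per-sample loss is used, so the hinge loss below is an assumption (chosen because the pipeline uses binary SVMs), and the function names and toy values are hypothetical.

```python
def hinge(margin):
    """Hinge loss for a signed margin (assumed per-sample loss)."""
    return max(0.0, 1.0 - margin)


def weighted_binary_loss(scores, labels, w_p, w_n, per_sample_loss=hinge):
    """Total loss = w_p * loss on positives + w_n * loss on negatives.

    scores: raw classifier outputs for one binary (one-vs-rest) classifier.
    labels: +1 for positive samples, -1 for negative samples.
    """
    pos = sum(per_sample_loss(s) for s, y in zip(scores, labels) if y == 1)
    neg = sum(per_sample_loss(-s) for s, y in zip(scores, labels) if y == -1)
    return w_p * pos + w_n * neg


# Toy example: one positive and two negatives (hypothetical values).
scores = [0.5, -2.0, 0.25]
labels = [1, -1, -1]
unweighted = weighted_binary_loss(scores, labels, w_p=1.0, w_n=1.0)
weighted = weighted_binary_loss(scores, labels, w_p=10.0, w_n=1.0)
```

Raising w_p relative to w_n makes an error on a rare positive sample cost more than an error on one of the many negatives, which is the motivation for weighting on a long-tailed dataset.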
Experiments on w_p/w_n

w_p/w_n   mAP (%)
1         18.58
3         19.05
10        19.39
30        19.24
Our Implementation: End-to-End Network
Multi-Label Classification
CNN -> logistic sigmoid layer -> cross entropy against the ground-truth 0/1 label vector
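The multi-label objective can be illustrated with a small standard-library sketch: each class gets an independent sigmoid, and binary cross entropy is computed against the 0/1 ground truth. Averaging over classes (rather than summing) and the clipping constant `eps` are assumptions, and the function names are hypothetical rather than taken from the presenters' code.

```python
import math


def sigmoid(z):
    """Numerically stable logistic sigmoid."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def multilabel_cross_entropy(logits, targets, eps=1e-12):
    """Mean binary cross entropy over classes for one image.

    logits: raw network outputs, one per class.
    targets: ground-truth labels, each 0 or 1 (several may be 1).
    """
    total = 0.0
    for z, t in zip(logits, targets):
        p = min(max(sigmoid(z), eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(logits)


# Toy example with 3 classes (hypothetical values).
uninformed = multilabel_cross_entropy([0.0, 0.0, 0.0], [0, 1, 1])
confident = multilabel_cross_entropy([-4.0, 4.0, 4.0], [0, 1, 1])
```

Unlike a softmax, the sigmoid outputs do not have to sum to one, so an image can be assigned several interaction categories, or none.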
Experimental Setting
CNN Model:
● Inception v3
● softmax layer -> logistic sigmoid layer
● number of classes -> 600
Training:
● Use pretrained model on ImageNet
● Fine-tune only the last layer
● Optimizer: Adam
● Learning rate: 0.001
● Batch size: 64
● Epochs: 10
Source Code
● Implemented in TensorFlow
● TF-Slim Library
● GitHub: https://github.com/chiawen/multi-label-classification-hico
Performance

Method                                   mAP (%)
DNN (fine-tune O)                        19.38
DNN (ImageNet) + weighted loss (ours)    19.39
Inception V3 + fine-tune (ours)          26.31
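For readers unfamiliar with the metric, mAP is the average precision (AP) of each category's ranking, averaged over categories. The sketch below is a simplified, non-interpolated variant; the official HICO evaluation may treat ambiguous or unknown labels differently, and the function names and toy data are hypothetical.

```python
def average_precision(scores, labels):
    """AP for one category: mean precision at the rank of each positive."""
    ranked = sorted(zip(scores, labels), key=lambda p: p[0], reverse=True)
    hits, precisions = 0, []
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(score_matrix, label_matrix):
    """Mean of per-category AP; matrices are indexed [image][category]."""
    n_classes = len(score_matrix[0])
    aps = []
    for c in range(n_classes):
        col_scores = [row[c] for row in score_matrix]
        col_labels = [row[c] for row in label_matrix]
        aps.append(average_precision(col_scores, col_labels))
    return sum(aps) / len(aps)


# Toy example: 3 images, 2 categories (hypothetical values).
score_matrix = [[0.9, 0.2], [0.8, 0.7], [0.1, 0.6]]
label_matrix = [[1, 0], [0, 1], [1, 0]]
m_ap = mean_average_precision(score_matrix, label_matrix)
```

Because every category contributes equally to the mean, rare categories in the long tail affect mAP as much as frequent ones.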
Related Work
Performance of HICO Benchmark
Arun Mallya and Svetlana Lazebnik. Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering. In ECCV, 2016.

Method                                   mAP (%)
DNN (fine-tune O)                        19.38
DNN (ImageNet) + weighted loss (ours)    19.39
Inception V3 + fine-tune (ours)          26.31
