joint detection and segmentation with deep hierarchical
play

JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS Zhao Chen Machine Learning Intern, NVIDIA ABOUT ME 5th year PhD student in physics @ S tanford by day, deep learning computer vision scientist by night. Intern with Deep

0 downloads 5 Views 2,65 MB Size Report
  1. JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS Zhao Chen Machine Learning Intern, NVIDIA

  2. ABOUT ME • 5th year PhD student in physics @ S tanford by day, deep learning computer vision scientist by night. • Intern with Deep Learning Applied Research (Autonomous Vehicles) @ NVIDIA, Oct-Dec 2016. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 2

  3. TALK OVERVIEW (1) Problem statement and summary. (2) Dataset and preliminaries. (3) Model motivation. (4) Results and visualizations. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 3

  4. TALK OVERVIEW (1) Problem statement and summary. (2) Dataset and preliminaries. (3) Model motivation. (4) Results and visualizations. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 4

  5. FROM SINGLE TO MULTITASK LEARNING Putting deep learning to work in the real world . . . Detection Model Obj ect Bounding Boxes . . . S egmentation Model S egmentation Mask 5

  6. FROM SINGLE TO MULTITASK LEARNING Putting deep learning to work in the real world . . . Detection Model Obj ect Bounding Boxes . . . S egmentation Model S egmentation Mask Poor scalability + inefficient use of information! 6

  7. FROM SINGLE TO MULTITASK LEARNING Putting deep learning to work in the real world How do we use one model to perform multiple tasks faster and better? Obj ect Bounding Boxes . . . S hared Model S egmentation Mask 7

  8. FROM SINGLE TO MULTITASK LEARNING Putting deep learning to work in the real world How do we use one model to perform multiple tasks faster and better? Obj ect Bounding Boxes . . . S hared Model S egmentation Mask + edge detection, + surface normals, + distance estimation… 8

  9. FROM SINGLE TO MULTITASK LEARNING Putting deep learning to work in the real world How do we use one model to perform multiple tasks faster and better? Obj ect Bounding Boxes . . . S hared Model S egmentation Mask How do you relate various tasks to each other in a multi-task neural network? 9

  10. WHAT WE WILL SHOW • By ordering tasks based on receptive field and information density , we improve segmentation and detection accuracy by ~2% and ~8% over single networks, respectively. • The j oint network is robust and easy to tune compared to non-hierarchical baselines. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 10

  11. TALK OVERVIEW (1) Problem statement and summary. (2) Dataset and preliminaries. (3) Model motivation. (4) Results and visualizations. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 11

  12. CITYSCAPES DATASET • 2975 Training Images @ resolution 1024 x 2048. • 20 classes for semantic segmentation, including 8 obj ect classes. Of these 8, 4 are much more represented (car, bicycle, person, rider): the “ easy classes.” • Both segmentation, bounding box, and edge ground truth can be generated. S emantic Raw S eg. Image Edge Bounding Detection Box 12

  13. HOW TO TRAIN A SEGMENTATION NETWORK • S tandard FCN (S helhamer 2015) Architecture: Convolutions followed by a deconvolution to retrieve a pixel-dense prediction mask. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 13

  14. HOW TO TRAIN A DETECTION NETWORK • Network outputs confidence that a pixel lies near the center of an obj ect. • Points of high confidence produce bounding box coordinates. • Confidences are rougher than full segmentation but robust to occlusion. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 14

  15. TALK OVERVIEW (1) Problem statement and summary. (2) Dataset and preliminaries. (3) Model motivation. (4) Results and visualizations. Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 15

  16. Input (1024 x S hared Feature Map (from base CNN) 2048) Low-Res S eg Obj . Confidence Bbox Coordinate Predictions Positions Positions (W x H x 20) Deconv L = α L seg + (1- α )L det Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 16

  17. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 17

  18. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 18

  19. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 19

  20. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 20

  21. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 21

  22. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 22

  23. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 23

  24. OUR BASELINE MODEL PERFORMANCE S eg. Weight = α Det. Weight ( α controls how much attention we pay to segmentation vs detection at training) Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 24

  25. A LABEL HIERARCHY ALONG TWO AXES Required Receptive Field Obj ect Bounding Boxes Density of Information Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 25

  26. A LABEL HIERARCHY ALONG TWO AXES Required Receptive Field Obj ect Bounding Boxes Obj ect Confidence Density of Information Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 26

  27. A LABEL HIERARCHY ALONG TWO AXES Required Receptive Field Obj ect Bounding Boxes Obj ect Confidence S emantic S egmentation Density of Information Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 27

  28. A LABEL HIERARCHY ALONG TWO AXES Required Receptive Field Obj ect Bounding Boxes Edge Detection (plus) Obj ect Confidence S emantic S egmentation Density of Information Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 28

  29. Input (1024 x S hared Feature Map (from base CNN) 2048) Low-Res S eg Obj . Confidence Bbox Coordinate Predictions Positions Positions (W x H x 20) Deconv Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 29

  30. Input (1024 x S hared Feature Map (from base CNN) 2048) S egmentation Obj . Confidence Obj . BBox Features Features Features Low-Res S eg Obj . Confidence Bbox Coordinate Predictions Positions Positions (W x H x 20) Deconv Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 30

  31. Input (1024 x S hared Feature Map (from base CNN) 2048) S egmentation Obj . Confidence Obj . BBox Features Features Features Low-Res S eg Obj . Confidence Bbox Coordinate Predictions Positions Positions (W x H x 20) Deconv Decreasing information density Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 31

  32. Input (1024 x S hared Feature Map (from base CNN) 2048) Edge S egmentation Obj . Confidence Obj . BBox Features Features Features Features Low-Res Edge Low-Res S eg Obj . Confidence Bbox Coordinate Predictions Predictions Positions Positions (W x H x 3) (W x H x 20) Deconv Deconv Decreasing information density Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 32

  33. Input (1024 x S hared Feature Map (from base CNN) 2048) Edge S egmentation Obj . Confidence Obj . BBox Features Features Features Features Low-Res Edge Low-Res S eg Obj . Confidence Bbox Coordinate Predictions Predictions Positions Positions (W x H x 3) (W x H x 20) Deconv Deconv Decreasing information density Zhao Chen, Joint Det ect ion and S egment at ion with Deep Hierarchical Net works, GTC 2017. 33

Recommend Documents


detection and segmentation
Detection and Segmentation CS60010:

Detection and Segmentation CS60010: Deep Learning Abir Das IIT Kharagpur

detection and segmentation
Detection and Segmentation CS60010:

Detection and Segmentation CS60010: Deep Learning Abir Das IIT Kharagpur

contour detection and hierarchical image segmentation
Contour Detection and Hierarchical

Contour Detection and Hierarchical Image Segmentation P. Arbelaez, M. Maire,

object detection and segmentation from joint embedding of
Object Detection and Segmentation from

Object Detection and Segmentation from Joint Embedding of Parts and Pixels

introduction to object detection amp image segmentation
Introduction to Object Detection &

Introduction to Object Detection & Image Segmentation Abel Brown

detection segmentation overview object detection
Detection, Segmentation Overview

CS6501: Deep Learning for Visual Recognition Detection, Segmentation Overview

experiments from paper on hierarchical video segmentation
Experiments from paper on Hierarchical

Experiments from paper on Hierarchical Video Segmentation February 17, 2016

semantic segmentation
Semantic segmentation Image

Accel : A Corrective Fusion Network for Efficient Semantic Segmentation on

semantic segmentation instance segmentation based on deep
Semantic Segmentation / Instance

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding

comparing objective functions for segmentation and
Comparing Objective Functions for

Comparing Objective Functions for Segmentation and Detection of Microaneurysms

imperial oil resources ltd type a water licence hearing
Imperial Oil Resources Ltd. Type A

Imperial Oil Resources Ltd. Type A Water Licence Hearing S13L1-007 Norman

amendments to the medical practitioners and dentists act
AMENDMENTS TO THE MEDICAL

AMENDMENTS TO THE MEDICAL PRACTITIONERS AND DENTISTS ACT AND PROPOSED RULES,

small scale cannabis businesses in bc municipalities
Small Scale Cannabis Businesses in BC

Small Scale Cannabis Businesses in BC Municipalities: Micro Licences and the

q3 2018 financial review disclaimer
Q3 2018 Financial Review Disclaimer

MIL-QOD007-02112015-131227/MGadg 28 th November 2018 Q3 2018 Financial Review

finance is fun
Finance is Fun! Laura Stein, Director

Finance is Fun! Laura Stein, Director of Business and Finance Presentation

population
POPULATION, ITS ABOUT CONSUMPTION

ITS NOT ABOUT POPULATION, ITS ABOUT CONSUMPTION Source: Oxfam The welfare

the lhcf experiment
The LHCf experiment Koji Noda (INFN

The LHCf experiment Koji Noda (INFN Catania) on behalf of the LHCf

signs of relationship abuse intense interest at first
Signs Of Relationship Abuse Intense

Signs Of Relationship Abuse Intense Interest -At first this is flattering, but

hillcrest washington coles voluntary real estate
Hillcrest / Washington Coles Voluntary

Hillcrest / Washington Coles Voluntary Real Estate Acquisition &

talking points of new proposed 2704 regulations
TALKING POINTS OF NEW PROPOSED 2704

CPAs & ADVISORS experience direction // TALKING POINTS OF NEW PROPOSED

make it easy for the irs to say yes
MAKE IT EASY FOR THE IRS TO SAY YES

MAKE IT EASY FOR THE IRS TO SAY YES Texas Land Trust Conference March 2, 2017

the family office exclusion from the
The Family Office Exclusion from the

The Family Office Exclusion from the July 19, 2011 Definition of Investment