Beyond RetinaNet and Mask R-CNN Gang Yu yugang@megvii.com Outline - PowerPoint PPT Presentation

Beyond RetinaNet and Mask R-CNN Gang Yu yugang@megvii.com

Outline • Modern Object detectors • One Stage detector vs Two-stage detector • Challenges • Backbone • Head • Scale • Batch Size • Crowd • Conclusion

Modern Object detectors Postprocess Backbone Head NMS • Modern object detectors • RetinaNet • f1-f7 for backbone, f3-f7 with 4 convs for head • FPN with ROIAlign • f1-f6 for backbone, two fcs for head • Recall vs localization • One stage detector: Recall is high but compromising the localization ability • Two stage detector: Strong localization ability

One Stage detector: RetinaNet • FPN Structure • Focal loss Focal Loss for Dense Object Detection ， Lin etc, ICCV 2017 Best student paper

Two-Stage detector: FPN/Mask R-CNN • FPN Structure • ROIAlign Mask R-CNN ， He etc, ICCV 2017 Best paper

What is next for object detection? • The pipeline seems to be mature • There still exists a large gap between existing state-of-arts and product requirements • The devil is in the detail

Challenges Overview • Backbone • Head • Scale • Batch Size • Crowd Postprocess Backbone Head NMS

Challenges - Backbone • Backbone network is designed for classification task but not for localization task • Receptive Field vs Spatial resolution • Only f1-f5 is pretrained but randomly initializing f6 and f7 (if applicable)

Backbone - DetNet • DetNet: A Backbone network for Object Detection, Li etc, 2018, https://arxiv.org/pdf/1804.06215.pdf

Backbone - DetNet

Challenges - Head • Speed is significantly improved for the two-stage detector • RCNN - > Fast RCNN -> Faster RCNN - > RFCN • How to obtain efficient speed as one stage detector like YOLO, SSD? • Small Backbone • Light Head

Head – Light head RCNN • Light-Head R-CNN: In Defense of Two-Stage Object Detector, 2017, https://arxiv.org/pdf/1711.07264.pdf

Challenges - Scale • Scale variations is extremely large for object detection

Challenges - Scale • Scale variations is extremely large for object detection • Previous works • Divide and Conquer: SSD, DSSD, RON, FPN, … • Limited Scale variation • Scale Normalization for Image Pyramids, Singh etc, CVPR2018 • Slow inference speed • How to address extremely large scale variation without compromising inference speed?

Scale - SFace • SFace: An Efficient Network for Face Detection in Large Scale Variations, 2018, http://cn.arxiv.org/pdf/1804.06559.pdf

Challenges - Batchsize • Small mini-batchsize for general object detection • 2 for R-CNN, Faster RCNN • 16 for RetinaNet, Mask RCNN • Problem with small mini-batchsize • Long training time • Insufficient BN statistics • Inbalanced pos/neg ratio

Batchsize – MegDet • MegDet: A Large Mini-Batch Object Detector, CVPR2018, https://arxiv.org/pdf/1711.07240.pdf

Challenges - Crowd • NMS is a post-processing step to eliminate multiple responses on one object instance • Reasonable for mild crowdness like COCO and VOC • Will Fail in the case when the objects are in a crowd

Crowd - CrowdHuman • CrowdHuman: A Benchmark for Detecting Human in a Crowd, 2018, https://arxiv.org/pdf/1805.00123.pdf

Introduction to Face++ Detection Team • Category-level Recognition • Detection • Face Detection: • FAN: https://arxiv.org/pdf/1711.07246.pdf • Sface: https://arxiv.org/pdf/1804.06559.pdf • Human Detection: • Repulsion loss: https://arxiv.org/abs/1711.07752 • CrowdHuman: https://arxiv.org/pdf/1805.00123.pdf • General Object Detection: • Light Head: https://arxiv.org/pdf/1711.07264.pdf https://github.com/zengarden/light_head_rcnn • MegDet: https://arxiv.org/pdf/1711.07240.pdf • DetNet: https://arxiv.org/pdf/1804.06215.pdf • Segmentation • Large Kernel Matters: https://arxiv.org/pdf/1703.02719.pdf • DFN: https://arxiv.org/pdf/1804.09337.pdf • Skeleton: • CPN: https://arxiv.org/pdf/1711.07319.pdf • https://github.com/chenyilun95/tf-cpn

Thanks

Beyond RetinaNet and Mask R-CNN Gang Yu yugang@megvii.com Outline - PowerPoint PPT Presentation

Beyond RetinaNet and Mask R-CNN Gang Yu yugang@megvii.com Outline Modern Object detectors One Stage detector vs Two-stage detector Challenges Backbone Head Scale Batch Size Crowd Conclusion Modern

Object Detection in Recent 3 Years Beyond RetinaNet and Mask R-CNN Gang Yu

1. procedure ONE TO ALL BC( d , my id , X ) 2. begin mask := 2 d 1; 3. /* Set all d bits of

CS7015 (Deep Learning) : Lecture 12 Object Detection: R-CNN, Fast R-CNN, Faster R-CNN, You Only

Object Detection using R-CNN Experiments CS381V: Visual Recognition, Spring 2016 William Xie

WHOLEHEARTED Digging Deeper to Broaden Our Reach WE WEAR THE MASK We Wear the Mask BY PAUL

Single mask technology implementation Piotr Bielwka 10 th RD51 Stony Brook Single mask

A C N A I B Enhance Skin complexion Enhance Skin complexion Bianca Facial Mask Enhanced

Classless Subnetting Explained When given an IP Address, Major Network Mask, and a Subnet Mask,

BLACK SOAP GHASSOUL MASK CLAY MASK White Clay Green Clay MASSAGE OIL ARGAN OIL ESSENCE WATER

Critical Contact NIV mask fitting workshop Therapeutic Care October 2018 Learning objectives

Development of a unique reusable safety respirator The Elipse Half-Face Mask represents a major

Mask R-CNN OBJECT INSTANCE SEGMENTATION AND HUMAN POSE ESTIMATION Kaiming He Georgia Gkioxari

Mask R-CNN By Kaiming He, Georgia Gkioxari, Piotr Dollar and Ross Girshick Presented By Aditya

Decay vertex ID using CNN for p K+ Aaron Higuera University of Houston CNN Tools on

CNN Ba CNN Based ed Pi Pipeline peline for or Op Optical ical Fl Flow ow Tal Schuster,

CENG5030 Part 2-1: Introduction to Convolutional Nueral Network Bei Yu (Latest update: March 4,

LSE-239 and LSE-309 Summit-Base Tiger Team Kian-Tat Lim #lsst2018 #lsst2018 LSST Project and

Distribution Backbone Project July 14, 2011 Part 1 One Backbone: Two Sony Initiatives Although

Framework for Transformative Community Change Shiloh Turner President Executive Philanthropy

LIVE UNITED Mission United Orlando Backbone Support for Veteran Engagement Presenter: Laura

SSi Micro Ltd. Presentation to the House of Commons Standing Committee on Industry, Science and

PACE PREPARATORY ACADEMY The journey toward excellence and NCA CASI Accreditation OUR STORY

Collective Impact Network 2016 PAN Fall Conference Presented By: J. Evin Jones Backbone Support

NEWARK, NJ PRESENTED BY MONIQUE BAPTISTE-GOOD 1 What is Our Systems Approach? Operate as a