1 10 dive deeper into box for object detection
play

1 / 10 Dive Deeper Into Box for Object Detection Ran Chen 1 , Yong - PowerPoint PPT Presentation

1 / 10 Dive Deeper Into Box for Object Detection Ran Chen 1 , Yong Liu 2 , Mengdan Zhang 2 , Shu Liu 3 , Bei Yu 1 , Yu-Wing Tai 4 1 The Chinese University of Hong Kong 2 Tencent Youtu Lab 3 SmartMore 4 The Hong Kong University of Science and


  1. 1 / 10

  2. Dive Deeper Into Box for Object Detection Ran Chen 1 , Yong Liu 2 , Mengdan Zhang 2 , Shu Liu 3 , Bei Yu 1 , Yu-Wing Tai 4 1 The Chinese University of Hong Kong 2 Tencent Youtu Lab 3 SmartMore 4 The Hong Kong University of Science and Technology 2 / 10

  3. Introduction Box Decomposition and Recombination ◮ Reorganizing boundaries of boxes during training. ◮ Optimal boxes with tightening instances provide better localization. Semantic inconsistency in annotations ◮ Backgrounds regarded as positive pixels are the noise for the training. ◮ A self adaptive module is approached to tackle this problem. 3 / 10

  4. <latexit sha1_base64="GAFXR/1Om5fGSedTSPnsn0bcvsE=">ACFHicbZBNSwMxEIazftb6terRS7AIglB2a0XxIXjypWhbaUbHZWQ7PZJZkVy9If4cW/4sWDIl49ePfmNaK2vpC4OGdGSbzBqkUBj3vwxkbn5icmi7MFGfn5hcW3aXlc5NkmkONJzLRlwEzIWCGgqUcJlqYHEg4SJoH/bqFzegjUjUGXZSaMbsSolIcIbWarmbDYRbzE+Zanf3aCMEiay1Rfe/0f/BSsteWvLzoK/gBKZKDjlveCBOexaCQS2ZM3fdSbOZMo+ASusVGZiBlvM2uoG5RsRhM+8f1aXr1glplGj7FNK+3siZ7ExnTiwnTHDazNc65n/1eoZRrvNXKg0Q1D8a1GUSYoJ7SVEQ6GBo+xYFwL+1fKr5lmHG2ORuCP3zyKJxXyn61vH1SLR1UBnEUyCpZIxvEJzvkgByRY1IjnNyRB/JEnp1759F5cV6/WsecwcwK+SPn7ROSH50x</latexit> <latexit sha1_base64="m+AOn2kXJDEyvOaMqmUcRt7XPis=">AB9XicbVDLSgNBEOyNrxhfUY9eBoMYQcJuiOgx4MVjJOYBybrMTmaTIbOzy8ysGpb8hxcPinj1X7z5N04eB0saCiqunu8mPOlLbtbyuzsrq2vpHdzG1t7+zu5fcPmipKJKENEvFItn2sKGeCNjTnLZjSXHoc9ryh9cTv/VApWKRuNOjmLoh7gsWMIK1ke67IX4q1j3nvH7q2WdevmCX7CnQMnHmpABz1Lz8V7cXkSkQhOleo4dqzdFEvNCKfjXDdRNMZkiPu0Y6jAIVuOr16jE6M0kNBJE0Jjabq74kUh0qNQt90hlgP1KI3Ef/zOokOrtyUiTjRVJDZoiDhSEdoEgHqMUmJ5iNDMJHM3IrIAEtMtAkqZ0JwFl9eJs1yamULm4rhWp5HkcWjuAYiuDAJVThBmrQAISnuEV3qxH68V6tz5mrRlrPnMIf2B9/gBNYpEJ</latexit> Box Decomposition and Recombination Decomposition: Ranking: Recombination: Assignment: S 0 S 0 max( S 1 , S 0 0 ) 1 1 S 1 S 1 S 0 S 0 1 1 S 0 S 0 0 0 S 0 S 0 1 1 S 0 S 0 S 0 S 0 S 0 S 0 0 0 2 2 S 0 S 0 S 0 S 0 2 2 1 1 S 0 S 0 S 0 S 0 S 2 S 2 S 0 S 0 1 1 S 0 S 0 2 2 1 1 S 0 S 0 S 2 S 2 S 2 S 2 S 0 S 0 S 0 S 0 S 2 S 2 0 0 1 1 δ 3 S 2 S 2 S 0 S 0 S 0 S 0 δ 1 S 0 S 0 S 0 S 0 S 0 S 0 2 2 S 2 S 2 S 0 S 0 S 1 S 1 0 0 0 0 S 0 S 0 S 1 S 1 δ 2 2 2 S 0 S 0 S 0 S 0 Rank : δ 3 > δ 1 > δ 2 2 2 S 0 S 0 right I S 1 S 1 S 1 S 1 0 0 S 0 S 0 2 2 (a) (b) (c) (d) S 1 S 1 n L IoU = − 1 � � log( IoU ( p i , p ∗ I )) , (1) N pos I i IoU = 1 � I > S I } L IoU ( B ′ L D & R ( 1 { S ′ I , T I ) N pos (2) I + 1 { S I � S ′ I } L IoU ( B I , T I )) , 4 / 10

  5. Box Decomposition and Recombination 0 . 9 0 . 7 IoU 0 . 5 IoU w/ D&R IoU w/o D&R 0 . 3 0 2 4 6 8 10 12 epoch ◮ Boxes optimized by D&R have higher IoU scores and lower variances. 5 / 10

  6. Box Decomposition and Recombination 6 / 10

  7. Semantic Consistency � R I ↓ ← negative ,  C I ↓  � R I ↑ ← positive , C I ↑ (3)  x g c i = max j = 0 ( c j ) ∈ C I , Settings AP AP 50 AP 75 AP S AP M AP L None 33.6 53.1 35.0 18.9 38.2 43.7 PN 34.2 53.2 36.3 20.8 38.9 44.2 PNI 33.7 53.0 35.5 17.9 38.3 44.1 Ours 35.3 55.4 37.1 20.9 39.6 45.9 C I ↑ R I ↑ positive set negative set 7 / 10

  8. Semantic Consistency 8 / 10

  9. Results Modules AP AP 50 AP 75 AP S AP M AP L Baseline D&R Consistency � 33.6 53.1 35.0 18.9 38.2 43.7 � � 34.8 54.0 36.4 19.7 39.0 44.9 � � 37.2 55.4 39.5 21.0 41.7 48.6 � � � 38.0 56.5 40.8 21.6 42.4 50.4 9 / 10

  10. Results Method Backbone AP AP 50 AP 75 AP S AP M AP L Two-stage methods: Faster R-CNN w/ FPN ResNet-101-FPN 36.2 59.1 39.0 18.2 39.0 48.2 Faster R-CNN w/ TDM Inception-ResNet-v2-TDM 36.8 57.7 39.2 16.2 39.8 52.1 Faster R-CNN by G-RMI Inception-ResNet-v2 34.7 55.5 36.7 13.5 38.1 52.0 RPDet ResNet-101-DCN 42.8 65.0 46.3 24.9 46.2 54.7 Cascade R-CNN ResNet-101 42.8 62.1 46.3 23.7 45.5 55.2 One-stage methods: YOLOv2 DarkNet-19 21.6 44.0 19.2 5.0 22.4 35.5 SSD ResNet-101 31.2 50.4 33.3 10.2 34.5 49.8 DSSD ResNet-101 33.2 53.3 35.2 13.0 35.4 51.1 FSAF ResNet-101 40.9 61.5 44.0 24.0 44.2 51.3 RetinaNet ResNet-101-FPN 39.1 59.1 42.3 21.8 42.7 53.9 CornerNet Hourglass-104 40.5 56.5 43.1 19.4 42.7 53.9 ExtremeNet Hourglass-104 40.1 55.3 43.2 20.3 43.2 53.1 FCOS † ResNet-101-FPN 41.5 60.7 45.0 24.4 44.8 51.6 FCOS † ResNeXt-64x4d-101-FPN 43.2 62.8 46.6 26.5 46.2 53.3 FCOS † w/improvements ResNeXt-64x4d-101-FPN 44.7 64.1 48.4 27.6 47.5 55.6 DDBNet (Ours) ResNet-101-FPN 42.0 61.0 45.1 24.2 45.0 53.3 DDBNet (Ours) ResNeXt-64x4d-101-FPN 43.9 63.1 46.7 26.3 46.5 55.1 DDBNet (Ours) § ResNeXt-64x4d-101-FPN 45.5 64.5 48.5 27.8 47.7 57.1 10 / 10

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend