cosypose consistent multi view multi object 6d pose
play

CosyPose: Consistent multi-view multi-object 6D pose estimation - PowerPoint PPT Presentation

6th International Workshop on Recovering 6D Object Pose CosyPose: Consistent multi-view multi-object 6D pose estimation arXiv:2008.08465 Yann Labb 1,2 , Justin Carpentier 1,2 , Mathieu Aubry 4 , Josef Sivic 1,2,3 1 Inria 2 DI ENS, PSL 3 CIIRC,


  1. 6th International Workshop on Recovering 6D Object Pose CosyPose: Consistent multi-view multi-object 6D pose estimation arXiv:2008.08465 Yann Labbé 1,2 , Justin Carpentier 1,2 , Mathieu Aubry 4 , Josef Sivic 1,2,3 1 Inria 2 DI ENS, PSL 3 CIIRC, CTU in Prague 4 ENPC

  2. Multi-view 6D pose estimation Output 3D scene Input images

  3. CosyPose: Approach overview Single-view 6D pose estimation Robust multi-view multi-object reconstruction ... BOP 20 ... Challenge ...

  4. Single-view CosyPose 2D detection 6D pose 6D pose estimation Mask-RCNN 2D detections Coarse Refiner network network 6D pose estimation Coarse Refiner network network 6D pose estimation Input RGB image Coarse Refiner network network (only 3 networks trained per dataset)

  5. Pose estimation networks DeepIM, Li et al, ECCV 2018 + Network + Rotation parametrization + Loss + Data augmentation Input “canonical” pose (details in the paper arXiv:2008.08465) Input “coarse” pose “Refined” pose Pose update CNN CNN coarse refiner

  6. Key ingredients e vsd < 0.3 T-LESS Without data augmentation 63. 63.8 60 7 37. 37. 40 37.0 37.0 0 29. 0 29.5 29.5 5 20 0 Ours w/o Ours with data Pix2Pose data augmentation augmentation (more ablations in the paper, Pix2Pose, Park et al, ICCV 2019 Sec 3 Table 1b)

  7. Key ingredients e vsd < 0.3 T-LESS With data augmentation 63. 63.8 60 7 37. 37. 40 37.0 37.0 0 29. 0 29.5 29.5 5 20 0 Ours w/o Ours with data Pix2Pose data augmentation augmentation (more ablations in the paper, Pix2Pose, Park et al, ICCV 2019 Sec 3 Table 1b) + Access to a GPU cluster* training 1 pose network: ~10 hours on 32 GPUs *Jean-zay, French national cluster managed by GENCI-IDRIS

  8. Input image Predicted poses 3D visualization

  9. BOP20 results RGB-D BlenderProc: Denninger, Sundermeyer, RGB [1] Winkelbauer, Olefir, Hodan, Zidan, Elbadrawy, AR core (7 datasets) Knauer, Katam, Lodhi in RSS workshops. Synt (PBR [1]) Synt+Real [2] EPOS, Hodan et al, CVPR 2020 [3] CDPN, Li et al, ICCV 2019 [4] CosyPose, Labbé et al, ECCV 2020 [5] Pix2Pose, Park et al, ICCV 2019 [6] https://github.com/kirumang/Pix2Pose + running time < 0.5s per image

  10. Code ● State-of-the-art pre-trained models for multiple datasets ● RGB single-view and multi-view modular framework ● Full training code https://github.com/ylabbe/cosypose

  11. 6th International Workshop on Recovering 6D Object Pose CosyPose: Consistent multi-view multi-object 6D pose estimation arXiv:2008.08465 Yann Labbé 1,2 , Justin Carpentier 1,2 , Mathieu Aubry 4 , Josef Sivic 1,2,3 1 Inria 2 DI ENS, PSL 3 CIIRC, CTU in Prague 4 ENPC https://github.com/ylabbe/cosypose

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend