CosyPose: Consistent multi-view multi-object 6D pose estimation





SLIDE 1

CosyPose: Consistent multi-view multi-object 6D pose estimation

Yann Labbé 1,2, Justin Carpentier 1,2, Mathieu Aubry 4, Josef Sivic 1,2,3

1 Inria  2 DI ENS, PSL  3 CIIRC, CTU in Prague  4 ENPC

6th International Workshop on Recovering 6D Object Pose

arXiv:2008.08465

SLIDE 2

Multi-view 6D pose estimation

Input: images → Output: 3D scene

SLIDE 3

CosyPose: Approach overview

  • Single-view 6D pose estimation
  • Robust multi-view multi-object reconstruction
  • BOP 2020 Challenge

SLIDE 4

Single-view CosyPose

Input RGB image → Mask R-CNN 2D detections → for each detection: coarse network (initial 6D pose) → refiner network (refined 6D pose)

(only 3 networks trained per dataset)
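The detect → coarse → refine pipeline above can be sketched in Python. Every function here is an illustrative stand-in (dummy detections, identity updates), not the CosyPose implementation:

```python
# Hedged sketch of the single-view pipeline described on this slide
# (detection -> coarse pose -> refinement). All functions are
# illustrative stand-ins, not the authors' code.
import numpy as np

def detect_objects(image):
    """Stand-in for Mask R-CNN: return 2D detections (label + box)."""
    return [{"label": "obj_000001", "bbox": (10, 10, 80, 80)}]

def coarse_network(image, detection):
    """Stand-in for the coarse network: predict an initial 6D pose
    (a 4x4 rigid transform) from a crop around the detection."""
    pose = np.eye(4)
    pose[2, 3] = 1.0  # place the object 1 m in front of the camera
    return pose

def refiner_network(image, detection, pose, n_iters=2):
    """Stand-in for the refiner: iteratively correct the coarse pose."""
    for _ in range(n_iters):
        pose = pose @ np.eye(4)  # placeholder for the learned update
    return pose

def single_view_cosypose(image):
    """Run the full per-image pipeline: one refined pose per detection."""
    results = []
    for det in detect_objects(image):
        coarse = coarse_network(image, det)
        refined = refiner_network(image, det, coarse)
        results.append({"label": det["label"], "pose": refined})
    return results
```

The key design point the slide makes is that only three networks (detector, coarse, refiner) are trained per dataset; the same refiner is reused for every detection.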

SLIDE 5

Pose estimation networks

Built on DeepIM (Li et al., ECCV 2018), with changes to the network, the rotation parametrization, the loss, and the data augmentation.

Coarse CNN: input "canonical" pose → "coarse" pose
Refiner CNN: input "coarse" pose → pose update → "refined" pose

(details in the paper, arXiv:2008.08465)
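Since the refiner follows DeepIM's iterative render-and-compare idea, its core loop amounts to repeatedly composing a predicted correction with the current pose estimate. The update rule below is a simplified illustration with hard-coded corrections, not the paper's exact parametrization:

```python
# Hedged sketch of a DeepIM-style iterative pose update: in the real
# system a CNN predicts a correction (dR, dt) by comparing a rendering
# at the current pose with the observed image; here the corrections
# are hard-coded for illustration.
import numpy as np

def rotation_z(theta):
    """Rotation matrix about the camera z-axis."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def apply_update(R, t, dR, dt):
    """Disentangled update: compose the rotation correction on the
    left, add the translation correction separately."""
    return dR @ R, t + dt

# Start from a coarse pose and apply two predicted corrections.
R, t = np.eye(3), np.array([0.0, 0.0, 1.0])
for dtheta in (0.3, 0.1):  # pretend these came from the refiner CNN
    R, t = apply_update(R, t, rotation_z(dtheta), np.zeros(3))
```

Two z-rotations compose by adding their angles, so after the loop `R` equals `rotation_z(0.4)` and is still a valid rotation (orthonormal, determinant 1), which is why updates are composed multiplicatively rather than added elementwise.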

SLIDE 6

Key ingredients

[Bar chart: recall (e_VSD < 0.3) on T-LESS]
Pix2Pose (Park et al., ICCV 2019): 29.5
Ours without data augmentation: 37.0
Ours with data augmentation: 63.8

(more ablations in the paper, Sec 3, Table 1b)

SLIDE 7

Key ingredients

[Bar chart, same as previous slide: recall (e_VSD < 0.3) on T-LESS — Pix2Pose: 29.5, ours without data augmentation: 37.0, ours with data augmentation: 63.8]

+ Access to a GPU cluster*: training one pose network takes ~10 hours on 32 GPUs
*Jean Zay, the French national cluster managed by GENCI-IDRIS

(more ablations in the paper, Sec 3, Table 1b)

SLIDE 8

3D visualization

Input image → predicted poses

SLIDE 9

BOP20 results

[Bar chart: AR_Core score over the 7 BOP datasets, RGB and RGB-D methods, running time < 0.5 s per image]
Methods compared: CosyPose (Labbé et al., ECCV 2020), EPOS (Hodan et al., CVPR 2020), CDPN (Li et al., ICCV 2019), Pix2Pose (Park et al., ICCV 2019; https://github.com/kirumang/Pix2Pose)
Training data: synthetic (PBR, rendered with BlenderProc: Denninger, Sundermeyer, Winkelbauer, Olefir, Hodan, Zidan, Elbadrawy, Knauer, Katam, Lodhi, RSS workshops) or synthetic + real

SLIDE 10

Code

https://github.com/ylabbe/cosypose

  • State-of-the-art pre-trained models for multiple datasets
  • RGB single-view and multi-view modular framework
  • Full training code
SLIDE 11

CosyPose: Consistent multi-view multi-object 6D pose estimation

Yann Labbé 1,2, Justin Carpentier 1,2, Mathieu Aubry 4, Josef Sivic 1,2,3

1 Inria  2 DI ENS, PSL  3 CIIRC, CTU in Prague  4 ENPC

6th International Workshop on Recovering 6D Object Pose

arXiv:2008.08465
https://github.com/ylabbe/cosypose