Mar MartinHu Humenberger NA NAVERLA LABS BSEur urop
- pe,3D
3DVision
- nGr
Group
- up
http://europe.naverlabs.com
WhenDeepLearningmeets VisualLocalization Mar MartinHu Humenberger - - PowerPoint PPT Presentation
WhenDeepLearningmeets VisualLocalization Mar MartinHu Humenberger NA NAVERLA LABS BSEur urop ope,3D 3DVision onGr Group oup http://europe.naverlabs.com Outline
Mar MartinHu Humenberger NA NAVERLA LABS BSEur urop
3DVision
Group
http://europe.naverlabs.com
Wewanttoovercomecurrentlimitationsoftraditional,mainlygeometry-based,methodsof3D visionusingdatadrivenmachinelearningtechniques. Mainr researcht topics: a) Fundamentalmethodsof3Dvision
b) Cameraposeestimation
c) 3Dsceneunderstanding
d) Syntheticdatasetsanddomainadaptation
ChâteaudeSceaux GPSaccuracy sometimesnotenough. E.g.forpreciserobotnavigationor augmentedreality. Goal:Useanimagetoestimatethepr precise positionof thecamerawithinagivenarea(map).
Thisworksindooraswell! There,itisinparticularusefulsinceGPSisnotavailable.
referenceimage(map) viewpointandscale
illumination
viewpoint,occlusion,weather
Descriptormatchingtoget 2D-3Dcorrespondences Camerapose estimation
2DInputImage 3DMap
Inputimage Featuredetection& description
Takeapicture
Location
Descriptor matching to get 2D-3D correspondences Camera pose estimation
2D Input Image
Input image Image retrieval
Large 3D Map
Location
No 3D Map but… CNN
image camerapose image camerapose image camerapose
2D Input Image
Input image CNN to directly estimate the camera pose
CNN
Location
CNNtoregressdense 2D-3Dcorrespondences Camerapose estimation
2DInputImage 3DMap
Inputimage
Takeapicture
Location
Featuredetection& description
St Stru ructure-ba based meth methods ds ActiveSearch[1] OpenMVG [2] + Perform verywellonmostdatasets->highaccuracy
processingtime) Imager retrieval-ba based meth methods ds HF-Net[3] + Improve speedandrobustnessforlargescalesettings
Camerap pose regressionm methods PoseNet [4] + Interestingapproachbecauseno3Dmapsareneededand itis datadriven(canbetrainedforcertainchallenges)
Scenec coordinate regressionm methods DSAC++[5] + Veryaccurateinsmallscalesettings
[1]T.Sattleretal.,ImprovingImage-BasedLocalizationbyActiveCorrespondenceSearch,ECCV2012 [2]P.Moulon,OpenMVG:http://github.com/openMVG/openMVG [3]Sarlin etal.,FromCoarsetoFine:RobustHierarchicalLocalizationatLargeScale,CVPR2019 [4]A.Kendalletal.,PoseNet: http://mi.eng.cam.ac.uk/projects/relocalisation/,ICCV2015 [5]E.Brachmann etal.,LearningLessisMore 6DCameraLocalizationvia3DSurfaceRegression,CVPR2018
Mean Matching Accuracy (MMA)
Mean Matching Accuracy (MMA)
J.Schönberger,RobustMethodsforAccurateandEfficient3DModelingfromUnstructuredImagery,PhD,ETHZ
https://demuc.de/colmap/
http://imagine.enpc.fr/~moulonp/openMVG/
Alex Kendall, Matthew Grimes and Roberto Cipolla. PoseNet: A Convolut utiona nal Network for Real-Ti Time 6-DOF Camera Relocalization.
SlidecreditAlexKendall,https://pdfs.semanticscholar.org/4fc6/7b4dc62e9c8eee4259c3878b71c64958c373.pdf
RG RGB,SfM SfM
Jamie Shotton et al. Scene coordinate regression forests for camera relocalization in RGB-D images. CVPR 2013
Slide credit Alex Kendall, https://pdfs.semanticscholar.org/4fc6/7b4dc62e9c8eee4259c3878b71c64958c373.pdf
RGB-D, pose, dense reconstruction
OldinnercityofAachen,Germany
trainingimageexamples testimageexamples(day- night) 3Dreconstruction(sfm)
camerasandalidar scanner
Su Sunetal.,ADa Datase setforBe Benchmarking gImage ge-Ba Base sedLocalization,CVP CVPR1 R17
Tailoredtotestspecificchallengesofvisual localization,suchas:
Training:
Testing:
position
Download:https://europe.naverlabs.com/research/3d-vision/virtual-gallery-dataset/
ObjectsofInterest(OOI)aredistinctiveareaswithintheenvironmentwhichcanbedetected undervariousconditions. Pu Publ blisheda shedatC tCVPR VPR19
1)Startwithinputimage 3)Usecorrespondencesto computethecameralocation 2)FeedintoOOInetwork
Map=
Mainadvantage: Datadrivenapproach whichcanovercome commonVLchallenges. Pu Publ blisheda shedatC tCVPR VPR19
Paper:https://europe.naverlabs.com/research/publications/visual-localization-by-learning-objects-of-interest-dense-match-regression/
matching.
Csurka etal.,Fromhandcraftedtodeeplocalfeatures,arXiv 2018
1)Startwithinputimage
then-describe
2)Detectkeypoints
Keypoint detector Extract patches
3)Describekeypoints !
Patch descriptor
1)Startwithinputimage
then-describe
Detect-an and-describe
2)FeedintoR2D2network 3)Detectkeypoints & describethematonce Keypoints (nms) descriptor foreachkeypoint !
Repeatable? Reliable? Ourapproach
No Yes No Yes
Repeat atab ability:imagelocationsthatareinvarianttousualimagetransformations(e.g.corners)
Reliab ability:imagelocationsthataregood(discriminativeandrobust)formatchingpurpose è Allcasesarepossible: reliability andrepeatability areindependent 3
3
1
1
2
2
4
4
Imagewith top-scoredkeypoints repeatabilitymap reliabilitymap
Thecoloredcrossesindicatematchedkeypoints.Ascanbeseen,ourmethodevenworksundervery challengingconditionssuchasday-nightimagepairsandlargeviewpointchanges.
116sequencesof6images
MMA
Accuracy acy(higherisbetter) è R2D2:outperformsallotherapproaches,includingrecentones
fke keypo ypoints ts (lessisbetter) è R2D2:equalorlessthanotherapproaches
Featuredimension(lessisbetter) è R2D2:muchsmallerthanothertop-rankingapproaches(upto8xsmaller)
elsiz size e(memory,lessisbetter) è R2D2:muchsmallerthanothertop-rankingapproaches(upto15xsmaller) Codeandmodelswillbereleased!
R2D2 accuracy Classicapproach MagicLeap Google Benchmarkcreators
VSLAM VisualSimultaneousLocalizationandMapping
Example:ORB-SLAM2
Raúl Mur-Artal and Juan D. Tardós, ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras
Challenge:
areasinthescene.
mappingpart.
Codeisavailableonline!
SLAMANTIC- LeveragingSemanticstoImproveVSLAMInDynamic Environments
R2D2:https://github.com/naver/r2d2 SLAMANTIC:https://github.com/mthz/slamantic VKITTI:https://europe.naverlabs.com/research/computer-vision/proxy-virtual- worlds/ VirtualGallery:https://europe.naverlabs.com/research/3d-vision/virtual-gallery- dataset/ LocalFeaturesSurvey:https://arxiv.org/abs/1807.10254 COLMAP:https://colmap.github.io/ OpenMVG:https://github.com/openMVG/openMVG VisualLocalizationBenchmark:http://visuallocalization.net VisualLocalizationTutorial:https://sites.google.com/view/lsvpr2019/home BaiduIBLdataset:https://sites.google.com/site/xunsunhomepage/