Boosting Visual Object Tracking Using Deep Features and GPU Implementations
Michael Felsberg Computer Vision Laboratory Department of Electrical Engineering Linköping University michael.felsberg@liu.se
Martin Danelljan Fahad Khan
Boosting Visual Object Tracking Using Deep Features and GPU - - PowerPoint PPT Presentation
Boosting Visual Object Tracking Using Deep Features and GPU Implementations Michael Felsberg Computer Vision Laboratory Department of Electrical Engineering Linkping University michael.felsberg@liu.se Martin Danelljan Fahad Khan
Martin Danelljan Fahad Khan
Classification Task: Is there a dog in the image ? Detection Task: Where is a dog in the image ? Tracking Task: Where is the dog from the first frame in all subsequent frames of the image sequence ?
Problems:
clutter changes in viewpoint scale illumination motion articulation
[Jianbo Shi & Carlo Tomasi CVPR 1994 ]
C-COT
VOT2014 VOT2015 C-COT, ECCV2016 VOT2016 DSST, PAMI2016 SRDCF, ICCV2015 DeepSRDCF, VOT2015 ACT, CVPR2014
[Danelljan et al., ICCV 2015] [Danelljan et al., CVPR 2016]
[Danelljan et al., ICCV 2015]
[Danelljan et al., CVPR 2016]
Method Top-5 Error Method Description SuperVision (Toronto) 0.16422 CNN ISI (Tokyo) 0.26172 Hand-crafted features: SIFT, HOG and LBP OXFORD 0.26979 DPM + Hand-crafted features XRCE/INRIA 0.27058 Hand-crafted features
[Gladh et al., ICPR 2016, best paper] [Danelljan & Häger et al., VOT 2015]
[Danelljan et al., ECCV 2016]
C-COT ECO High-dimensional features No. of parameters (800,000) in online learning. Scarcity of training data in tracking
Discriminatively learn a lower-dimensional feature space by jointly minimizing the classification error. 80% reduction in the number of modell parameters [Danelljan et al., CVPR 2017]
C-COT ECO Large training sample set Significant computational burden. Memory size is limited due to large feature set Discarding old samples lead to over-fitting to recent appearance Model the training data as a mixture of Gaussian components. Compact and diverse representation of training data [Danelljan et al., CVPR 2017]
HC features Accuracy Speed C-COT 50.8 < 10 FPS ECO 52.9 60 FPS partial occlusion (the guitar) deformations
[Danelljan et al., CVPR 2017]
[Danelljan et al., CVPR 2017]
Paper submission 3 Apr, 2017 Author notification 26 May, 2017 Camera-ready paper 31 May, 2017 Early registration 16 Jun, 2017 Main conference 22-24 Aug, 2017 Invited speakers Alan Bovik Markus Vincze Christian Igel REACTS Workshop Pose estimation tutorial George Azzopardi Anders G. Buch
2D-to-3D 3D Vision Biomedical image and pattern analysis Biometrics Brain-inspired methods Document analysis Face and gestures Feature extraction Graph-based methods High-dimensional topology methods Human pose estimation Image/video indexing & retrieval Image restoration Keypoint detection Machine learning for image and pattern analysis Mobile multimedia Model-based vision Motion and tracking Object recognition Segmentation Shape representation and analysis Static and dynamic scene analysis Statistical models Surveillance Vision for robotics
General Chair Michael Felsberg Program Chairs Anders Heyden Norbert Krüger Industrial Liaison Zhibo Pang
The conference invites novel contributions to the automatic analysis of images and patterns, encompassing both new challenging application areas and substantial new theoretical developments in the field.