Performance Measures and the DukeMTMC Benchmark for Multi-Target Multi-Camera Tracking
Ergys Ristani
Performance Measures and the DukeMTMC Benchmark for Multi-Target - - PowerPoint PPT Presentation
Performance Measures and the DukeMTMC Benchmark for Multi-Target Multi-Camera Tracking Ergys Ristani In collaboration with: Francesco Solera Roger Zou Rita Cucchiara Carlo Tomasi Outline Problem Definition Performance Measures
Ergys Ristani
Francesco Solera Roger Zou Rita Cucchiara Carlo Tomasi
Multi-Target Multi-Camera Tracking
DukeMTMCT Benchmark
Multi-Target Multi-Camera Tracking
DukeMTMCT Benchmark
Multi-Target Multi-Camera Tracking
MTMCT Evaluation
DukeMTMCT Benchmark
Suspect
Suspect ID1 Tracker (a) Computed IDs
Suspect ID1 ID2 Tracker (a) Computed IDs
Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch
Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification
Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification Tracker (b)
Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification 7 switches 67% identification Tracker (b)
Suspect Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification 7 switches 67% identification 7 switches 83% identification Tracker (b) Tracker (c)
Suspect Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification 7 switches 67% identification 7 switches 83% identification Tracker (b) Tracker (c)
Suspect Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification 7 switches 67% identification 7 switches 83% identification Tracker (b) Tracker (c)
Suspect Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification 7 switches 67% identification 7 switches 83% identification Tracker (b) Tracker (c)
Suspect Suspect Suspect ID1 ID2 Tracker (a) Computed IDs 1 switch 67% identification 7 switches 67% identification 7 switches 83% identification Tracker (b) Tracker (c)
MTMC Tracker Performance
Object detection Appearance model Motion model Interaction model
Optimization
Parameter tuning Training data
MTMC Tracker Performance
ML MT MOTA IDS FRG
[1] Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics. Image Video Proc. 2008 [2] Learning to Associate: HybridBoosted Multi-Target Tracker for Crowded Scene. CVPR 2009
MTMC Tracker Performance
ML MT MOTA IDS MCTA
Handover (X, FRG)
FRG
[3] An Equalized Global Graph Model-Based Approach for Multi-Camera Object Tracking. IEEE TCAS 2016 [4] Inter-camera Association of Multi-target Tracks by On-Line Learned Appearance Affinity Models. ECCV 2010
MTMC Tracker Performance
MCTA
Handover (X, FRG)
MTMC Tracker Performance
ML MT MOTA IDS FRG
MCTA
Handover (X, FRG)
[5] Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol. TPAMI 2008 [6] A New Benchmark and Protocol for Multi-Object Detection and Tracking. arXiv CoRR 2015 [7] Evaluating Multi-Object Tracking. CVPRWS 2005
MTMC Tracker Performance
ATA PR-MOTA STDA-D
TP
FIT FIO OP ML MT MOTA IDS FRG
T1 C1 C2 C1 T1 T2 T1 C2 True Computed
T1 C1 C2 T2 C3 True Computed
True Computed T1 C1 C2 T2 C3
True Computed T1 C1 C2 T2 C3
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3 time
True Computed
1
T1 C1 C2 T2 C3 time
3
T1 C1 C2 T2 C3 time True Computed
5
T1 C1 C2 T2 C3 time True Computed
7
T1 C1 C2 T2 C3 time True Computed
2 9
T1 C1 C2 T2 C3 time True Computed
2 1 11
T1 C1 C2 T2 C3 time True Computed
T1 C1 C2 T2 C3
2 2 13
time True Computed
2 1 2 14
T1 C1 C2 T2 C3 time True Computed
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
MT, PT, ML
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
T1 T2 C1 C2 T1 C1 C2 T2 C3 time True Computed
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
MT, PT, ML
Brittle due to mapping
MT, PT, ML
A A A A A A A A A A A A A A A A Camera 1 Camera 2 Truth
A A A A A A A A A A A A A A A A Camera 1 Camera 2 Truth b b b b b b b b b b b b Tracker 1 c b b b time
A A A A A A A A A A A A A A A A Camera 1 Camera 2 Truth b b b b b b b b b b b b Tracker 1 c b b b time
A A A A A A A A A A A A A A A A Camera 1 Camera 2 b b b b b b b b b b b b Truth Tracker 1 c b b b b b b b b b b b b b b b Tracker 2 b b b time
A A A A A A A A A A A A A A A A Camera 1 Camera 2 b b b b b b b b b b b b Truth Tracker 1 c b b b b b b b b b b b b b b b Tracker 2 b b b time
MT, PT, ML Ignore identity
A A A A A A A A time Truth A A
A A A A A A A A time Truth A A
Mostly Tracked: GT trajectories which are covered by tracker output for more than 80% in length
A A A A A A A A time Truth Tracker 1 b c c e e g g b A A f f
Mostly Tracked: GT trajectories which are covered by tracker output for more than 80% in length
A A A A A A A A time Truth Tracker 1 b c c e e g g b MT = 1 A A f f
Mostly Tracked: GT trajectories which are covered by tracker output for more than 80% in length
A A A A A A A A time b c c b g g Truth Tracker g b MT = 1 A A b b FRG = 4
Mostly Tracked: Percentage of GT trajectories which are covered by tracker output for more than 80% in length Fragments: The number of times that a GT trajectory is interrupted in tracking result
A A A A A A A A time Truth Tracker b c c e e g g b A A f f FRG = 4 MT = 1
A A A A A A A A time b c b b g b Truth Tracker g b b c c e e g g b Tracker++ MT = 1 A A b b f f FRG = 4
A A A A A A A A time b c b b g b Truth Tracker g b Tracker++ MT = 1 A A b b FRG = 4 b c c e e g g b f f MT = 1 FRG = 4
Bizarre combination
MT, PT, ML
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b b b b b b b Truth Tracker 1 c b b b b b b b b b b b b b b b c b b b Tracker 2
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b b b b b b b Truth Tracker 1 c b b b b b b b b b b b b b b b c b b b Tracker 2 Tracker 1 Tracker 2
Fragmentations
Handover | Within camera
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b b b b b b b Truth Tracker 1 c b b b b b b b b b b b b b b b c b b b Tracker 2 Tracker 1 Tracker 2
Fragmentations
Handover | Within camera
Mapping not one-to-one
MT, PT, ML
A A A A A A A A time b b b True Computed c b c c c A A A A A A A A time b b b True Computed b c b b b
A A A A A A A A time b b b True Computed c b c c c A A A A A A A A time b b b True Computed b c b b b
Half explanation Nearly full explanation
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c b c A True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c b c A
Cost on each edge is number of mis-assigned frames between two trajectories
4 14 16 12 2
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b b b b b b b b A
Cost on each edge is number of mis-assigned frames between two trajectories
4
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c b c A
Cost on each edge is number of mis-assigned frames between two trajectories
4 14 16 12 2
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 c c c A
Cost on each edge is number of mis-assigned frames between two trajectories
14
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c b c A
Cost on each edge is number of mis-assigned frames between two trajectories
4 14 16 12 2
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 A
Cost on each edge is number of mis-assigned frames between two trajectories
16
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c b c A
Cost on each edge is number of mis-assigned frames between two trajectories
4 14 16 12 2
True Computed
time Camera 1 Camera 2 b b b b b b b b b b b b b
Cost on each edge is number of mis-assigned frames between two trajectories
12
True Computed
A A A A A A A A time A A A A A A A A Camera 1 Camera 2 b b b b b b c b b b b b b c b c A
Cost on each edge is number of mis-assigned frames between two trajectories
4 14 16 12 2
True Computed
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Computed T1 C1 C2 T2 C3
MCTA Handover (X, FRG)
MTMC Tracker Performance
ATA PR-MOTA STDA-D TP FIT FIO OP
ML MT MOTA IDS FRG
True Detections Computed Detections
True Positives False Positives False Negatives
True Detections Computed Detections
True Positives False Positives False Negatives
True Detections Computed Detections
True Positives False Positives False Negatives
True Detections Computed Detections
True Positives False Positives False Negatives
number of ground-truth and computed detections
b c A
b c A b A c
b c A
4 14 16 12 2
Min Cost Matching
b A c
4 2
T1 C1 C2 T2 C3
T1 C1 C2 T2 C3
MOT 16 Challenge
Tracker MOTA ↑
HCC 0.492 LMP 0.487 FWT 0.477 NLLMPa 0.475 MDPNN16 0.471 NOMT_16 0.464 JMC 0.462 QuadMOT16 0.441
0.432 MHT_DAM_16 0.429 LINF1_16 0.410 EAMTT_pub 0.388 OVBT 0.384 LTTSC-CRF 0.375 LP2D_16 0.357 TBD_16 0.337 CEM_16 0.331 DP_NMS_16 0.321 GMPHD_HDA 0.305 SMOT_16 0.297 JPDA_m_16 0.261
MOT 16 Challenge
Tracker MOTA ↑
HCC 0.492 LMP 0.487 FWT 0.477 NLLMPa 0.475 MDPNN16 0.471 NOMT_16 0.464 JMC 0.462 QuadMOT16 0.441
0.432 MHT_DAM_16 0.429 LINF1_16 0.410 EAMTT_pub 0.388 OVBT 0.384 LTTSC-CRF 0.375 LP2D_16 0.357 TBD_16 0.337 CEM_16 0.331 DP_NMS_16 0.321 GMPHD_HDA 0.305 SMOT_16 0.297 JPDA_m_16 0.261
Tracker IDF1 ↑
NOMT_16 0.533 LMP 0.512 HCC 0.506
0.493 NLLMPa 0.473 JMC 0.463 MDPNN16 0.462 MHT_DAM_16 0.457 LINF1_16 0.456 FWT 0.442 EAMTT_pub 0.424 LTTSC-CRF 0.420 QuadMOT16 0.382 OVBT 0.378 CEM_16 0.357 LP2D_16 0.341 GMPHD_HDA 0.333 JPDA_m_16 0.311 DP_NMS_16 0.287 TBD_16 0.251 SMOT_16 0.185
Tracker MOTA ↑
HCC 0.492 LMP 0.487 FWT 0.477 NLLMPa 0.475 MDPNN16 0.471 NOMT_16 0.464 JMC 0.462 QuadMOT16 0.441
0.432 MHT_DAM_16 0.429 LINF1_16 0.410 EAMTT_pub 0.388 OVBT 0.384 LTTSC-CRF 0.375 LP2D_16 0.357 TBD_16 0.337 CEM_16 0.331 DP_NMS_16 0.321 GMPHD_HDA 0.305 SMOT_16 0.297 JPDA_m_16 0.261
MOT 16 Challenge
Tracker IDF1 ↑
NOMT_16 0.533 LMP 0.512 HCC 0.506
0.493 NLLMPa 0.473 JMC 0.463 MDPNN16 0.462 MHT_DAM_16 0.457 LINF1_16 0.456 FWT 0.442 EAMTT_pub 0.424 LTTSC-CRF 0.420 QuadMOT16 0.382 OVBT 0.378 CEM_16 0.357 LP2D_16 0.341 GMPHD_HDA 0.333 JPDA_m_16 0.311 DP_NMS_16 0.287 TBD_16 0.251 SMOT_16 0.185
Rank Improvement
+5
+5
+1
+2 +2
+1 +2
+2
+2 +3
Private detections Public detections
NOMT LMP HCC
FWT CEM JPDA NLLMPa JMC POI NOMTwSDP DeepSORT KDNT
[8] Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking. ECCV 2016 BMTT Workshop
ML MT MOTA IDS IDF1 IDP IDR MCTA
Handover (X, FRG)
FRG
MTMC Tracker Performance
ATA PR-MOTA STDA-D
TP
FIT FIO OP
EPFL APIDIS
NLPR 1, 2 NLPR 3
ISSIA
PETS2009
Trainval 0-50 min Test-hard 50-60 min Test-easy 60-85 min
Camera i Person detection Multi-Camera Identities
+1 +1 +1 +0.4 +1 +1 +1 +1 +1
MTMC Tracking as Correlation Clustering
Tracking Multiple People Online and in Real Time, E. Ristani and C. Tomasi, ACCV 2014
Incorrect Matches
Correct Matches
IDP IDR
Fragmentations
Merges
IDP IDR
Fragmentations
Merges
ML MT MOTA IDS IDF1 IDP IDR MCTA
Handover (X, FRG)
FRG
MTMC Tracker Performance
understand tracker performance
ATA PR-MOTA STDA-D
TP
FIT FIO OP
Object detection Appearance model Motion model Interaction model
Optimization
Parameter tuning Training data
Suspect Suspect Suspect Tracker (a) 1 switch 67% identification 7 switches 67% identification 83% identification Tracker (b) Tracker (c)
tracking systems
[1] Performance Measures and Dataset for Multi-Target, Multi-Camera Tracking Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, Carlo Tomasi ECCV 2016 Workshop on Benchmarking Multi-Target Tracking [3] Tracking Multiple People Online and in Real Time Ergys Ristani and Carlo Tomasi ACCV 2014 [2] Tracking Social Groups Within and Across Cameras Francesco Solera, Simone Calderara, Ergys Ristani, Carlo Tomasi, Rita Cucchiara IEEE Transactions on Circuits and Systems 2016
Code/Data: vision.cs.duke.edu/DukeMTMC/