a deep learning strategy for
play

A deep learning strategy for wide-area surveillance 17/05/2016 Mr - PowerPoint PPT Presentation

A deep learning strategy for wide-area surveillance 17/05/2016 Mr Alessandro Borgia Supervisor: Prof Neil Robertson Heriot-Watt University EPS/ISSS Visionlab Roke Manor Research partnership 17/05/2016 Implementation details of the CNN


  1. A deep learning strategy for wide-area surveillance 17/05/2016 Mr Alessandro Borgia Supervisor: Prof Neil Robertson Heriot-Watt University – EPS/ISSS – Visionlab Roke Manor Research partnership

  2. 17/05/2016 Implementation details of the CNN for re-identification Outline • Outline • The proposed re-identification system: • Motivation ﹣ A boostrap process for tracking: unifying tracking and deep • Proposed system learning-based re-identifications • Intra-camera tracking ﹣ Intra-camera tracking scheme • Time transition ﹣ Inter-camera tracking: time transition distributions estimation distribution over the network • Spacial distribution estimation • Cross-Input Neighborhood Differences (CIND) CNN: • Advantages • CIND-CNN • A more flexible approach for CNN: • CUHK-03 dataset ﹣ Going deeper by residual learning • A more flexible ﹣ Triplet network training scheme approach ﹣ Residual learning ﹣ Batch normalization ﹣ Triplet network ﹣ Batch norm. • Simulations • Simulations • Visualizing deep features • Features appearance • Next step • References Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  3. 17/05/2016 Implementation details of the CNN for re-identification Motivation • Outline • Context : people tracking in multiple non-overlapping cameras • Motivation • Proposed system • Problem : dealing with targets disappearing for extended periods of • Intra-camera tracking time (long occlusions) • Time transition distribution • Challenges arising in different camera views: complex variations of lightings, poses, viewpoints, occlusions. • Spacial distribution estimation • Traditional approaches : engineering hand-crafted features • Advantages • CIND-CNN • Actual approach : employing a deep learning-based (DL) re- • CUHK-03 dataset identification strategy • A more flexible approach • Why? : a deep architecture allows to model effectively the mixture of ﹣ Residual learning complex multimodal photometric and geometric transforms that ﹣ Triplet network targets undergo. ﹣ Batch norm. • Simulations • Novelty : the proposed DL-based re-identification scheme is • Features appearance proposed as a boostrap process for the inter-camera tracking task, • Next step defining a unified framework Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  4. 11/05/2016 Implementation details of the CNN for re-identification The proposed system • Iterative adaptive interaction between the re-identification and • Outline tracking tasks • Motivation • Effect: boosting each other: more powerful tracking capabilities in • Proposed system presence of disappearing targets and • Intra-camera tracking • The re-id stage feeds the process of automatic refinement of the • Time transition distribution logical topology and temporal interdependences of the network • Spacial distribution (automatically learned from observations) estimation • The temporal distributions, by feeding the CNN classifier (and back- • Advantages tuning the weights accordingly) enable the CNN to take more • CIND-CNN reliable context-aware re-id decisions. • CUHK-03 dataset • A more flexible approach ﹣ Residual learning ﹣ Triplet network ﹣ Batch norm. • Simulations • Features appearance • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  5. 11/05/2016 Implementation details of the CNN for re-identification Intra-camera tracking scheme • Outline • Motivation • Proposed system • Intra-camera tracking • Time transition distribution • Spacial distribution estimation • Investigated context: a wide area surveillance network with unknown, • Advantages unconstrained topology and non-calibrated static CCTV cameras • CIND-CNN • Tracking based only on re-identifications by a CNN. • CUHK-03 dataset • A more flexible • Gathering entry and exit points of all the built trajectories approach ﹣ Residual learning • Estimation of the entry/exit regions by Gaussian Mixture Model and ﹣ Triplet network Expectation Maximization algorithm ﹣ Batch norm. • Simulations • Entry/exit points represent the network nodes according to which to buid • Features appearance the network logical topology • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  6. 11/05/2016 Implementation details of the CNN for re-identification Time transition distribution over all links • Outline • Motivation • Proposed system • Intra-camera tracking • Time transition distribution • Spacial distribution estimation • Advantages C a C b • CIND-CNN • CUHK-03 dataset • A more flexible approach ﹣ Residual learning ﹣ Triplet network ﹣ Batch norm. • Simulations • Features appearance • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  7. 11/05/2016 Implementation details of the CNN for re-identification Advantages • Outline • Achieved context-aware decisions that boost the tracking of • Motivation people going out-of-view • Proposed system • Intra-camera tracking • More accurate intra-view tracks provided by the strong • Time transition discrimination capabilities of a deep architecture in re-id distribution • Spacial distribution • Re-identifications based on posterior probabilities built from both estimation the spatio-temporal priors over the network • Advantages • CIND-CNN • Automatic and adaptive learning of the logical topology and the • CUHK-03 dataset time transition relationships of the network • A more flexible approach •  Robustness against cameras breakdown ﹣ Residual learning ﹣ Triplet network ﹣ Batch norm. • Simulations • Features appearance • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  8. 11/05/2016 Implementation details of the CNN for re-identification • Outline • Motivation • Proposed system • Intra-camera tracking • Time transition distribution • Spacial distribution estimation 1 st CNN implemented • Advantages • CIND-CNN • CUHK-03 dataset • A more flexible approach ﹣ Residual learning ﹣ Triplet network ﹣ Batch norm. • Simulations • Features appearance • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  9. 11/05/2016 Implementation details of the CNN for re-identification 1 st CNN: Cross-Input Neighborhood Differences CNN • Outline • Motivation • Proposed system • Intra-camera tracking • Time transition distribution • Spacial distribution estimation • Advantages • CIND-CNN • CUHK-03 dataset • A more flexible approach ﹣ Residual learning ﹣ Triplet network • Each output a j can be interpreted of the softmax function in terms of ﹣ Batch norm. predicted probability p j =P(y=j| x ) for the j th class given a sample vector x : • Simulations • Features appearance • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  10. 11/05/2016 Implementation details of the CNN for re-identification Data augmentation and data balancing (minibatches) • Outline • Motivation • Applying label-preserving operations: • Proposed system random 2D translational transforms on • Intra-camera tracking each pedestrian image • Time transition distribution • Uncovered stripes of the bounding-box • Spacial distribution filled with pixels randomly selected estimation from the original image • Advantages • CIND-CNN • First, the gradient of the loss over a mini-batch is an estimate of the • CUHK-03 dataset gradient over the training set, whose quality improves as the batch • A more flexible approach size increases. ﹣ Residual learning ﹣ Triplet network • Second, computation over a batch can be much more efficient than ﹣ Batch norm. m computations for individual examples, due to the parallelism • Simulations afforded by the modern computing platforms. • Features appearance • Minibatches size: 256 images • Next step Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

  11. 11/05/2016 Implementation details of the CNN for re-identification CIND-CNN limitations • Outline • Motivation • Issue: huge peak • Proposed system (~1e20) within the • Intra-camera tracking first epoch after • Time transition some mini-batch distribution iterations • Spacial distribution estimation • Advantages • CIND-CNN • BP+SGD make it very sensible to initialization values and to the initial • CUHK-03 dataset learning rate value • A more flexible approach • Not very deep ﹣ Residual learning ﹣ Triplet network ﹣ Batch norm. • Deep learning paradigm violation: the function approximated is constrained at the level of the difference layer • Simulations • Features appearance • This CNN performs feature extraction and classification by a fully • Next step connected layer preventing to make sense of how the features are getting distributed in their space Alessandro Borgia Heriot-Watt University - EPS/ISSS - Visionlab Roke Manor Research

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend