Embedded Multi-Person Pedestrian Tracking and Detection MSCV19 - PowerPoint PPT Presentation

Embedded Multi-Person Pedestrian Tracking and Detection MSCV19 Capstone Project, Internal(CMU) Team Member: Yongxin Wang, Chunhui Liu Advisor: Dr. Kris Kitani 05/03/2019

Introduction Motivation ● Multi-person pedestrain tracking ○ Real-time performance on embedded system ○ Visual analysis, automatic driving, robotics ○ Problem ● Detect and track multiple people ○ Deal with new object, out-of-view objects, ○ occlusion, large appearance changes Solution ● Track by detection - SiameseRPN (Single ○ Object) Multiple object extension ○ 2

Past, Present, Future Past: Present: Future: January: Single-Obejct SiameseRPN September 15: ● with Region of Interest (RoI) Finish RoI Align verification Start From Single Obejct ● ● Align Merge Multi-Object SiamRPN ● March: Multi Object SiamRPN SiamRPN with RoI Align ● Distractor October 15: Train/Finetune Single Obejct ● Data Association & NMS SiamRPN on VOT dataset ● October 31: April: Single Obejct SiamRPN with ROI Integrate object detection to ● ● handle new objects Align December 15: Multi Object SiamRPN Baseline ● Optimize and deploy ● algorithm on NVIDIA Jetson Machine 3

Past Single Object SiamRPN ● Implement Train Code & Verify ○ Present Fintune on VOT ○ ROI Align For Single Object SiamRPN ● Future Implement Code ○ Train and Verify on VOT ○ Multi Object SiamRPN ● Baseline Model ○ Multi Object Evaluatoin Code ○ 4

Past: Single Object SiamRPN Conv Template Features (4, 4, 2k ⨉ 256) CLS Score (FG/BG) (17, 17, 2k) AlexNet Image Features Conv Template (20, 20, 256) Feature (6, 6, 256) Conv Template Features (4, 4, 4k ⨉ 256) Bounding Box (x, y, w, h) AlexNet （ 17, 17, 4k) Image Feature (22, 22, 256) Image Features Conv (20, 20, 256) Li, Bo et al. “High Performance Visual Tracking with Siamese Region Proposal Network.” 2018 IEEE/CVF Conference on Computer Vision and 5 Pattern Recognition 2018

Past: Single Object SiamRPN Re-implementating trainign code Siamese RPN (training & testing) ● Official repository only has testing code ○ Sanity check of training process ○ Finetuned from pretrained model (trained with VID) on VOT dataset ■ RoI Align for Single Object SiamRPN - Need for SPEED ● Image Features (20, 20, 256) Image Features (20, 20, 256) 6

Past: Single Object SiamRPN Model Pretrained Finetune Test Data EAO ↑ DaSiamRPN (Official, YoutubeBB + - VOT 2015 0.446 SOTA) ImageNet VID SiamRPN ImageNet VID VOT 2015 (First 40 sequences) VOT 2015 (First 40 sequences) 0.5240 SiamRPN RoI ImageNet VID VOT 2015 (First 40 sequences) VOT 2015 (First 40 sequences) 0.6045 SiamRPN (with location & ImageNet VID - VOT 2015 0.3426 size penalty) SiamRPN ImageNet VID - VOT 2015 0.2647 SiamRPN - - VOT 2015 IP SiamRPN RoI - - VOT 2015 IP 7

Past: Single Object SiamRPN Red - SiamRPN (finetuned) Black - DaSiameseRPN Blue - SiamRPN RoI (finetuned) Green - Ground Truth 8

Past: Multi Object Tracking From Single Object Tracking to Multiple Object Tracking: ● A network that can handle several templates . ○ NMS & Data Association for matching labels . ○ Decide when to add and delete tempaltes . ○ Template Adapter . (Decide how to update the templates for the next frame) Conv Template Features (4, 4, 2k × 256) Cls Score (FG/BG) (17, 17, 2k) CNN Frame Features NMS + Data Conv Template Feature (20, 20, 256) Association (6, 6, 256) Templates Conv Template Features Bounding Box (x, y, w, h) (4, 4, 4k × 256) CNN (17, 17, 4k) Frame Feature (22, 22, 256) Frame Features Frame T Conv 9 (20, 20, 256) (255, 255, 3)

Past: Multi Object SiamRPN From Single Object Tracking to Multiple Object Tracking: ● A network that can handle several templates . ○ NMS & Data Association for matching labels . ○ Decide when to add and delete tempaltes . ○ Template Adapter . (Decide how to update the templates for the next frame) Conv Template Features (4, 4, 2k × 256) Cls Score (FG/BG) (17, 17, 2k) CNN Frame Features NMS + Data Conv Template Feature (20, 20, 256) Association (6, 6, 256) Templates Conv Template Features Bounding Box (x, y, w, h) (4, 4, 4k × 256) CNN (17, 17, 4k) Frame Feature (22, 22, 256) Frame Features Frame T Conv 10 (20, 20, 256) (255, 255, 3)

Past: Multi Object Extension Baseline Idea: ● Pre-compute correlation filters for each template ○ All templates share the RPN network to do tracking independently ○ Introduce Communication among templates (1) ● Concatenate all correlation filters as a bigger filter ○ Re-train RPN network to perform multi-object classification ○ Introduce Communication among templates (2) ● Add Distractor-aware loss and fine-tune RPN ○ 11

Network 0: Baseline (Pretrained Weight) n: number of templates k: number of anchors for each spatial pixel Conv CNN Template Features (4, 4, 2k × 256) Cls Score (FG/BG) Template Feature (17, 17, 2k) (6, 6, 256) Frame Features Template Feature Conv (20, 20, 256) (n, 6, 6, 256) Templates Conv Template Features (4, 4, 4k × 256) Bounding Box (x, y, w, h) CNN (17, 17, 4k) Frame Feature (22, 22, 256) Frame Features Conv (20, 20, 256) Frame T (255, 255, 3) 12

Visualization Results (MOT Dataset) 13

Visualization Response Template: Template: 14

Past: Multi Object SiamRPN Baseline Idea: ● Pre-compute correlation filters for each template ○ All templates share the RPN network to do tracking independtly ○ Introduce Communication among templates (1) ● Concatenate all correlation filters as a bigger filter ○ Re-train RPN network to perform multi-object classification ○ Introduce Communication among templates (2) ● Add Distractor-aware loss and fine-tune RPN ○ 15

Network 1: Abandoned n: number of templates k: number of anchors for each spatial pixel Conv CNN Template Features (4, 4, nk × 256) Cls Score (FG/BG) Template Feature (17, 17, (n+1)k) (6, 6, 256n) Frame Features Template Feature Conv (20, 20, 256) (n, 6, 6, 256) Templates Conv Template Features (4, 4, 4nk × 256) Bounding Box (x, y, w, h) CNN (17, 17, 4k) Frame Feature (22, 22, 256) Frame Features Conv (20, 20, 256) Frame T (255, 255, 3) 16

Past Single Object SiamRPN ● Training from scratch ○ Present Verifying Effect of RoI ○ Multi Object SiamRPN ● Future Try to fix Distractor Issue ○ 17

Present: Multi Object SiamRPN Baseline Idea: ● Pre-compute correlation filters for each template ○ All templates share the RPN network to do tracking independtly ○ Introduce Communication among templates (1) ● Concatenate all correlation filters as a bigger filter ○ Re-train RPN network to perform multi-object classification ○ Introduce Communication among templates (2) ● Add Distractor-aware loss and fine-tune RPN ○ 18

Network 2: Softmax (Pretrained Weight) Cls Score (FG/BG) (17, 17, 2k) RPN CNN SoftMax Cls Score (FG/BG) Cls Score (FG/BG) (17, 17, nk) (17, 17, 2k) Template Feature (n, 6, 6, 256) Templates Cls Score (FG/BG) (17, 17, 2k) CNN Frame Feature (22, 22, 256) Bounding Box (x, y, w, h) (17, 17, 4k) Frame T (255, 255, 3) 19

Present: Deal with Distractor Add a Layer to handle distractor-aware labelling ● Freeze the SiamRPN, only train the Association Network ○ E.g. A fully connect network ○ Cls Score (FG/BG) (17, 17, 2k) Neural RPN CNN Cls Score (FG/BG) Network Cls Score (FG/BG) (17, 17, nk) (17, 17, 2k) Template Feature Templates Cls Score (FG/BG) (n, 6, 6, 256) (17, 17, 2k) CNN Frame Feature (22, 22, 256) Bounding Box (x, y, w, h) Frame T 20 (17, 17, 4k) (255, 255, 3)

Present: Single Object SiamRPN ROI Align: Quantitative and Qualitative Verification ● Whole Image as Input Cropped Feature Cropped Image as Input Whole Feature 21

Past Finish RoI Align Verification for Single ● Object SiamRPN (September 15) Present Achieve similar EAO as in SiamRPN ○ paper Future Merge Multi Object SiamRPN with RoI ● Align (September 15) Achieve similar performance as ○ without RoI Align Data Association and NMS Network ● (October 15) Assign correct ID to correct person ○ Integrate Object Detection (October 31) ● Learn a universal template that has ○ high response on all pedestrians Test Speed and Deploy (December 15) ● 22

Future: Detect New Objects Sep 15 Oct 15 Oct 31 Nov 15 Dec 15 Finish RoI Align Verification for Single Object SiamRPN (September 15) ● Achieve similar EAO as in SiamRPN paper ○ Merge Multi Object SiamRPN with RoI Align (September 15) ● Achieve similar performance as without RoI Align ○ 23

Future: Detect New Objects Sep 15 Oct 15 Oct 31 Nov 15 Dec 15 Finish RoI Align Verification for Single Object SiamRPN (September 15) ● Achieve similar EAO as in SiamRPN paper ○ Merge Multi Object SiamRPN with RoI Align (September 15) ● Achieve similar performance as without RoI Align ○ Data Association and NMS Network (October 15) ● Assign correct ID to correct person ○ 24

Embedded Multi-Person Pedestrian Tracking and Detection MSCV19 - PowerPoint PPT Presentation

Embedded Multi-Person Pedestrian Tracking and Detection MSCV19 Capstone Project, Internal(CMU) Team Member: Yongxin Wang, Chunhui Liu Advisor: Dr. Kris Kitani 05/03/2019 Introduction Motivation Multi-person pedestrain tracking

Temporary Covered Temporary Drop Off Pedestrian Bridge Pedestrian Bri Foundations Pedestrian

Pedestrian Pedestrian Pedestrian C Pedestrian C C Crossing confusion rossing confusion

People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth

Understanding Pedestrian Collisions Partnering Conference September 10, 2013 Pedestrian Safety

Mayors Pedestrian Advisory Council Wednesday, February 15 Annual Pedestrian Fatalities 2005 -

Understanding driver/pedestrian conflicts: Driver Understanding driver/pedestrian conflicts:

Connect Currituck Pedestrian Master Plan Connect Currituck Pedestrian Master Plan Key Benefits

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Study sites 50 unsignalized Introduction pedestrian crossings in Warsaw High pedestrian

Group 3 3.5 Background Invariant Laser-spot Detection and Tracking for Embedded Systems, PI: L.

Embedded Multi-Target Tracking System CN052 Wang Shuhui, Wang Qiaoyuan, Wei Longping Lu Xiaofeng

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Foreground detection and tracking in 2D/3D Jos Luis Landabaso Montse Pards Outline 2D

Embedded PC The modular Industrial PC for mid-range control Embedded PC 1 Embedded OS

Learning Normalized Inputs for Iterative Estimation in Medical Image Segmentation Michal

Enterprise Application Integration Building the European Biodiversity through Service-Oriented

C-major A Music Production Language The Ensemble Stephanie Huang Andrew OReilly Jonathan

MELODY TONG GE JINGSI LI SHUO YANG Music programming language .mc .csv .midi

Len Preston Chief, Labor Market Information New Jersey Department of Labor & Workforce

1.3 EXCEL to New Heights PACE April 2017 Chad Carter And Bonnie Chisholm Why should I use

Project: IEEE P802.15 Working Group for Wireless Personal Area Networks ( etworks (WPANs WPANs)

Supervisor: Prof Robert W Stewart Dr Louise Crockett Outline Motivation and Objective

Embedded Multi-Person Pedestrian Tracking and Detection MSCV19 - PowerPoint PPT Presentation

Embedded Multi-Person Pedestrian Tracking and Detection MSCV19 Capstone Project, Internal(CMU) Team Member: Yongxin Wang, Chunhui Liu Advisor: Dr. Kris Kitani 05/03/2019 Introduction Motivation Multi-person pedestrain tracking

Temporary Covered Temporary Drop Off Pedestrian Bridge Pedestrian Bri Foundations Pedestrian

Pedestrian Pedestrian Pedestrian C Pedestrian C C Crossing confusion rossing confusion

People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth

Understanding Pedestrian Collisions Partnering Conference September 10, 2013 Pedestrian Safety

Mayors Pedestrian Advisory Council Wednesday, February 15 Annual Pedestrian Fatalities 2005 -

Understanding driver/pedestrian conflicts: Driver Understanding driver/pedestrian conflicts:

Connect Currituck Pedestrian Master Plan Connect Currituck Pedestrian Master Plan Key Benefits

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Study sites 50 unsignalized Introduction pedestrian crossings in Warsaw High pedestrian

Group 3 3.5 Background Invariant Laser-spot Detection and Tracking for Embedded Systems, PI: L.

Embedded Multi-Target Tracking System CN052 Wang Shuhui, Wang Qiaoyuan, Wei Longping Lu Xiaofeng

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Foreground detection and tracking in 2D/3D Jos Luis Landabaso Montse Pards Outline 2D

Embedded PC The modular Industrial PC for mid-range control Embedded PC 1 Embedded OS

Learning Normalized Inputs for Iterative Estimation in Medical Image Segmentation Michal

Enterprise Application Integration Building the European Biodiversity through Service-Oriented

C-major A Music Production Language The Ensemble Stephanie Huang Andrew OReilly Jonathan

MELODY TONG GE JINGSI LI SHUO YANG Music programming language .mc .csv .midi

Len Preston Chief, Labor Market Information New Jersey Department of Labor &amp; Workforce

1.3 EXCEL to New Heights PACE April 2017 Chad Carter And Bonnie Chisholm Why should I use

Project: IEEE P802.15 Working Group for Wireless Personal Area Networks ( etworks (WPANs WPANs)

Supervisor: Prof Robert W Stewart Dr Louise Crockett Outline Motivation and Objective

Len Preston Chief, Labor Market Information New Jersey Department of Labor & Workforce