ActEV18: Activities in Extended Video PIs: Afzal Godil, Jonathan - PowerPoint PPT Presentation

ActEV18: Activities in Extended Video PIs: Afzal Godil, Jonathan Fiscu cus Yooyoung g Lee Lee , David Joy, Andrew Delgado TRECVID 2018 Workshop November 13-15, 2018

Disclaimer er Certain commercial equipment, instruments, software, or materials are identified in this paper to specify the experimental procedure adequately. Such identification is not intended to imply recommendation or endorsement by NIST, nor necessarily the best available for the purpose. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, NIST, or the U.S. Government. 2/14/19 2/14/19 2 2

Ou Outline • ActEV Overview • Evaluation Framework • Tasks and Measures • ActEV18 Evaluations • ActEV18 Dataset • ActEV18 Results and Analyses • Next Steps 2/14/19 2/14/19 3 3

ActEV Overview 2/14/19 4 4

Wha What is s Ac ActEV? • ActEV (Activities in Extended Video) is an extension of TRECVID Surveillance Event Detection (SED) evaluations • Goal • To advance video analytics technology that can automatically detect a target activity and identify and track objects associated with the activity. • A series of challenges are also designed for: • Activity detection in a multi-camera environment • Temporal (and spatio-temporal) localization of the activity for reasoning 2/14/19 2/14/19 5 5

Wha What’s s New? w? (S (SED ED -> > Ac ActEV) • New activity-annotated and unannotated data for 4 years! • DARPA Video and Image Retrieval and Analysis Tool (VIRAT) data (16, 28 hrs) • Newly-collected DIVA data (Rough est. ~200 hrs, ~20K hrs) • New evaluation tasks • Activity Detection (AD) : similar to the retrospective SED task • Activity and Object Detection (AOD): activity + object detection • Activity and Object Detection and Tracking (AODT): activity + object detection + tracking • A series of evaluations rather than one per year • Blind: participants deliver system output (typical TRECVID) • Leader board: participants deliver many system output • Independent: participants deliver working systems for NIST to test on sequestered data 2/14/19 2/14/19 6 6

NI NIST, I IARPA, a and Ki Kitware • NIST developed the ActEV evaluation series to support the metrology needs of the Intelligence Advanced Research Projects Activity (IARPA) Deep Intermodal Video Analytics (DIVA) Program • The ActEV’s datasets collected and annotated by Kitware, Inc. 2/14/19 2/14/19 7 7

Evaluation Framework 2/14/19 2/14/19 8 8

Ev Evaluation Framework • Target applications • Retrospective analysis of archives (e.g., forensic analytics) • Real-time analysis of live video streams (e.g., alerting) • Evaluation Type • Self-reported evaluation • Independent (& sequestered) evaluation • Evaluation conditions • Activity-level (1.A phase evaluation) • Reference temporal segmentation • Leaderboard 2/14/19 2/14/19 9 9

Tasks and Measures (AD, AOD, AODT) 2/14/19 2/14/19 10 10

Ev Evaluation Tasks (AD) • Activity Detection (AD) • Given a target activity, a system automatically 1) detects its presence and then temporally localizes all instances of the activity in video sequences • The system output includes: • Start and end frames indicating the temporal location of the target activity • A presence confidence score that indicates how likely the activity occurred 2/14/19 2/14/19 11 11

Ev Evaluation Tasks (AOD) • Activity and Object Detection (AOD) • A system not only 1) detects/localizes the target activity, but also 2) detects the presence of required objects and spatially localizes the objects that are associated with the activity • The system output includes: • Start and end frames indicating the temporal location of the target activity • A presence confidence score that indicates how likely the activity occurred • Coordinates of object bounding boxes and object presence confidence scores • Scoring protocol: AOD_AD and AOD_AOD. 2/14/19 2/14/19 12 12

Evaluation Tasks (AODT) T) • Activity Object Detection/Tracking (AODT) • A system 1) correctly detects/localizes the target activity, 2) correctly detects/localizes the required objects in that activity, and 3) correctly tracks those objects over time. • The AODT task is NOT addressed in ActEV18 evaluations 2/14/19 2/14/19 13 13

Pe Performance Measures (AD) • Primary metrics • J. Fiscus, “TRECVID Surveillance Event Detection Evaluation.” https://www.nist.gov/itl/iad/mig/trecvid- 2017-evaluation-surveillance-event-detection • Secondary metrics • K. Bernardin and R. Stiefelhagen, “Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics,” EURASIP J. Image Video Process. , vol. 2008 2/14/19 2/14/19 14 14

Primary: Act ctivity Occu ccurrence ce Detect ction Reference System Output (Instances) (Instances) Step1: Instance Alignment ) *+ (&) ! "#$$ (&) = ) ,-./01$2314/ Step2: Confusion Matrix Computation ) 67 (&) 5 67 (&) = 89:;<=>?@AB9A>C;D Step3 : Summary Performance Metrics • ! "#$$ at 5 67 = 0.15 Step4: Result Visualization • Further details in “ActEV 2018 Evaluation Plan”, https://actev.nist.gov/ 2/14/19 2/14/19 15 15

Se Secon ondary: T : Temp mpor oral Loc Localization on • N_MIDE (Normalized Multiple Instance Detection Error) 34 # 89 # * +,--./ (1 "$ ∗ 34 # + 14 # + 1 67 ∗ 4:; < − 34 # + 14 # + !> # ) ! "#$% = ' ! @ABBCD #() Further detail in “ActEV 2018 Evaluation Plan”, https://actev.nist.gov/ 2/14/19 2/14/19 16 16

Pe Performance Measures (AOD) • Primary • Similar to AD, however, instance alignment step uses an additional term for the object detection congruence • Secondary • N_MODE (Normalized Multiple Object Detection Error) , -./012 3 45 ∗"$ 7 & 83 9: ∗;< 7 & ! "#$% & = ∑ )*+ ?-./012 , @ 7 ∑ 7=> • The minimum N_MODE value (minMODE) is calculated for object detection performance • 1-minMODE is used for the object detection congruence term 2/14/19 2/14/19 17 17

Performance Measures (AODT) T) • Primary • Similar to AD, however, instance alignment step uses an additional term for the object tracking congruence • Secondary • MOTE (Multiple Object Tracking Error) + ,-./01 2 34 ∗ !6 ( % + 2 89 ∗ :; ( % + 2 <4 ∗ =>?@ABCDE B (% !"#$ % = ' + ,-./01 H I ( ∑ ()* ()* • The minimum MOTE value (minMOTE) is calculated for object tracking performance • 1-minMOTE is used for the tracking congruence term 2/14/19 2/14/19 18 18

ActEV18 Evaluations 2/14/19 19 19

Act ctEV18 Evaluations are focu cusing on • The AD and AOD tasks only • Retrospective analysis applications in mind • The single camera view and at the activity observation level • Self-reported evaluation only • A series of the evaluations: • Activity-level • Reference temporal segmentation (RefSeg) • Leaderboard 2/14/19 2/14/19 20 20

ActEV18 Dataset 2/14/19 21 21

Ac Activities and Number of Instances VIRAT V1 dataset Additional 7 activities for leaderboard 12 activities for activity-level/RefSeg Activity Type Train Validation Activity Type Train Validation Closing 126 132 Interacts 88 101 Closing_trunk 31 21 Pull 21 22 Entering 70 71 Riding 21 22 Exiting 72 65 Talking 67 41 Loading 38 37 Activity_carrying 364 237 Open_Trunk 35 22 Specialized_talking_phone 16 17 Opening 125 127 Specialized_texting_phone 20 5 Transport_HeavyCarry 45 31 Due to ongoing evaluations, the Unloading 44 32 Vehicle_turning_left 152 133 test sets are not included in the Vehicle_turning_right 165 137 table Vehicle_u_turn 13 8 2/14/19 2/14/19 22 22

ActEV18 Results and Analyses 2/14/19 23 23

ActEV1 V18 Activity-Le Level E Evaluation on • 15 Participants from the academic and industrial sectors • AD • 20 systems from 13 teams (including baseline) • Activity Detection (Primary): • ! "#$$ at % &' = 0.15, ! "#$$ at % &' = 1 • Temporal Localization (Secondary): • . /012 at % &' = 0.15, . /012 at % &' = 1 • AOD • 16 systems from 11 teams • Two scoring protocols Detection Error Tradeoff (DET) curve • AOD_AD: the same with the AD task • AOD_AOD: In addition to the AD metrics, 3456789! "#$$ at % &' = 0.5 is used for object detection 2/14/19 2/14/19 24 24

ActEV18: Activities in Extended Video PIs: Afzal Godil, Jonathan - PowerPoint PPT Presentation

ActEV18: Activities in Extended Video PIs: Afzal Godil, Jonathan Fiscu cus Yooyoung g Lee Lee , David Joy, Andrew Delgado TRECVID 2018 Workshop November 13-15, 2018 Disclaimer er Certain commercial equipment, instruments, software, or

Extended Project Qualification Introduction What is an Extended Project? What does an

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made

NVIDIA VIDEO TECHNOLOGIES Abhijit Patait, 3/20/2019 NVIDIA Video Technologies Overview Turing

NVIDIA VIDEO TECHNOLOGIES Abhijit Patait, 3/26/2018 NVIDIA Video Technologies Overview Video

Video Sur Video Sur rveillance, rveillance, , Video Analyti Video Analyti ics, and You.

Extended Day Program March 20, 2014 Bobby Kaplow, Director, Extended Day Ben Harris, Assistant

Extended Binary Trees Recurrence relations Today: Extended Binary Trees (basis for much of

HD Video Activities in Korea HD Video Activities in Korea JongWon Kim, Ph.D. APAN HDTV WG

EUREKA ACTIVITIES DEPARTMENT Jason Green- Activities Director Cindy Hirsch- Activities

Sharing Your Story Through Online Video SHARING YOUR STORY THROUGH VIDEO Agenda 1 The power of

Learning from Unlabeled Video Carl Vondrick Columbia University Survivor Bias of Video Data

Image and Video Coding: Introduction bitstream encoder decoder Motivation Image and Video

091031 091031 VIDEO SIGNALS VIDEO SIGNALS Lecturer: Marco Marcon 091032 - AUDIO AND VIDEO

7. Video databases Video data representations Video = time-ordered sequence of correlated

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding

HELPFUL TIPS WHEN MAKING A KICKSTARTER VIDEO KICKSTARTER VIDEO KICKSTARTER VIDEO KICKSTARTER

Annual Commission Meeting Program Briefing & Progress Report Se pt e mbe r 2 5 , 2 0 1 8

How to Use Inbound Marke keting to Grow in a Hyper-Co Competit itiv ive Indust stry By

LEAR CONFERENCE 2015 ROME, June 26 PCW value for consumer? One stop

Goal-Oriented RE for e-services Zlatko Zlatev, Pascal van Eck, Roel Wieringa, and Jaap Gordijn 1

In todays logistics, only Synfioos predictions can help us reduce uncertainty about

OSRAA OUTREACH AND EDUCATION JULY 2016 Todays Agenda Whats new on the OSRAA Website

Calls-to-Action that Fail: The most common causes for why CTAs fail (and how you can achieve

Fourth Quarter Review 14 / November / 2013 Forward-Looking Statements / Safe Harbor This

Sambuz

Useful Links

Newsletter

Mail Us

ActEV18: Activities in Extended Video PIs: Afzal Godil, Jonathan - PowerPoint PPT Presentation

ActEV18: Activities in Extended Video PIs: Afzal Godil, Jonathan Fiscu cus Yooyoung g Lee Lee , David Joy, Andrew Delgado TRECVID 2018 Workshop November 13-15, 2018 Disclaimer er Certain commercial equipment, instruments, software, or

Extended Project Qualification Introduction What is an Extended Project? What does an

Video Games Written and Researched by: Patrick Kania First Video Game The first Video Game made

NVIDIA VIDEO TECHNOLOGIES Abhijit Patait, 3/20/2019 NVIDIA Video Technologies Overview Turing

NVIDIA VIDEO TECHNOLOGIES Abhijit Patait, 3/26/2018 NVIDIA Video Technologies Overview Video

Video Sur Video Sur rveillance, rveillance, , Video Analyti Video Analyti ics, and You.

Extended Day Program March 20, 2014 Bobby Kaplow, Director, Extended Day Ben Harris, Assistant

Extended Binary Trees Recurrence relations Today: Extended Binary Trees (basis for much of

HD Video Activities in Korea HD Video Activities in Korea JongWon Kim, Ph.D. APAN HDTV WG

EUREKA ACTIVITIES DEPARTMENT Jason Green- Activities Director Cindy Hirsch- Activities

Sharing Your Story Through Online Video SHARING YOUR STORY THROUGH VIDEO Agenda 1 The power of

Learning from Unlabeled Video Carl Vondrick Columbia University Survivor Bias of Video Data

Image and Video Coding: Introduction bitstream encoder decoder Motivation Image and Video

091031 091031 VIDEO SIGNALS VIDEO SIGNALS Lecturer: Marco Marcon 091032 - AUDIO AND VIDEO

7. Video databases Video data representations Video = time-ordered sequence of correlated

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding

HELPFUL TIPS WHEN MAKING A KICKSTARTER VIDEO KICKSTARTER VIDEO KICKSTARTER VIDEO KICKSTARTER

Annual Commission Meeting Program Briefing &amp; Progress Report Se pt e mbe r 2 5 , 2 0 1 8

How to Use Inbound Marke keting to Grow in a Hyper-Co Competit itiv ive Indust stry By

LEAR CONFERENCE 2015 ROME, June 26 PCW value for consumer? One stop

Goal-Oriented RE for e-services Zlatko Zlatev, Pascal van Eck, Roel Wieringa, and Jaap Gordijn 1

In todays logistics, only Synfioos predictions can help us reduce uncertainty about

OSRAA OUTREACH AND EDUCATION JULY 2016 Todays Agenda Whats new on the OSRAA Website

Calls-to-Action that Fail: The most common causes for why CTAs fail (and how you can achieve

Fourth Quarter Review 14 / November / 2013 Forward-Looking Statements / Safe Harbor This

Sambuz

Useful Links

Newsletter

Mail Us

Annual Commission Meeting Program Briefing & Progress Report Se pt e mbe r 2 5 , 2 0 1 8