Co Compute mputer r Vision 3: Detection, , Segmentation and - PowerPoint PPT Presentation

Co Compute mputer r Vision 3: Detection, , Segmentation and and Tr Trac acking king CV3DST | Prof. Leal-Taixé 1

The The Te Team Lecturers Prof. Dr. Laura Dr. Aljosa Leal-Taixé Osep CV3DST | Prof. Leal-Taixé 2

Wh What at this cou ourse e is: • A course on Computer Vision – Object detection – Instance and semantic segmentation – Multiple object tracking in 2D and 3D • Other CV courses: – Computer Vision 2: Multiple View Geometry (WS) CV3DST | Prof. Leal-Taixé 3

Wh What at this cou ourse e is NOT: • An Introduction to Deep Learning – Take “Introduction to Deep Learning” if you are not familiar with basic DL concepts • A practical project course – Take “Advanced Deep Learning for Computer Vision” • A theoretical introduction into 3D Vision – Take “Computer Vision 2: Multiple View Geometry (WS)” CV3DST | Prof. Leal-Taixé 4

Wh What at is Com omputer er Vi Vision on? First defined in the 60s in artificial intelligence groups • “Mimic the human visual system” • Center block of robotic intelligence • CV3DST | Prof. Leal-Taixé 5

CV3DST | Prof. Leal-Taixé 6

Comp Computer er Vis ision ion Give eyes to a computer CV3DST | Prof. Leal-Taixé 9

Comp Computer er Vis ision ion Understand every pixel of an image CV3DST | Prof. Leal-Taixé 10

Comp Computer er Vis ision ion Understand every pixel of an image tree car person Semantic segmentation road CV3DST | Prof. Leal-Taixé 11

Comp Computer er Vis ision ion Understand every pixel of an image tree person 2 car Instance- based segmentation Semantic person 1 segmentation person 3 road CV3DST | Prof. Leal-Taixé 12

Comp Computer er Vis ision ion Understand every pixel of a video Multiple object tracking Instance- based segmentation Semantic segmentation CV3DST | Prof. Leal-Taixé 13

Dyn Dynamic c Sce Scene Understa tanding Understand every pixel of a video Multiple object tracking Instance- based segmentation Semantic segmentation CV3DST | Prof. Leal-Taixé 14

Au Auto tono nomous drivi ving ng CV3DST | Prof. Leal-Taixé 15

Au Auto tono nomous drivi ving ng CV3DST | Prof. Leal-Taixé 16

Un Underst standin ing a an ima image CV3DST | Prof. Leal-Taixé 17 Credit: Li/Karpathy/Johnson

Un Underst standin ing a an ima image K. He, G. Gkioxari, P. Dollar, R. Girshick. Mask R-CNN. ICCV 2017. CV3DST | Prof. Leal-Taixé 18

Un Underst standin ing a an ima image CV3DST | Prof. Leal-Taixé 19

Un Underst standin ing a an ima image • Different representations depending on the granularity – Detections (coarse) – Segmentations (precise) – Semantic with/without instances (person 1, person 2) • Goes well with Deep Learning CV3DST | Prof. Leal-Taixé 22

Un Underst standin ing a an v vid ideo • Temporal domain which brings us advantages – A lot of redundancy – A smoothness assumption: things do not change much from one frame to another • … but also disadvantages – At 30 FPS, image the computation one has to do to process a video…. – Occlusions, multiple objects moving and interacting… CV3DST | Prof. Leal-Taixé 23

Un Underst standin ing a an v vid ideo: t : then CV3DST | Prof. Leal-Taixé 24

Un Underst standin ing a an v vid ideo: n : now CV3DST | Prof. Leal-Taixé 25

Un Underst standin ing a an v vid ideo • Where is every object going? • How are objects interacting? • Get consistent results in the temporal dimension CV3DST | Prof. Leal-Taixé 26

Rou Rough schedu edule/c e/con onten ent • 1. Introduction • 2. Object Detection 1 • 3. Object Detection 2 • 4. Single/Multiple object tracking • 5. Multiple object tracking • 6. Trajectory prediction • 7. Semantic segmentation • 8. Instance Segmentation • 9. Video object segmentation • 10. Going towards 3D tracking and segmentation CV3DST | Prof. Leal-Taixé 27

Rou Rough schedu edule/c e/con onten ent • RCNN, Fast RCNN and Faster RCNN • YOLO, SSD, RetinaNet • Siamese networks – Person Re-Identification • Message Passing Networks • Network (non-neural) flow for tracking • Generative Adversarial Networks – trajectory prediction Mask-RCNN, UPSNet (panoptic segmentation) • Deformable/atrous convolutions • 3D – data, algorithms. • CV3DST | Prof. Leal-Taixé 28

Our Our Research rch Lab Dynamic Vision and Learning Group https://dvl.in.tum.de/ CV3DST | Prof. Leal-Taixé 29

Ab About t the the lectu ture Theory: 10-11 lectures • Every Wednesday 16:00-18:00 (MI HS 2) • Lectures will be recorded this year! • – Due to the virus situation, we will make all slides and video recordings available every Wednesday https://dvl.in.tum.de/teaching/cv3dst-ss20/ CV3DST | Prof. Leal-Taixé 30

Gra Grading syst g system Exam: tbd tbd • There will be a retake exam as this course will be • moved permanently to Summer Semester Completing the practical part successfully gives a • bonus of 0.3 CV3DST | Prof. Leal-Taixé 31

Pr Pract ctica cal l pa part • Internal Kaggle competition • We will have a tracking challenge. • You will all start from the same point (code) • After that, it will be an open competition. • Check out the video of the presentation of the challenge! CV3DST | Prof. Leal-Taixé 32

Moodl Moodle • Announcements via Moodle - IMPORT PORTANT ANT! – Sign up in TUM online for access: https://www.moodle.tum.de/ – We will share common information (e.g., regarding exam) – Ask content questions online so others benefit – Don’t post solutions CV3DST | Prof. Leal-Taixé 33

Ema Email ils & Slid ides es • All material will be uploaded on Moodle and the web • Questions regarding the syllabus, exercises or contents of the lecture, use Moodle! • Questions regarding organization of the course: dst@dvl.in.tum.de • Emails to the individual addresses will not be answered. CV3DST | Prof. Leal-Taixé 34

Co Compute mputer r Vision 3: Detection, , Segmentation and and Tr Trac acking king CV3DST | Prof. Leal-Taixé 40

Co Compute mputer r Vision 3: Detection, , Segmentation and - PowerPoint PPT Presentation

Co Compute mputer r Vision 3: Detection, , Segmentation and and Tr Trac acking king CV3DST | Prof. Leal-Taix 1 The The Te Team Lecturers Prof. Dr. Laura Dr. Aljosa Leal-Taix Osep CV3DST | Prof. Leal-Taix 2 Wh What at

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Detection and Segmentation CS60010: Deep Learning Abir Das IIT Kharagpur March 04 and 05, 2020

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Detection and Segmentation of Detection and Segmentation of Touching Characters in Touching

Semantic segmentation Image classification Object detection Semantic segmentation Evolution

Detection, Segmentation Overview Object Detection deer cat Object Detection as Classification

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Co-Segmentation of 3D Shapes via Subspace Clustering Ruizhen Hu Lubin Fan

Introduction to RFM segmentation Karolis Urbonas Head of Data Science, Amazon DataCamp

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

1 1 easy to compute , 1 easy to compute 2

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

Introduction to Particle Physics Masterclasses Konrad Jende, High School Teacher Programme 2014 1

I. Introduction Highly parallel Graphics computing GPU (Parallel computing) 4 cores 512

CS 488 Computer Graphics 1 Introduction to Real-Time Rendering Dr. Angus Forbes, Instructor Kyle

M4 meeting Sept 2004 Steve Renals University of Edinburgh Agenda - Thursday 0900-0930 -

Virtual Address Translation via Learned Page Table Indexes Artemiy Margaritovy Dmitrii Ustiugovz

Two-photon experiments with detector MD1 at VEPP-4 Valery Telnov Budker INP and Novosibirsk St.

-ray detector CATANA for E1 response of neutron-rich nuclei Tokyo Institute of Technology