Computer Vision 3: Detection, Segmentation and Tracking


  1. Computer Vision 3: Detection, Segmentation and Tracking CV3DST | Prof. Leal-Taixé

  2. The Team • Lecturers: Prof. Dr. Laura Leal-Taixé, Dr. Aljosa Osep

  3. What this course is: • A course on Computer Vision – Object detection – Instance and semantic segmentation – Multiple object tracking in 2D and 3D • Other CV courses: – Computer Vision 2: Multiple View Geometry (WS)

  4. What this course is NOT: • An Introduction to Deep Learning – Take “Introduction to Deep Learning” if you are not familiar with basic DL concepts • A practical project course – Take “Advanced Deep Learning for Computer Vision” • A theoretical introduction into 3D Vision – Take “Computer Vision 2: Multiple View Geometry (WS)”

  5. What is Computer Vision? • First defined in the 60s in artificial intelligence groups • “Mimic the human visual system” • Center block of robotic intelligence

  6. (Image-only slide)

  7. Computer Vision: Give eyes to a computer

  8. Computer Vision: Understand every pixel of an image

  9. Computer Vision: Understand every pixel of an image • Semantic segmentation (labels: tree, car, person, road)

  10. Computer Vision: Understand every pixel of an image • Semantic segmentation (labels: tree, car, road) • Instance-based segmentation (labels: person 1, person 2, person 3)
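
The difference between the two labelings on this slide can be made concrete with a toy example. The sketch below is only an illustration: the 3x6 label grids, the convention that instance id 0 means "no countable object", and the function names are all assumptions, not material from the lecture. It shows that a semantic map alone gives pixels per class, while the added instance map makes it possible to count individual people.

```python
from collections import Counter

# Hypothetical 3x6 label grids (values invented for illustration).
# Semantic map: one class name per pixel.
semantic = [
    ["tree",   "tree",   "road",   "road", "road", "road"],
    ["person", "person", "person", "car",  "car",  "road"],
    ["person", "person", "person", "car",  "car",  "road"],
]
# Instance map: an id per pixel; 0 is assumed to mean "not a countable object".
instance = [
    [0, 0, 0, 0, 0, 0],
    [1, 2, 3, 1, 1, 0],
    [1, 2, 3, 1, 1, 0],
]


def pixel_counts(semantic_map):
    """Pixels per class: all that a semantic map alone can tell us."""
    return Counter(cls for row in semantic_map for cls in row)


def count_instances(semantic_map, instance_map):
    """Distinct objects per class, using (class, instance id) pairs."""
    seen = set()
    for sem_row, inst_row in zip(semantic_map, instance_map):
        for cls, inst_id in zip(sem_row, inst_row):
            if inst_id != 0:
                seen.add((cls, inst_id))
    return Counter(cls for cls, _ in seen)


if __name__ == "__main__":
    print(pixel_counts(semantic))               # pixels labeled "person", "car", ...
    print(count_instances(semantic, instance))  # how many persons, how many cars
```

With these toy grids, the semantic map reports six "person" pixels but cannot say whether they belong to one person or several; the instance map resolves them into three.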

  11. Computer Vision: Understand every pixel of a video • Multiple object tracking • Instance-based segmentation • Semantic segmentation

  12. Dynamic Scene Understanding: Understand every pixel of a video • Multiple object tracking • Instance-based segmentation • Semantic segmentation

  13. Autonomous driving

  14. Autonomous driving

  15. Understanding an image (Credit: Li/Karpathy/Johnson)

  16. Understanding an image (K. He, G. Gkioxari, P. Dollár, R. Girshick. Mask R-CNN. ICCV 2017.)

  17. Understanding an image

  18. Understanding an image

  19. Understanding an image

  20. Understanding an image • Different representations depending on the granularity – Detections (coarse) – Segmentations (precise) – Semantic with/without instances (person 1, person 2) • Goes well with Deep Learning
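
To illustrate why a detection is called coarse and a segmentation precise, the following sketch derives the tight bounding box of a binary mask and compares the two areas. It is a minimal illustration under stated assumptions: the 4x5 mask is invented, and `mask_to_box` and `box_area` are hypothetical helper names, not part of any library or of the course code.

```python
def mask_to_box(mask):
    """Return (x_min, y_min, x_max, y_max) of the tight box around all
    truthy pixels of a 2D mask, or None if the mask is empty."""
    coords = [
        (x, y)
        for y, row in enumerate(mask)
        for x, value in enumerate(row)
        if value
    ]
    if not coords:
        return None
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return min(xs), min(ys), max(xs), max(ys)


def box_area(box):
    """Number of pixels covered by an inclusive pixel-coordinate box."""
    x_min, y_min, x_max, y_max = box
    return (x_max - x_min + 1) * (y_max - y_min + 1)


# Hypothetical plus-shaped object (values invented for illustration).
mask = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 0, 0],
]

if __name__ == "__main__":
    box = mask_to_box(mask)
    mask_area = sum(sum(row) for row in mask)
    print(box, box_area(box), mask_area)
```

For this toy mask the box covers 9 pixels while the object itself covers only 5, so the detection includes background that the segmentation excludes.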

  21. Understanding a video • The temporal domain brings us advantages – A lot of redundancy – A smoothness assumption: things do not change much from one frame to another • … but also disadvantages – At 30 FPS, imagine the computation one has to do to process a video… – Occlusions, multiple objects moving and interacting…
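
One common way to exploit the smoothness assumption is to link detections across consecutive frames by box overlap: if things move little between frames, the same object's boxes should overlap strongly. The sketch below uses greedy intersection-over-union (IoU) matching as an illustration; it is not presented as the method taught in the course. The box coordinates, the `min_iou` threshold of 0.3, and the function names are assumptions.

```python
def area(box):
    """Area of a box given as (x1, y1, x2, y2) with x2 >= x1 and y2 >= y1."""
    return (box[2] - box[0]) * (box[3] - box[1])


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = area(a) + area(b) - inter
    return inter / union if union > 0 else 0.0


def associate(prev_boxes, curr_boxes, min_iou=0.3):
    """Greedily match boxes of the previous frame to the current frame,
    taking the highest-IoU pairs first. min_iou is an assumed threshold.
    Returns a dict {index in prev_boxes: index in curr_boxes}."""
    pairs = sorted(
        (
            (iou(p, c), i, j)
            for i, p in enumerate(prev_boxes)
            for j, c in enumerate(curr_boxes)
        ),
        reverse=True,
    )
    matches, used_curr = {}, set()
    for score, i, j in pairs:
        if score < min_iou:
            break  # remaining pairs overlap even less
        if i in matches or j in used_curr:
            continue
        matches[i] = j
        used_curr.add(j)
    return matches


# Hypothetical detections in two consecutive frames (invented values).
prev_boxes = [(10, 10, 50, 90), (100, 20, 140, 100)]
curr_boxes = [(103, 21, 143, 101), (12, 10, 52, 90)]

if __name__ == "__main__":
    print(associate(prev_boxes, curr_boxes))
```

Here each box moved only a few pixels, so the first previous box is matched to the second current box and vice versa. The disadvantages listed on the slide are exactly where such a simple scheme tends to fail: occlusions and interacting objects break the overlap cue.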

  22. Understanding a video: then

  23. Understanding a video: now

  24. Understanding a video • Where is every object going? • How are objects interacting? • Get consistent results in the temporal dimension

  25. Rough schedule/content • 1. Introduction • 2. Object Detection 1 • 3. Object Detection 2 • 4. Single/Multiple object tracking • 5. Multiple object tracking • 6. Trajectory prediction • 7. Semantic segmentation • 8. Instance Segmentation • 9. Video object segmentation • 10. Going towards 3D tracking and segmentation

  26. Rough schedule/content • RCNN, Fast RCNN and Faster RCNN • YOLO, SSD, RetinaNet • Siamese networks – Person Re-Identification • Message Passing Networks • Network (non-neural) flow for tracking • Generative Adversarial Networks – trajectory prediction • Mask-RCNN, UPSNet (panoptic segmentation) • Deformable/atrous convolutions • 3D – data, algorithms.

  27. Our Research Lab: Dynamic Vision and Learning Group https://dvl.in.tum.de/

  28. About the lecture • Theory: 10-11 lectures • Every Wednesday 16:00-18:00 (MI HS 2) • Lectures will be recorded this year! – Due to the virus situation, we will make all slides and video recordings available every Wednesday https://dvl.in.tum.de/teaching/cv3dst-ss20/

  29. Grading system • Exam: tbd • There will be a retake exam as this course will be moved permanently to Summer Semester • Completing the practical part successfully gives a bonus of 0.3

  30. Practical part • Internal Kaggle competition • We will have a tracking challenge. • You will all start from the same point (code) • After that, it will be an open competition. • Check out the video of the presentation of the challenge!

  31. Moodle • Announcements via Moodle - IMPORTANT! – Sign up in TUM online for access: https://www.moodle.tum.de/ – We will share common information (e.g., regarding exam) – Ask content questions online so others benefit – Don’t post solutions

  32. Emails & Slides • All material will be uploaded on Moodle and the web • For questions regarding the syllabus, exercises or contents of the lecture, use Moodle! • For questions regarding organization of the course: dst@dvl.in.tum.de • Emails to the individual addresses will not be answered.

  33. Computer Vision 3: Detection, Segmentation and Tracking
