SLIDE 1

Lecture 2: Applications of CNNs

Lin ZHANG, PhD
School of Software Engineering, Tongji University
Fall 2017

SLIDE 2

Outline

  • Vision‐based Parking‐slot Detection
  • Human‐body Keypoint Detection
SLIDE 3

Outline

  • Vision‐based Parking‐slot Detection
  • Background Introduction
  • General Flowchart
  • Surround‐view Synthesis
  • Parking‐slot Detection from Surround‐view
  • Experiments
  • Semantic Segmentation
  • Human‐body Keypoint Detection
SLIDE 4

Background Introduction

  • Collaborative Innovation Center of Intelligent New-Energy Vehicles, Tongji University (National "2011 Plan")
SLIDE 5

Background Introduction—ADAS Architecture

[ADAS architecture diagram] Sensors: millimetre-wave radar + front-view camera + surround-view cameras; central decision controller.
  • Environment perception system: multi-source sensor fusion, lane-line detection, vehicle and pedestrian detection, traffic-sign detection, parking-slot line detection
  • Central decision system: lane keeping, automatic parking, forward collision avoidance, lane-change assist
  • Low-level control system: drive/brake control, steering control, gear control, body control

SLIDE 6

Background Introduction

  • Parking is one of the most embarrassing and difficult tasks for many drivers
  • It is a challenge for a novice driver to park a car in a limited space

Automatic parking systems are therefore a hot research topic in the ADAS field

SLIDE 7

Background Introduction—ADAS Architecture

How to detect a parking‐slot and return its position with respect to the vehicle coordinate system?

SLIDE 8

Different Ways to Locate a Parking‐slot

  • Infrastructure‐based solutions
  • Need support from the parking site
  • Usually, the vehicle needs to communicate with the infrastructure
SLIDE 9

Different Ways to Locate a Parking‐slot

  • Infrastructure‐based solutions
  • On‐vehicle‐sensor based solutions
  • Parking‐vacancy detection
  • Ultrasonic radar
  • Stereo‐vision
  • Depth camera
SLIDE 10

Different Ways to Locate a Parking‐slot

  • Infrastructure‐based solutions
  • On‐vehicle‐sensor based solutions
  • Parking‐vacancy detection
  • Parking‐slot (defined by lines, vision‐based) detection (our focus)
SLIDE 11

Research Gaps and Our Contributions

  • Research gaps
  • There is no publicly available dataset in this area
  • All the existing methods are based on low‐level vision primitives (edges, corners, lines); there is large room for performance improvement
  • Our contributions
  • Constructed a large‐scale labeled surround‐view image dataset
  • Introduced machine learning into this field
  • Developed a real system that has been deployed on the SAIC Roewe E50

SLIDE 12

Outline

  • Vision‐based Parking‐slot Detection
  • Background Introduction
  • General Flowchart
  • Surround‐view Synthesis
  • Parking‐slot Detection from Surround‐view
  • Experiments
  • Human‐body Keypoint Detection
SLIDE 13

General Flowchart

[Flowchart] front cam / left cam / back cam / right cam → front, left, back, and right views → surround-view generation → surround view → parking-slot detection → parking-slot positions → decision module (sends parking-slot info)

Overall flowchart of the vision‐based parking‐slot detection system

SLIDE 14

Outline

  • Vision‐based Parking‐slot Detection
  • Background Introduction
  • General Flowchart
  • Surround‐view Synthesis
  • Parking‐slot Detection from Surround‐view
  • Experiments
  • Human‐body Keypoint Detection
SLIDE 15

Surround‐view Synthesis

  • A surround-view camera system is an important ADAS technology that lets the driver see a top-down view of the 360° surroundings of the vehicle
  • Such a system normally consists of 4~6 wide-angle (fish-eye lens) cameras mounted around the vehicle, each facing a different direction

SLIDE 16

Surround‐view Synthesis

  • The surround-view is composed of the four bird's-eye views (front, left, back, and right)
  • To get a bird's-eye view, the essence is to generate a look-up table mapping each point of the bird's-eye view to a point on the fish-eye image (a composition sketch follows after this list)
  • Decide the similarity transformation matrix P_{B→W}, mapping a point from the bird's-eye-view coordinate system to the world coordinate system
  • Decide the projective transformation matrix P_{W→U}, mapping a point from the world coordinate system to the undistorted image coordinate system
  • Decide the look-up table T_{U→F}, mapping a point from the undistorted image coordinate system to the fish-eye image coordinate system
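As a rough illustration of how these three mappings chain together, the sketch below precomputes, for every bird's-eye-view pixel, the fish-eye pixel it should be sampled from. It is a minimal sketch only: the matrices `P_BW` and `P_WU` and the per-pixel table `undist_to_fisheye_lut` are assumed to be available from calibration, and all names are hypothetical.

```python
import numpy as np

def build_birdseye_lut(P_BW, P_WU, undist_to_fisheye_lut, h=600, w=600):
    """Compose bird's-eye -> world -> undistorted -> fish-eye into one look-up table.

    P_BW: 3x3 similarity matrix (bird's-eye pixel, homogeneous -> world ground plane)
    P_WU: 3x3 homography (world ground plane -> undistorted image)
    undist_to_fisheye_lut: (H_u, W_u, 2) array, undistorted pixel -> fish-eye pixel
    Returns an (h, w, 2) array giving, for each bird's-eye pixel, the fish-eye pixel to sample.
    """
    ys, xs = np.mgrid[0:h, 0:w]
    pts_B = np.stack([xs, ys, np.ones_like(xs)], axis=-1).reshape(-1, 3).T   # 3 x (h*w)

    pts_U = P_WU @ (P_BW @ pts_B)                        # chain the two planar mappings
    pts_U = (pts_U[:2] / pts_U[2]).T.reshape(h, w, 2)    # back to pixel coordinates

    # Round to the nearest undistorted pixel, then read off its fish-eye source pixel
    u = np.clip(np.rint(pts_U[..., 0]).astype(int), 0, undist_to_fisheye_lut.shape[1] - 1)
    v = np.clip(np.rint(pts_U[..., 1]).astype(int), 0, undist_to_fisheye_lut.shape[0] - 1)
    return undist_to_fisheye_lut[v, u]
```

At run time each bird's-eye pixel is then filled by sampling the fish-eye frame at the stored coordinates (e.g. with cv2.remap), so the expensive geometry is computed only once.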

SLIDE 17

Surround‐view Synthesis

  • Process to get the bird's-eye view

[Diagram] Bird's-eye-view image CS → (similarity matrix P_{B→W}) → world CS → (homography matrix P_{W→U}) → undistorted image CS → (mapping look-up table T_{U→F}) → fish-eye image. Composing the three gives a single look-up table T_{B→F} that maps x_B directly to x_F.

SLIDE 18

Surround‐view Synthesis

  • Process to get the bird's-eye view
  • The distortion coefficients of a fish-eye camera, and hence the mapping look-up table T_{U→F}, can be determined by the calibration routines provided in OpenCV 3.0 (a minimal sketch follows)

[Figure] A fish-eye image and its undistorted version
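A minimal sketch of those OpenCV fisheye routines, assuming the intrinsics K and distortion coefficients D have already been obtained with cv2.fisheye.calibrate (the numeric values below are placeholders, not the lecture's calibration results):

```python
import cv2
import numpy as np

# Placeholder intrinsics / distortion; in practice these come from cv2.fisheye.calibrate
K = np.array([[420.0, 0.0, 640.0],
              [0.0, 420.0, 480.0],
              [0.0,   0.0,   1.0]])
D = np.array([[-0.05], [0.01], [0.0], [0.0]])        # k1..k4 fish-eye coefficients

fisheye_img = np.zeros((960, 1280, 3), np.uint8)      # stand-in for a camera frame
h, w = fisheye_img.shape[:2]

# Per-pixel maps from the undistorted image back to the fish-eye image (the T_{U->F} table)
map1, map2 = cv2.fisheye.initUndistortRectifyMap(K, D, np.eye(3), K, (w, h), cv2.CV_32FC1)
undistorted = cv2.remap(fisheye_img, map1, map2, interpolation=cv2.INTER_LINEAR)
```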

SLIDE 19

Surround‐view Synthesis

  • Process to get the bird's-eye view
  • Determine P_{W→U}
  • The physical ground plane (in the WCS) and the undistorted image plane can be linked via a homography matrix P_{W→U}: x_U = P_{W→U} x_W
  • If we know a set of correspondence pairs {(x_{U_i}, x_{W_i})}_{i=1}^{N}, P_{W→U} can be estimated using the least-squares method (see the sketch below)
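A minimal sketch of that estimation step; the correspondence points below are made up for illustration, and in practice come from a calibration pattern laid on the ground:

```python
import cv2
import numpy as np

# Hypothetical correspondences: ground-plane points (world CS, metres) and their
# pixel locations in the undistorted image
x_W = np.array([[0.0, 0.0], [2.0, 0.0], [2.0, 5.0], [0.0, 5.0]], dtype=np.float32)
x_U = np.array([[312, 604], [498, 610], [455, 221], [350, 218]], dtype=np.float32)

# Least-squares homography so that x_U ~ P_WU @ x_W (method=0 is plain least squares)
P_WU, _ = cv2.findHomography(x_W, x_U, method=0)

# Project a new world point into the undistorted image
p = P_WU @ np.array([1.0, 2.5, 1.0])
print(p[:2] / p[2])
```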

SLIDE 20

Surround‐view Synthesis

  • Process to get the bird's-eye view
  • Determine P_{W→U}

SLIDE 21

Surround‐view Synthesis

[Figure: panels (a)–(e) of the surround-view synthesis] The synthesized image is of size 600 × 600 pixels and corresponds to a 10 m × 10 m physical region
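Since 600 pixels span 10 m, one pixel corresponds to roughly 1.67 cm, and detected positions can be converted to metric coordinates with a simple scale. The sketch below assumes the vehicle sits at the image centre, which is a convention of this example rather than something stated on the slide:

```python
PX_TO_M = 10.0 / 600.0   # 10 m mapped onto 600 px -> about 1.67 cm per pixel

def pixel_to_vehicle(u, v, img_size=600):
    """Convert a surround-view pixel (u, v) to metric (x, y) in a vehicle-centred
    frame (hypothetical convention: vehicle at the image centre, x right, y forward)."""
    c = img_size / 2.0
    return (u - c) * PX_TO_M, (c - v) * PX_TO_M   # image v grows downward

print(pixel_to_vehicle(450, 150))   # -> (2.5, 2.5), i.e. 2.5 m right and 2.5 m ahead
```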

SLIDE 22

Surround‐view Synthesis

How to detect the parking‐slot given a surround‐view image?

SLIDE 23

Outline

  • Vision‐based Parking‐slot Detection
  • Background Introduction
  • General Flowchart
  • Surround‐view Synthesis
  • Parking‐slot Detection from Surround‐view
  • Experiments
  • Human‐body Keypoint Detection
SLIDE 24

Challenges

  • It is not an easy task due to the existence of
  • Various types of road textures
  • Various types of parking-slots
  • Illumination variation
  • Partially damaged parking-lines
  • Non-uniform shadow

These factors make it difficult for low-level-vision-based algorithms to succeed

SLIDE 25

Challenges

SLIDE 26

DeepPS: A DCNN‐based Approach

  • Motivation
  • A parking-slot can be located from its marking-points (labeled A, B, C, D in the slide figure), which suggests two sub-problems (see the pipeline sketch below):
  • Detect marking-points
  • Decide the validity of entrance-lines and their types (can be solved as a classification problem)

Both of these can be solved by DCNN-based techniques
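Put together, these two sub-problems suggest a two-stage inference pipeline. The sketch below only illustrates that structure; `detect_marking_points` and `classify_pair` are hypothetical stand-ins for the YoloV2 detector and the local-pattern classifier described on the following slides:

```python
from itertools import combinations

def detect_parking_slots(surround_view, detect_marking_points, classify_pair):
    """Two-stage sketch: detect marking-points, then classify every candidate pair."""
    points = detect_marking_points(surround_view)        # stage 1: DCNN detector
    slots = []
    for a, b in combinations(points, 2):                 # stage 2: pairwise classification
        pattern = classify_pair(surround_view, a, b)     # one of the local-pattern types
        if pattern != "invalid":                         # assume one class means "not an entrance-line"
            slots.append({"entrance": (a, b), "type": pattern})
    return slots
```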

SLIDE 27

DeepPS: A DCNN‐based Approach

  • Marking-point detection by using a DCNN-based framework
  • We adopt YoloV2 as the detection framework
  • R-CNN (Region-based Convolutional Neural Networks) (CVPR 2014)
  • SPPNet (Spatial Pyramid Pooling Network) (T-PAMI 2015)
  • Fast-RCNN (ICCV 2015)
  • Faster-RCNN (NIPS 2015)
  • Yolo (You Only Look Once) (CVPR 2016)
  • SSD (Single Shot Multibox Detector) (ECCV 2016)
  • YoloV2 (arXiv 2016)

YoloV2: accurate enough, and the fastest!

SLIDE 28

DeepPS: A DCNN‐based Approach

  • Marking-point detection by using a DCNN-based framework
  • We adopt YoloV2 as the detection framework
  • Manually mark the positions of marking-points and define fixed-size regions centered at the marking-points as "marking-point patterns"

SLIDE 29

DeepPS: A DCNN‐based Approach

  • Marking-point detection by using a DCNN-based framework
  • We adopt YoloV2 as the detection framework
  • Manually mark the positions of marking-points and define fixed-size regions centered at the marking-points as "marking-point patterns"
  • To make the detector rotation-invariant, we rotate the training images (and the associated labeling information) to augment the training dataset (a sketch follows)
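A minimal sketch of such rotation augmentation, rotating the image about its centre and applying the same transform to the labeled marking-point coordinates:

```python
import cv2
import numpy as np

def rotate_sample(image, points, angle_deg):
    """Rotate an image and its (N, 2) marking-point labels by the same angle."""
    h, w = image.shape[:2]
    M = cv2.getRotationMatrix2D((w / 2.0, h / 2.0), angle_deg, 1.0)   # 2x3 affine matrix
    rotated = cv2.warpAffine(image, M, (w, h))
    pts_h = np.hstack([points, np.ones((len(points), 1))])            # homogeneous coords
    return rotated, (M @ pts_h.T).T

# e.g. augment each training sample with rotations in fixed steps
img = np.zeros((600, 600, 3), np.uint8)
pts = np.array([[100.0, 200.0], [250.0, 300.0]])
augmented = [rotate_sample(img, pts, a) for a in range(0, 360, 30)]
```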

SLIDE 30

DeepPS: A DCNN‐based Approach

  • Given two marking-points A and B, classify the local pattern formed by A and B for two purposes
  • Judge whether "AB" is a valid entrance-line
  • If it is, decide the type of this entrance-line

[Figure] The local pattern formed by A and B is size-normalized to 48 × 192 (an extraction sketch follows)
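A minimal sketch of extracting that size-normalized pattern: rotate the image so that segment AB becomes horizontal, scale it so |AB| spans the output width, and crop a 48 × 192 strip centred on AB. The exact crop geometry used in DeepPS is not given on the slide, so this is an assumption:

```python
import cv2
import numpy as np

def extract_pattern(image, A, B, out_h=48, out_w=192):
    """Crop a size-normalized strip around segment AB (hypothetical geometry)."""
    A, B = np.float32(A), np.float32(B)
    mid = (A + B) / 2.0
    angle = float(np.degrees(np.arctan2(B[1] - A[1], B[0] - A[0])))
    scale = out_w / max(float(np.linalg.norm(B - A)), 1e-6)

    # Rotate about the midpoint so AB becomes horizontal, scaled so |AB| -> out_w
    M = cv2.getRotationMatrix2D((float(mid[0]), float(mid[1])), angle, scale)
    # Then translate the midpoint to the centre of the output patch
    M[0, 2] += out_w / 2.0 - mid[0]
    M[1, 2] += out_h / 2.0 - mid[1]
    return cv2.warpAffine(image, M, (out_w, out_h))

patch = extract_pattern(np.zeros((600, 600, 3), np.uint8), (100, 200), (250, 300))
```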

SLIDE 31

DeepPS: A DCNN‐based Approach

  • Given two marking-points A and B, classify the local pattern formed by A and B for two purposes
  • Judge whether "AB" is a valid entrance-line
  • If it is, decide the type of this entrance-line
  • We define 7 types of local patterns formed by two marking-points

[Figure] Typical samples of the 7 types of local patterns

SLIDE 32

DeepPS: A DCNN‐based Approach

  • To solve the local pattern classification problem, we design a DCNN model which is a simplified version of AlexNet
  • Since samples of slant parking-slots were quite rare, we use the SMOTE [1] strategy to create more virtual samples

Network structure (input: 48 × 192 image patch; a PyTorch sketch follows the list):
  • conv1 + ReLU: kernel [3 9], stride [1 3], output channels: 40
  • conv2 + ReLU: kernel [3 5], pad [2 0], output channels: 112
  • conv3 + ReLU: kernel [3 3], pad [1 1], output channels: 160
  • conv4 + ReLU: kernel [3 3], pad [1 1], output channels: 248
  • FC1 + ReLU + dropout: output 1024
  • FC2: output 7

[1] N.V. Chawla et al., SMOTE: Synthetic Minority Over-sampling Technique, J. Artificial Intelligence Research 16: 321-357, 2002
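A hedged PyTorch sketch of a classifier with the layer sizes listed above. The slide does not specify input channels, pooling layers, or the strides/pads it omits, so those are guesses here (pooling is inserted AlexNet-style), and the flattened feature size is inferred at run time rather than copied from the original model:

```python
import torch
import torch.nn as nn

class PatternClassifier(nn.Module):
    """Simplified-AlexNet-style classifier for the 7 local-pattern types (sketch)."""

    def __init__(self, num_classes=7, in_ch=3):          # in_ch=3 is an assumption
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_ch, 40, kernel_size=(3, 9), stride=(1, 3)), nn.ReLU(inplace=True),
            nn.MaxPool2d(2),                              # pooling placement is a guess
            nn.Conv2d(40, 112, kernel_size=(3, 5), padding=(2, 0)), nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
            nn.Conv2d(112, 160, kernel_size=(3, 3), padding=(1, 1)), nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
            nn.Conv2d(160, 248, kernel_size=(3, 3), padding=(1, 1)), nn.ReLU(inplace=True),
        )
        with torch.no_grad():                             # infer flatten size from a 48x192 dummy input
            n_feat = self.features(torch.zeros(1, in_ch, 48, 192)).flatten(1).shape[1]
        self.classifier = nn.Sequential(
            nn.Linear(n_feat, 1024), nn.ReLU(inplace=True), nn.Dropout(),
            nn.Linear(1024, num_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x).flatten(1))

logits = PatternClassifier()(torch.randn(2, 3, 48, 192))   # -> shape (2, 7)
```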
SLIDE 33

DeepPS: A DCNN‐based Approach

  • For a slant parking-slot, how do we obtain the angle between its entrance-line and its separating lines?
  • Prepare a set of templates {T_{θ_j}} with different angles θ_j
  • Extract the two patches I_A and I_B around A and B after the direction is normalized
  • Choose the angle by template matching: θ = argmax_{θ_j} (I_A * T_{θ_j} + I_B * T_{θ_j}), where * denotes correlation (see the sketch below)
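A minimal sketch of that template-matching step. The templates and the correlation measure are placeholders; normalized cross-correlation via cv2.matchTemplate stands in for the '*' operator on the slide:

```python
import cv2
import numpy as np

def estimate_slot_angle(patch_A, patch_B, templates):
    """Pick the template angle that best matches both marking-point patches.

    templates: dict mapping angle (degrees) -> template image with the same size
    and dtype as patch_A / patch_B.
    """
    def score(patch, tmpl):
        # Normalized cross-correlation as a stand-in for the '*' operator
        return float(cv2.matchTemplate(patch, tmpl, cv2.TM_CCORR_NORMED).max())

    return max(templates, key=lambda a: score(patch_A, templates[a]) +
                                        score(patch_B, templates[a]))
```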

SLIDE 34

Outline

  • Vision‐based Parking‐slot Detection
  • Background Introduction
  • General Flowchart
  • Surround‐view Synthesis
  • Parking‐slot Detection from Surround‐view
  • Experiments
  • Human‐body Keypoint Detection
SLIDE 35

Dataset

  • We collected and labeled a large-scale dataset
  • It covers vertical, parallel, and slant parking-slots
  • Typical illumination conditions were considered
  • Various road textures were included
  • 9827 training images
  • 2338 test images
  • The test set is separated into several subsets

Subset name               Number of image samples
indoor parking lot        226
outdoor normal daylight   546
outdoor rainy             244
outdoor shadow            1127
outdoor street light      147
outdoor slanted           48

SLIDE 36

Marking‐point detection accuracy

  • Miss rate vs. FPPI curves on the entire test set
SLIDE 37

Marking‐point localization accuracy

  • Statistics of the distances between the detected marking-points and the matched labeled ones

detection method    mean ± std (pixels)    mean ± std (cm)
ACF + Boosting      2.86 ± 1.54            4.77 ± 2.57
YoloV2-based        1.55 ± 1.05            2.58 ± 1.75

SLIDE 38

Parking‐slot detection accuracy

  • Precision-recall rates of different parking-slot detection methods

method                     precision    recall
Jung et al.'s method       98.38%       52.39%
Wang et al.'s method       98.27%       56.16%
Hamada et al.'s method     98.29%       60.41%
Suhr & Jung's method       98.38%       70.96%
PSD_L                      98.55%       84.64%
DeepPS                     99.67%       98.76%

SLIDE 39

Parking‐slot detection accuracy

  • Precision-recall rates of the two best-performing methods on each subset

subset                     PSD_L (precision, recall)    DeepPS (precision, recall)
indoor parking lot         (99.34%, 87.46%)             (100%, 97.67%)
outdoor normal daylight    (99.44%, 91.65%)             (99.61%, 99.23%)
outdoor rainy              (98.68%, 87.72%)             (100%, 99.42%)
outdoor shadow             (97.52%, 73.67%)             (99.86%, 99.14%)
outdoor street light       (98.92%, 92.00%)             (100%, 100%)
outdoor slanted            (93.15%, 83.95%)             (96.15%, 92.59%)

SLIDE 40

About the computational cost

  • Workstation configuration
  • GPU: Nvidia Pascal Titan X
  • CPU: 2.4 GHz Intel Xeon E5-2630 v3
  • RAM: 32 GB
  • The system can process one frame within 25 ms
SLIDE 41

Demo Video for PS Detection

SLIDE 42

Demo Video for Our Self‐parking System

SLIDE 43

  • On May 17, 2017, Han Zheng, Secretary of the Shanghai Municipal Party Committee, visited the "short-range autonomous parking system" during an inspection tour of Tongji University; its vision-based parking-slot detection technology was developed by our research group

SLIDE 44

Outline

  • Vision‐based Parking‐slot Detection
  • Human‐body Keypoint Detection
SLIDE 45

Outline

  • Vision‐based Parking‐slot Detection
  • Human‐body Keypoint Detection
  • Problem definition
  • OpenPose
SLIDE 46

Problem Definition

  • Human‐body Keypoints
  • Potential applications
  • Behavior analysis
SLIDE 47

OpenPose[1]

  • OpenPose
  • A CNN-based library for human-body keypoint detection
  • With an Nvidia Titan Xp GPU, its frame rate is about 15 fps
  • Supports both Windows and Ubuntu

[1] Z. Cao et al., Realtime multi‐person 2D pose estimation using part affinity fields, CVPR, 2017

SLIDE 48

OpenPose

SLIDE 49

Thanks!

Demo Video