(DEFECT SEGMENTATION) Peter Pyun Ph.D. Andrew Liu Ph.D. Relevant - PowerPoint PPT Presentation

DL-BASED INDUSTRIAL INSPECTION (DEFECT SEGMENTATION) Peter Pyun Ph.D. Andrew Liu Ph.D.

Relevant Links: Defect Segmentation Nvidia Industrial Inspection White Paper V2.0: https://nvidia-gpugenius.highspot.com/viewer/5c949687a2e3a90445b8431f Using U-net and public DAGM dataset (with Nvidia GPU T4, TRT5), it shows 23.5x perf. boost using T4/TRT5, compared to CPU-TF. 2

Industrial Defect Inspection Nvidia GPU Cloud (NGC) Docker images DL Model set up - Unet AGENDA Data preparation Defect segmentation – precision/recall Automatic Mixed Precision - AMP GPU accelerated inferencing – TF-TRT & TRT 3

INDUSTRIAL DEFECT INSPECTION 4

Industrial Inspection Use-case Display panel Automotive Manufacturing Panel CPU socket PCB Battery surface defects (Electric car, Mobile phone) Foundry/Wafer IC Packaging 5

2 Main Scenarios – Industrial/Manufacturing inspection Without With AOI AOI 6

NVIDIA DEEP LEARNING PLATFORM Data (Curated/Annotated) AI INFERENCING @EDGE AI TRAINING @DATA CENTER TensorRT Tesla/Turing Runtime Optimizer DGX Nvidia GPU Cloud (NGC) Tesl docker container DRIVE AGX Jetson AGX TensorRT a DNN 7

NGC DOCKER IMAGES 8

Benefits for Deep Learning Workflow High Level Benefits and Feature Set Develop once, deploy Scale across teams of anywhere Single software stack practitioners Developer, DevOp, QC

Defect classification workflow Rapid prototyping for production with NGC Pre- Trainin Inference Training g TF-TRT / TensorRT Tensorflow: NGC optimized docker image 1. NGC TensorFlow 1. NGC TensorFlow 2. NGC TensorRT V100 DGX-1V DGX-1 / 2 V100 T4 Used in industrial inspection white paper

MODEL SET UP 11

DL FOR DEFECT INSPECTION Supervised unsupervised Segmentation Classification Object Detection Autoencoder (Defect / Non Defect) Bounding-Box Polygons Mask Itself 12

FROM LITERATURE: CNN/LENET (2016) Source: Design of Deep Convolutional Neural Network Architectures for Automated Feature Extraction in Industrial Inspection, D. Weimer et al, 2016 13

FROM LITERATURE CNN/LENET (2016) Coarse segmentation results - can we do better? Source: Design of Deep Convolutional Neural Network Architectures for Automated Feature Extraction in Industrial Inspection, D. Weimer et al, 2016 14

U-Net structure 32 16 16 1 16 16 512 2 512 2 512 2 512 2 512 2 512 2 16 32 32 64 32 32 256 2 256 2 256 2 256 2 256 2 256 2 32 64 64 64 128 64 3X3 Conv2d+ReLU 128 2 128 2 128 2 128 2 128 2 128 2 2X2 MaxPool 2X2 Conv2dTranspose 128 128 64 256 128 128 copy and concatenate 64 2 64 2 64 2 64 2 64 2 64 2 128 256 256 32 2 32 2 32 2

KERAS-TF IMPLEMENTATION- ENCODING Convolution 16

KERAS-TF IMPLEMENTATION- ENCODING deconvolution 17

Image segmentation on medical images Same process among various use cases Data Science BOWL Data Science BOWL Data Science BOWL 2016 2017 2018 MRI image CT image Image Nuclei Left ventricle Nodule Drug discovery heart disease Lung cancer 18

Many others Different verticals Drone Surveillance Autonomous Car Path Space Human Road Space Navigation Anomaly Detection Space for Self Driving Car 19

MANUFACTURING Defect Inspection 20

DATA PREPARATION 21

DATASET FOR INDUSTRIAL OPTICAL INSPECTION DAGM (from German Association for Pattern Recognition) • http://resources.mpi-inf.mpg.de/conferences/dagm/2007/prizes.html 22

DAGM DATASET Pass NG Pass NG Pass NG NG 23 Pass

DAGM DETAILS Original images are 512 x 512 grayscale format • Output is a tensor of size 512 x 512 x 1 • Each pixel belongs to one of two classes • 6 defect classes • Training set consist of 100 defect images • Validation set consist of 50 defect images • 24

DAGM EXAMPLES WITH LABELS 25

Dice Metric (IOU) for unbalanced dataset Metric to compare the similarity of two samples: • 2 𝐵 𝑜𝑚 ________________________________ 𝐵 𝑜 + 𝐵 𝑚 Where: • • A n is the area of the contour predicted by the network A l is the area of the contour from the label • A nl is the intersection of the two • The area of the contour that is predicted correctly by the network • 1.0 means perfect score. • • More accurately compute how well we’re predicting the contour against the label • We can just count pixels to give us the respective areas 26

27 LEARNING CURVES 27

U-NET / DAGM FOR INDUSTRIAL INSPECTION • DAGM merged binary classification dataset: 6000 defect-free, 132 defect images • Challenges : Not all deviations from the texture are necessarily defects. 28

DEFECT SEGMENTATION – PRECISION/RECALL 29

FINAL DECISION 30

DEFECT VS NON-DEFECT BY THRESHOLDING Declare as defect (white) Segmentation model outputs Numpy array of class probability of each class (example 2 if probability is higher classes) than threshold (=0.5) Thresholding query image 512x512 31

INFERENCE PIPELINE Domain expertise involved decision making (not a black-box) Inference decision making (defect vs. non-defect) Data Center / Cloud DGX Server / V100 TF-TRT & TensorRT Determine threshold Domain Criteria Camera ฀ Defect Pattern Ratio Precision/ Recall ฀ Defect Level ฀ Defect region size ฀ Defect counts Result Composite Detectors/Classifiers/Segment … Metadata Inspection Machine Edge TF-TRT & TensorRT T4 / V100 32

(Example) Precision/Recall diagram 33

(Example) Simple binary anomaly detector Threshold of probability of defect: higher number means harder for classifier to detect as defect class. Higher threshold: FP lower, precision (TP/(TP+FP)) higher FN higher, recall (TP/(TP+FN)) lower TP: True Positive, FP: False Positive, FN: False Negative, TN: True Negative. red arrow means moving threshold of probability on defect detection into higher value. 34

Precision/Recall Results Experimental results verifies precision/recall trade-off. Domain expert knowledge involved: choose threshold per your application and business needs threshold 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 TP 137 135 135 135 135 135 135 133 131 TN 885 893 899 899 899 899 899 900 901 FP 16 8 2 2 2 2 2 1 0 FN 1 3 3 3 3 3 3 5 7 FP rate 0.0178 0.0089 0.0023 0.0023 0.0023 0.0023 0.0023 0.0011 0.0000 precision 0.8954 0.9441 0.9854 0.9854 0.9854 0.9854 0.9854 0.9925 1.0000 recall 0.9928 0.9783 0.9783 0.9783 0.9783 0.9783 0.9783 0.9638 0.9493 Choose: threshold = 0.8 for high precision = 0.9925 & small FP rates = 0.0011 35

Precision/Recall - reducing false positives Precision =TP/(TP+FP) : 99.25% Recall = TP/(TP+FN) : 96.38% False alarm rate = FP/(FP+TN): 0.11% Actual defect defect free defect 99.25% (TP) 0.75% (FP) Predict defect free 0.55% (FN) 99.45% (TN) *sensitivity=recall=true positive rate, specificity=true negative rate=TN/(TN+FP), false alarm rate=false positive rate 36

Final decision Defect segmentation (U-net + Thresholding) 37

AUTOMATIC MIXED PRECISION FOR U-NET ON V100 38

TENSOR CORES FOR DEEP LEARNING Mixed Precision implementation using Tensor Cores on Volta and Turing GPUs Tensor Cores A revolutionary technology that accelerates AI performance by enabling • efficient mixed-precision implementation Accelerate large matrix multiply and accumulate operations in a single • operation Mixed Precision Technique combined use of different numerical precisions in a computational method; focus is on FP16 and FP32 combination. Benefits Decreases the required amount of memory enabling training of larger models or • training with larger mini-batches Shortens the training or inference time by lowering the required resources by • using lower-precision arithmetic 39 https://developer.nvidia.com/tensor-cores

Automatic Mixed Precision Easy to Use, Greater Performance and Boost in Productivity Insert two lines of code to introduce ● Automatic Mixed-Precision in your training layers for up to a 3x performance improvement . The Automatic Mixed Precision feature uses a ● graph optimization technique to determine FP16 operations and FP32 operations. Available in TensorFlow, PyTorch and MXNet ● More details: https://developer.nvidia.com/automatic-mixed-precision via our NGC Deep Learning Framework Containers. Unleash the next generation AI performance and get faster to the market! 40

Enable Automatic Mixed Precision Add Just A Few Lines of Code, Get Upto 3X Speedup TensorFlow PyTorch MXNet os.environ['TF_ENABLE_AUTO_MIXED_PRECISION'] = '1' model, optimizer = amp.initialize(model, optimizer) amp.init() amp.init_trainer(trainer) with amp.scale_loss(loss, optimizer) as with amp.scale_loss(loss, trainer) as scaled_loss: OR thru NGC scaled_loss: autograd.backward(scaled_loss) export TF_ENABLE_AUTO_MIXED_PRECISION=1 scaled_loss.backward() More details: https://developer.nvidia.com/automatic-mixed-precision 41

(DEFECT SEGMENTATION) Peter Pyun Ph.D. Andrew Liu Ph.D. Relevant - PowerPoint PPT Presentation

DL-BASED INDUSTRIAL INSPECTION (DEFECT SEGMENTATION) Peter Pyun Ph.D. Andrew Liu Ph.D. Relevant Links: Defect Segmentation Nvidia Industrial Inspection White Paper V2.0: https://nvidia-gpugenius.highspot.com/viewer/5c949687a2e3a90445b8431f

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Defect Removal Metrics SE 350 Software Process & Product Quality 1 Objectives Understand

Defect Removal Metrics September 30, 2004 Swami Natarajan RIT Software Engineering Defect

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Co-Segmentation of 3D Shapes via Subspace Clustering Ruizhen Hu Lubin Fan

Introduction to RFM segmentation Karolis Urbonas Head of Data Science, Amazon DataCamp

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Analyzing fluid flows via the ergodicity defect ergodicity defect Sherry E. Scott FFT 2013

A Defect- -Tolerant Tolerant A Defect Computer Architecture: Computer Architecture:

Circuit Analysis and Defect Characteristics Estimation Method Using Bimodal Defect-Centric Random

Defect Prevention and Removal SE 350 Software Process & Product Quality 1 Objectives

Foreground detection and tracking in 2D/3D Jos Luis Landabaso Montse Pards Outline 2D

Fast Connected Component Labeling Algorithm Using A Divide and Conquer Technique by Jung-Me Park

More manifestos In 1990 Stonebraker responded to Atkinsons 1989 database manifesto with The

Large Objects in Databases John von Neumann, PhD Lecturer Born: December 28, 1903 in Budapest

An Introduction to Modern Object Detection Gang Yu yugang@megvii.com Visual Recognition A

Basic Data Structures Divide and Conquer Algorithms, Biostatistics 615/815 Lecture 5: . . 1 /

Use of Gwyddion libraries for your little tasks Petr Klapetek, David Neas Anna Charvtov

Object-Oriented Finite Element Analysis of Material Microstructures Stephen A. Langer

(DEFECT SEGMENTATION) Peter Pyun Ph.D. Andrew Liu Ph.D. Relevant - PowerPoint PPT Presentation

DL-BASED INDUSTRIAL INSPECTION (DEFECT SEGMENTATION) Peter Pyun Ph.D. Andrew Liu Ph.D. Relevant Links: Defect Segmentation Nvidia Industrial Inspection White Paper V2.0: https://nvidia-gpugenius.highspot.com/viewer/5c949687a2e3a90445b8431f

Segmentation Bottom-up Segmentation Semantic / instance segmentation Many Slides from L.

VIDEO SIGNALS Segmentation WHAT IS SEGMENTATION WHAT IS SEGMENTATION Segmentation is a

Semantic Segmentation / Instance Segmentation Based on Deep learning Yiding Liu 2018.12.08

Defect Removal Metrics SE 350 Software Process &amp; Product Quality 1 Objectives Understand

Defect Removal Metrics September 30, 2004 Swami Natarajan RIT Software Engineering Defect

Segmentation Segmentation Segmentation Define the accurate boundaries of all objects in an image

Segmentation using Segmentation using Bayesian Decision Theory Bayesian Decision Theory

Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017

Pixel-Level Im Image Understanding wit ith Semantic Segmentation and Panoptic Segmentation

Co-Segmentation of 3D Shapes via Subspace Clustering Ruizhen Hu Lubin Fan

Introduction to RFM segmentation Karolis Urbonas Head of Data Science, Amazon DataCamp

Image Segmentation Machine Learning Study Group Presented by Yaochen Xie Jan 25, 2018 Outline

Analyzing fluid flows via the ergodicity defect ergodicity defect Sherry E. Scott FFT 2013

A Defect- -Tolerant Tolerant A Defect Computer Architecture: Computer Architecture:

Circuit Analysis and Defect Characteristics Estimation Method Using Bimodal Defect-Centric Random

Defect Prevention and Removal SE 350 Software Process &amp; Product Quality 1 Objectives

Foreground detection and tracking in 2D/3D Jos Luis Landabaso Montse Pards Outline 2D

Fast Connected Component Labeling Algorithm Using A Divide and Conquer Technique by Jung-Me Park

More manifestos In 1990 Stonebraker responded to Atkinsons 1989 database manifesto with The

Large Objects in Databases John von Neumann, PhD Lecturer Born: December 28, 1903 in Budapest

An Introduction to Modern Object Detection Gang Yu yugang@megvii.com Visual Recognition A

Basic Data Structures Divide and Conquer Algorithms, Biostatistics 615/815 Lecture 5: . . 1 /

Use of Gwyddion libraries for your little tasks Petr Klapetek, David Neas Anna Charvtov

Object-Oriented Finite Element Analysis of Material Microstructures Stephen A. Langer

Defect Removal Metrics SE 350 Software Process & Product Quality 1 Objectives Understand

Defect Prevention and Removal SE 350 Software Process & Product Quality 1 Objectives