RECOGNIZING PORNOGRAPHIC IMAGES USING DEEP CONVOLUTIONAL NEURAL - - PowerPoint PPT Presentation

recognizing pornographic images using deep convolutional
SMART_READER_LITE
LIVE PREVIEW

RECOGNIZING PORNOGRAPHIC IMAGES USING DEEP CONVOLUTIONAL NEURAL - - PowerPoint PPT Presentation

RECOGNIZING PORNOGRAPHIC IMAGES USING DEEP CONVOLUTIONAL NEURAL NETWORKS Olarik Surinta and Thananchai Khamket Faculty of Informatics Mahasarakham University th Present at the 4 International Conference on Digital Arts, Media and


slide-1
SLIDE 1

Olarik Surinta and Thananchai Khamket Faculty of Informatics Mahasarakham University

RECOGNIZING PORNOGRAPHIC IMAGES USING DEEP CONVOLUTIONAL NEURAL NETWORKS

Present at the 4

th

International Conference on Digital Arts, Media and Technology (ICDAMT2019) January 30 – February 2, 2019

slide-2
SLIDE 2

2

Outline:

l Introduction l Pornographic image recognition methods l Experimental settings and results l Conclusion

slide-3
SLIDE 3

3

Introduction:

l In pornographic image recognition, image

processing and machine learning techniques are proposed to use.

l Due to the image processing techniques,

l the human skin is extracted from the whole

image.

l The RGB is converted into HSV and YCbCr color

spaces to extract the skin color.

l The whole image region is calculated and decided

as the pornographic image when the ratio is more than the threshold value.

slide-4
SLIDE 4

4

Introduction:

l For the machine learning technique, l First, the color image is converted into HSV,

YCbCr color space to extract skin area.

l Then, extracted the feature from the skin

area.

l Finally, the machine learning technique

such as SVM and MLP are used to create a model and classify.

slide-5
SLIDE 5

5

Introduction:

l Rattanee and Chiracharit (2016)

Nudity detection based on face color and body morphology

slide-6
SLIDE 6

6

Introduction:

l Wijaya, et al. (2015) Pornographic image recognition based on

skin probability and Eiganporn of skin ROIs images

slide-7
SLIDE 7

7

Introduction:

l Wijaya, et al. (2015) Phonographic image recognition using

fusion of scale invariant descriptor

slide-8
SLIDE 8

8

Introduction:

l We evaluate the performance of 16 different

techniques on a TI-UNRAM pornographic image dataset.

l The use of existing deep CNN architectures

(ResNet, GoogLeNet, and AlexNet) and a BOW method are presented.

l This paper is combining three well-known

local descriptor methods, called LBP, HOG, and SIFT and three machine learning technique (SVM, MLP, and KNN).

slide-9
SLIDE 9

9

Pornographic image recognition methods:

l Deep Residual Networks (ResNet) l ResNet architecture has very deep network

and shown good performance in many image recognition.

l He et al. proposed the deep ResNet

architecture with a depth of 18, 34, 50, 101, and 152 layers.

l The ResNet-152 is deeper 22 and 7 times

than AlexNet and GoogLeNet, respectively.

slide-10
SLIDE 10

10

Pornographic image recognition methods:

l The novel architecture called shortcut

connections, is proposed.

l The shortcut directly uses the input of the

previous layer to the next output.

Plain network Residual network

slide-11
SLIDE 11

11

Experimental settings and results:

l The TI-UNRAM pornographic image dataset l Experimental setup l Experimental results

slide-12
SLIDE 12

12

TI-UNRAM dataset:

l This dataset includes two classes and contains

685 pornographic, 715 non-pornographic images (1400 images)

l These images are collected from the Internet l We randomly divided 50% of the whole

dataset into training and test set

slide-13
SLIDE 13

13

Non-pornographic images:

slide-14
SLIDE 14

14

Complex images:

Can you guess which images are pornographic?

slide-15
SLIDE 15

15

Experimental setup:

l We use 2-fold cross validation according to

Wijaya et al. (2015a, 2015b).

l We compute the average and standard

deviation for evaluating the test performance

  • f

l deep CNN architectures l Local descriptors combined with machine

learning techniques

l bag of words (BOW)

slide-16
SLIDE 16

16

Experimental results:

Recognition results using deep CNN methods

slide-17
SLIDE 17

17

Experimental results:

Recognition results using different local descriptors and machine learning techniques

slide-18
SLIDE 18

18

Conclusion:

l We have presented a comparative study on

the TI-UNRAM pornographic image dataset including

l local descriptors combined with machine

learning techniques

l a bag of visual words (BOW) l deep convolutional neural networks (CNNs)

slide-19
SLIDE 19

19

Conclusion:

l First, we proposed to use the LBP, HOG, and

SIFT as for the local descriptor methods.

l These three descriptor methods combined

with 3 machine learning techniques;

l SVM, MLP, and KNN l The results show that the LBP+SVM

  • utperforms the other combinations.

l The LBP+SVM method also gives a better result

than the BOW method.

slide-20
SLIDE 20

20

Conclusion:

l Second, we compared three deep CNN

architectures

l ResNet, GoogLeNet, and AlexNet

architectures

l To make a fair comparison, in these

experiments, the transfer learning and the data augmentation are not performed.

l The results show that the best

recognition accuracy is the ResNet, GoogLeNet, and AlexNet, respectively.

slide-21
SLIDE 21

21

Conclusion:

l Finally, the ResNet architecture which is the

best result in our experiment, also slightly higher than the LBP+SVM.

l Future work: l We want to improve the result of the deep

CNN by using transfer learning and data augmentation.

l We also consider the deep learning

approach that requires less memory usage and a decrease in training computing time.

slide-22
SLIDE 22

22

ICDAMT2019:

l Thank you for your kind attention.