detecting facial manipulation deepfakes
play

Detecting Facial Manipulation Deepfakes Evan Kravitz, Huazhe Xu - PowerPoint PPT Presentation

Detecting Facial Manipulation Deepfakes Evan Kravitz, Huazhe Xu April 21, 2020 1 2 https://www.instagram.com/p/ByaVigGFP2U/ 3 https://www.instagram.com/p/ByaVigGFP2U/ What is a deepfake? Synthetic image/video of a person that looks


  1. Detecting Facial Manipulation Deepfakes Evan Kravitz, Huazhe Xu April 21, 2020 1

  2. 2 https://www.instagram.com/p/ByaVigGFP2U/

  3. 3 https://www.instagram.com/p/ByaVigGFP2U/

  4. What is a deepfake? ● Synthetic image/video of a person that looks realistic to human viewers, which can be used to perpetrate fraud or spread misinformation ● Deepfakes are a form of social engineering attack ● We have focused our research on detecting facial deepfakes 4

  5. Social engineering attacks 5

  6. Face synthesis StyleGan (2019) 6

  7. Face swap Deepfake FaceSwap (2020) FaceSwap (2016) 7

  8. Face attribute StarGAN (2018) 8

  9. Facial expression Face2Face (2016) 9

  10. Protecting against deepfake ● We need a system for authenticating media Real/fake? Authenticator 10

  11. Convolutional Neural Network (CNN) 11

  12. Convolutional Neural Network (CNN) cont. Convolutional Real/fake? Neural Network 12

  13. 13

  14. Optical flow Amerini et al., 2019 14

  15. CNN’s with optical flow Convolutional Real/fake? Neural Network 15

  16. CNN’s with self-labeled data (Li et al., 2019) 1. Generate “negative” examples that contain deepfake generation artifacts 2. Use “negative” examples to train a CNN Convolutional Real/fake? Neural Network 16

  17. Forensic deepfake detection ● Forensic approach ○ Generate correlations between facial features in a video to determine “signature motion” (Agarwal et al., 2019) 17

  18. 18

  19. Forensic deepfake detection cont. Cor(X 1 , X 1 ) Cor(X 1 , X 2 ) Real/fake? SVM Cor(X 1 , X 2 ) : : Cor(X i , X j ) 19

  20. Our contribution ● We aim to improve upon existing neural network and forensic feature models. ✓ Feature augmentation and enhancement Better classification model ✓ 20

  21. Original labeled data Altered labeled data Dataset Face2Face Entire YouTube 8M Dataset Cropped faces from video frames Face2Face manipulated video frames 21

  22. Dataset cont. ● 704 videos for training (368,135 images) ● 150 videos for validation (75,526 images) ● 50 videos for testing (77,745 images) 22

  23. Forensic analysis of facial landmarks 68 (x,y) PCA 50 features coordinates = 136 features Classifier Prediction 23

  24. Principal Component Analysis (PCA) ● Popular technique for dimensionality reduction ● Transform feature space into orthogonal basis features, only capture most prominent features ● Fewer features → less variance, less overfitting 24

  25. Method: Random forest classifier ● Pros: ○ Works with few features ○ Lower variance compared to regular decision tree ○ Explainable model ○ Low cost ● Cons: ○ Hard to tune 25 https://towardsdatascience.com/random-forest-classification-and-its-implementation-d5d840dbead0

  26. Method: Support vector machine ● Pros: ○ Supports non-linear decision boundaries ● Cons: ○ Hard to tune kernel and hyperparameters 26 https://pythonmachinelearning.pro/classification-with-support-vector-machines/

  27. Method: Neural Network with facial landmarks FC Facial neural Output Landmark Net detector Loss: Cross Entropy loss Features PCA for dimension Pros: reduction Lightweight --- single GPU training Large batchsize Cons: Data hungry 27 Need extensive tuning

  28. Metrics Accuracy: (True Positive + True Negative) / total samples Precision: True Positives / All the predicted positives Recall: True Positives / All the actual positives 28

  29. Results: in-distribution samples (small scale) - Near perfect performance for random forest SVM Random NN Forest - What does this imply? We can perfectly detect Accuracy 80.00% 98.10% 85.12% fake/real across the web if Table 1: Accuracy for different models we have label for part of a Random Forest NN clip. Precision 98.52% 92.81% - 10K training images Recall 98.72% 85.01% Table 2: Precision and Recall for top 2 models 29

  30. Results: out-of-distribution training and testing - Both methods drops SVM Random NN Forest significantly - Neural Net performs slightly Accuracy N/A 70.50% 73.78% better (the training accuracy for Table 1: Accuracy of Random and NN model NN is 90% and for random forest 99.9%) Random Forest NN - Training data is too little! - 14K training images Precision 77.15% 79.23% Recall 58.82% 63.44% Table 2: Precision and Recall for top 2 models 30

  31. Public Benchmark Results w/ ~5 times our current training data - Larger net - More data - Utilize video property 31 http://kaldir.vc.in.tum.de/faceforensics_benchmark/index.php?sortby=dface2face

  32. Visualized Examples Original Image Altered Image 32

  33. Next steps Temporal Features Scale up & Analysis Compare with public Benchmark CNN + Forensic Features 33

  34. Thank you! 34

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend