cost aware pre training for multiclass cost sensitive
play

Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning - PowerPoint PPT Presentation

Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning Yu-An Chung 1 Hsuan-Tien Lin 1 Shao-Wen Yang 2 1 Dept. of Computer Science and Information Engineering National Taiwan University, Taiwan 2 Intel Labs Intel Corporation, USA


  1. Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning Yu-An Chung 1 Hsuan-Tien Lin 1 Shao-Wen Yang 2 1 Dept. of Computer Science and Information Engineering National Taiwan University, Taiwan 2 Intel Labs Intel Corporation, USA IJCAI 2016 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 1 / 17

  2. Outline Cost-sensitive Classification Setup 1 Estimate the costs - Regression Network 2 A novel Cost-aware Pre-training Technique 3 Conclusions 4 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 2 / 17

  3. Cost-sensitive Classification Setup Outline Cost-sensitive Classification Setup 1 Estimate the costs - Regression Network 2 A novel Cost-aware Pre-training Technique 3 Conclusions 4 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 3 / 17

  4. Cost-sensitive Classification Setup What is the status of the patient? ?? H1N1-infected Cold-infected Healthy A classification problem – grouping patients into different status. Which mistake is more serious? Predicting ... H1N1 as Healthy vs. Cold as Healthy Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 4 / 17

  5. Cost-sensitive Classification Setup Cost-sensitive Classification Measuring the Mis-classification Costs by Cost Matrix Predicted H1N1 Cold Healthy Actual C = H1N1 0 1000 100000 Cold 100 0 3000 Healthy 100 30 0 C ( i , j ): cost of classifying a class i example as class j Regular classification: special case of cost-sensitive classificaiton Cost-sensitive Classification Setup Input: A training set S = { ( x n , y n ) } N n =1 and a cost matrix C , where x n ∈ X , y n ∈ Y = { 1 , 2 , ..., K } Goal: Use S and C to train a classifier g : X → Y such that the expected cost C ( y , g ( x )) on test example ( x , y ) is minimal Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 5 / 17

  6. Cost-sensitive Classification Setup Our Contributions Where are we? Shallow Models Deep Learning (e.g., SVM) Regular (Cost-insensitive) Popular and Well-studied Classification undergoing Cost-sensitive Classification Well-studied Our work First work that studies Cost-sensitive Deep Learning a novel Cost-sensitive Loss ( CSL ) for training any deep models 1 (end-to-end) a Cost-sensitive Autoencoder ( CAE ) equipped with CSL for 2 pre-training deep models (layer-wise) a combination of 1) and 2) as a complete Cost-sensitive Deep Neural 3 Network ( CSDNN ) solution extensive experimental results have shown that deep models indeed 4 outperformed shallow ones (potential to study more!) Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 6 / 17

  7. Estimate the costs - Regression Network Outline Cost-sensitive Classification Setup 1 Estimate the costs - Regression Network 2 A novel Cost-aware Pre-training Technique 3 Conclusions 4 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 7 / 17

  8. Estimate the costs - Regression Network Regression Network Network: to estimate the per-class costs Training: motivated by an earlier cost-sensitive SVM work, a Cost-sensitive Loss ( CSL ) that trains the network cost-sensitively is derived in this work (see paper or poster for details) Prediction: g ( x ) ≡ argmin r k ( x ) 1 � k � K Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 8 / 17

  9. A novel Cost-aware Pre-training Technique Outline Cost-sensitive Classification Setup 1 Estimate the costs - Regression Network 2 A novel Cost-aware Pre-training Technique 3 Conclusions 4 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 9 / 17

  10. A novel Cost-aware Pre-training Technique Recap on Unsupervised Pre-training A classical way of training DNNs Two steps Unsupervised layer-wise pre-training Autoencoder, Restricted Boltzmann Machine (RBM) Several Autoencoders or RBMs can then be stacked to form a DNN. End-to-end supervised fine-tuning Cost-aware Pre-training Embed the proposed Cost-sensitive Loss ( CSL ) into Autoencoder a cost-sensitive version of Autoencoder (CAE) conduct cost-related features extraction Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 10 / 17

  11. A novel Cost-aware Pre-training Technique Autoencoder (AE) Autoencoder (AE): Let L CE denotes the reconstruction errors of the AE to be minimized (CE stands for cross-entropy). Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 11 / 17

  12. A novel Cost-aware Pre-training Technique Cost-sensitive Autoencoder (CAE) for cost-aware pre-training Cost-sensitive Autoencoder (CAE): CAE: Reconstruct x and estimate C simultaneously Objective function for CAE: (1 − β ) × L CE + β × L CSL β ∈ [0 , 1] When β = 0, CAE ≡ AE Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 12 / 17

  13. A novel Cost-aware Pre-training Technique Experimental Results (Selected) 3 methods were compared to show the validity of CSL and CAE: Cost-sensitive pre-training? Cost-sensitive training? DNN no no DNN + CSL no yes CSDNN yes yes Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 13 / 17

  14. Conclusions Outline Cost-sensitive Classification Setup 1 Estimate the costs - Regression Network 2 A novel Cost-aware Pre-training Technique 3 Conclusions 4 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 14 / 17

  15. Conclusions Conclusions CSL: make any deep model cost-sensitive (see paper for details) CSDNN = CAE pre-training + CSL fine-tuning: both techniques lead to significant improvements Extensive experimental results showed the superiority of CSDNN (see paper or poster) Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 15 / 17

  16. Conclusions Thank you! Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 16 / 17

  17. Supplementary Materials β vs. Test Costs MNIST imb bg−img−rot imb 0.24 5.2 5 0.22 4.8 0.2 4.6 4.4 0.18 4.2 0.16 4 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 SVHN imb CIFAR−10 imb 7.6 0.34 7.4 0.32 7.2 0.3 7 0.28 6.8 6.6 0.26 6.4 0.24 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Y.-A. Chung, H.-T. Lin, S.-W. Yang Cost-sensitive Deep Learning IJCAI 2016 17 / 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend