Cla lass Prior Shif ift and Asymmetric Error Nontawat - PowerPoint PPT Presentation

Positive-Unlabeled Cla lassification under Cla lass Prior Shif ift and Asymmetric Error Nontawat Charoenphakdee 1,2 and Masashi Sugiyama 2,1 The University of Tokyo 1 RIKEN AIP 2

2 Supervised binary ry classification (P (PN classification) Positive and Negative data are given. Binary Data collection Classifier Features (input) Labels (output) + - Machine learning No noise robustness https://t.pimg.jp/006/570/886/1/6570886.jpg https://www.kullabs.com/uploads/meauring-clip-art-at-clker-com-vector-clip-art-online-royalty-free-H2SJHF-clipart.png https://d3njjcbhbojbot.cloudfront.net/api/utilities/v1/imageproxy/https://coursera.s3.amazonaws.com/topics/ml/large-icon.png\

3 Positive-unlabeled classification (P (PU classification) Positive and Unlabeled data are given. Binary Data collection Classifier Features (input) Labels (output) + Machine learning No noise robustness https://t.pimg.jp/006/570/886/1/6570886.jpg https://www.kullabs.com/uploads/meauring-clip-art-at-clker-com-vector-clip-art-online-royalty-free-H2SJHF-clipart.png https://d3njjcbhbojbot.cloudfront.net/api/utilities/v1/imageproxy/https://coursera.s3.amazonaws.com/topics/ml/large-icon.png\

4 Why PU classification? Unlabeled data are cheaper to obtain. Sometimes, negative data are hard to describe . In some real-world applications, collecting negative data is impossible . Applications : • Bioinformatics (Yang+, 2012, Singh-Blom+ 2013, Ren+, 2015) • Text classification (Li+, 2003) • Time series classification (Nguyen+, 2011) • Medical diagnosis (Zuluaga+, 2011) • Remote-sensing classification (Li+, 2011)

5 Class prior shift The ratio of positive - negative in the training and test data are different. pos. pos. Train Test neg. neg. Decision boundary is also shifted Lead to low accuracy! Examples : • Collect unlabeled data from the internet . • Collect unlabeled data from all users/patients/etc. for personalized application .

6 Class prior shift (c (cont.) Existing PU classification work assumes class prior of training and test data are the same (du Plessis+, 2014 2015, Kiryo+, 2017). Existing class prior shift work is not applicable since they require positive - negative data (Saerens, 2002, du Plessis+, 2012) .

7 PU classification under class prior shift Observed Given : Two sets of data Positive Unlabeled : Class prior shift! Unobserved Test Q: Does class prior shift heavily degrade the performance?

8 Classifier may fail miserably under class prior shift… Accuracy reported in mean and std. error of 10 trials with density ratio method. Accuracy drops heavily !! Our method Dataset Accuracy (no shift) Accuracy (shifted) Accuracy (shifted) banana 90.1 (0.6) 87.9 (0.3) 82.3 (0.5) ijcnn1 72.9 (0.4) 71.7 (0.3) 37.8 (0.7) MNIST 86.0 (0.4) 69.8 (0.7) 82.5 (0.6) susy 79.5 (0.5) 75.9 (0.5) 57.5 (0.9) cod-rna 87.4 (0.6) 84.7 (0.4) 78.5 (0.6) magic 76.7 (0.5) 79.0 (0.5) 60.6 (1.4) No shift: Shift!

9 Problem setting • Given : Two sets of data and test class prior Positive Unlabeled • Goal : Find a prediction function that minimizes

10 Proposed methods We proposed two approaches for PU classification under class prior shift: • Risk minimization approach : Learn a classifier based on empirical risk minimization principle (Vapnik, 1998) . • Density ratio approach : 1. Estimate a density ratio of positive and unlabeled densities. 2. Use an appropriate threshold to classify. Later, we will show that our methods are also applicable for PU classification with asymmetric error .

11 Risk minimization approach Consider the following classification risk: With , we can rewrite as Equivalent to existing methods (du Plessis+, 2015) if . No access to distribution: we minimize empirical error (Vapnik, 1998) :

12 Surrogate losses for binary ry classification Directly minimize 0-1 loss is difficult. • NP-Hard, discontinuous, not differentiable (Ben-david+, 2003, Feldman+, 2012) In practice, minimize a surrogate loss (regularization can also be added) :

13 Density ratio estimation Goal : Estimate the density ratio: from two sets of data Please check this book to learn more about Applications : outlier detection (Hido+, 2011) , density ratio estimation (Sugiyama+, 2012) change-point detection (Liu+, 2013) , robot control (Hachiya+, 2009) event detection in images/movies/text (Yamanaka, 2011, Matsugu, 2011, Liu, 2012) , etc. Naïve approach : estimate , separately then perform division . Does not work well (estimation error is amplified from division operation).

14 Unconstrained le least-squares im important fi fitting (uLSIF) (Kanamori+, 2012) Goal : Estimate the density ratio: How : estimate by minimizing squared loss objective: Squared loss decomposition: Empirical minimization (constant can be safely ignored):

15 Unconstrained le least-squares im important fi fitting (c (cont.) (Kanamori+, 2012) Model: linear-in parameter model : basis function (e.g., Gaussian kernel) Objective : : regularization parameter : identity matrix Global solution can be computed analytically: Parameter tuning (regularization, basis) can be done by cross-validation.

16 Density ratio approach Consider Bayes-optimal classifier of binary classification (no prior shift) We can rewrite it as Density ratio! Another formulation is Q1: How to modify when class prior shift occurs? Q2: Which formulation is preferable?

17 Q1: : Density ratio approach (s (shift) Consider Bayes-optimal classifier of binary classification We can rewrite it as Density ratio! Another formulation is Simply modifying the threshold can solve this problem!

18 Q2: : Difficulty of f density ratio estimation In general, density ratio is unbounded . is unbounded when . This raises issues of robustness and stability. We show that the density ratio is bounded in PU classification.

19 Q2: : Density ratio in PU In PU classification , density ratio is bounded. Lower and upper bounded Unbounded from above Insight: estimate is preferable. Our experimental results agree with this observation.

20 Experiments: class prior shift t train 0.7 .7 -> test 0.3 .3 Datasets: banana, ijcnn1, MNIST, susy, cod-rna, magic Methods: 𝒒 • Density ratio ( 𝒗 uLSIF ) 𝒗 • Density ratio ( 𝒒 uLSIF ) • Linear-in input model (Lin): Double hinge loss (DH-Lin) , squared loss (Sq-Lin) • Kernel model (Ker): Double hinge loss (DH-Ker) , squared loss (Sq-Ker) Parameter selection: (regularization, kernel width) 5-fold cross-validation. We also investigated when wrong test class prior is given. Results reported in mean and std. error of accuracy of 10 trials. Outperforming methods are bolded based on one-sided t-test with significance level 5%. Dataset information and more experiments and can be found in the paper.

21 Results: class prior shift Correct test prior is given Wrong test prior is given Traditional PU Preferable method in our experiments 𝒒 (density ratio 𝒗 uLSIF)

22 PU classification with asymmetric error • Given : Given two sets of sample: Positive Unlabeled • Goal : Find a prediction function that minimizes Reduce to symmetric error when

23 The equivalence of f pri rior shif ift and asymmetric error We can relate these problems based on the analysis of Bayes-optimal classifier.

24 Conclusion Class prior shift may heavily degrade the performance of positive-unlabeled classification (PU classification) . • Proposed two approaches for handling this problem effectively : ▪ Risk minimization approach ▪ Density ratio approach • Showed the equivalence of class prior shift and asymmetric error problems in PU classification . ▪ Our methods are applicable for both problems . ▪ Also applicable when considering both problems simultaneously . • Poster: #31: May 2 nd from 7:00-9:00PM

Cla lass Prior Shif ift and Asymmetric Error Nontawat - PowerPoint PPT Presentation

Positive-Unlabeled Cla lassification under Cla lass Prior Shif ift and Asymmetric Error Nontawat Charoenphakdee 1,2 and Masashi Sugiyama 2,1 The University of Tokyo 1 RIKEN AIP 2 2 Supervised binary ry classification (P (PN classification)

Cla lass of f 2022 Cla lass Meeting MR. R. MORGAN Pri rincip ipal MS. . FU FUQUAY Ass

Cla lass hierarcies inheritance method overriding super multiple inheritance Call

Cla lass hierarcies inheritance method overriding super multiple inheritance Call

Welcome! C LASS OF 2022 Source: This Is Pawsome Session IV Excellence Academy Welcome C LASS

GREA Notes on Benefits Presentation 1 May 2019 2 We are now in a SHIF (School Health

Good Morning! Good Morning! Resource Development Session II Session II Excellence Academy 1

17 o f 46 Ac c ide nts 26 o f 58 Ac c ide nts 35% 45% $76,858 in WC Cla ims Pa id Out $78,628

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

ILC Elaine Cristina Ferreira Silva Fortes Institute of Theoretical Physics- IFT/Unesp In

Fir irst Cla lass Support for College Students on the Autism Spectrum: Practical Advice for

WORLD LD CLA LASS SS SOLU LUTI TION ON Adia Ad iabatic atic, , evaporat orative ve pre

Cla lass of 2020 Tuesday Sept. 10, 2019 6:00pm Cafeteria Heather Handy Principal

Every ry Chil ild Every ry Cla lass Every ry Day! Charter School Advisory Board October 8,

Welcome Cla lass of 2026! Lets follow a day in the life! Daily Routine MMS doors open at

Na Naviga vigate te th then en Gr Grad aduate uate Cla lass of f 2020 2020 Mr.

Cla lass s of 2022 2022 NFHS S Principa ipal, Mr. Jeff f Chene ney Raider U Caseload

6/9/2017 Using deep learning to delineate pathological correlates of mammographic breast density

The Poisson Voronoi tessellation in hyperbolic space Elliot Paquette Weizmann Institute of

Financial results presentation Q2 FY04: quarter ended 30 September 2003 6 November 2003 Forward

Q1 2017 Delivering on our Commitments Today and Tomorrow Cautionary notes CAUTIONARY NOTE

Coping with the COVID 19 Pandemic HINDA DUBIN, M.D. Objectives At the conclusion of this

Outline Workforce Scheduling DMP204 SCHEDULING, TIMETABLING AND ROUTING 1. Transportation

Lecture: Image Resizing Juan Carlos Niebles and Ranjay Krishna 15-Oct-2019 Stanford Vision and

Kocsi = = Coa Coach National Youth Sport Institute Singapore Effective Youth Coaching &

Sambuz

Useful Links

Newsletter

Mail Us

Cla lass Prior Shif ift and Asymmetric Error Nontawat - PowerPoint PPT Presentation

Positive-Unlabeled Cla lassification under Cla lass Prior Shif ift and Asymmetric Error Nontawat Charoenphakdee 1,2 and Masashi Sugiyama 2,1 The University of Tokyo 1 RIKEN AIP 2 2 Supervised binary ry classification (P (PN classification)

Cla lass of f 2022 Cla lass Meeting MR. R. MORGAN Pri rincip ipal MS. . FU FUQUAY Ass

Cla lass hierarcies inheritance method overriding super multiple inheritance Call

Cla lass hierarcies inheritance method overriding super multiple inheritance Call

Welcome! C LASS OF 2022 Source: This Is Pawsome Session IV Excellence Academy Welcome C LASS

GREA Notes on Benefits Presentation 1 May 2019 2 We are now in a SHIF (School Health

Good Morning! Good Morning! Resource Development Session II Session II Excellence Academy 1

17 o f 46 Ac c ide nts 26 o f 58 Ac c ide nts 35% 45% $76,858 in WC Cla ims Pa id Out $78,628

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

ILC Elaine Cristina Ferreira Silva Fortes Institute of Theoretical Physics- IFT/Unesp In

Fir irst Cla lass Support for College Students on the Autism Spectrum: Practical Advice for

WORLD LD CLA LASS SS SOLU LUTI TION ON Adia Ad iabatic atic, , evaporat orative ve pre

Cla lass of 2020 Tuesday Sept. 10, 2019 6:00pm Cafeteria Heather Handy Principal

Every ry Chil ild Every ry Cla lass Every ry Day! Charter School Advisory Board October 8,

Welcome Cla lass of 2026! Lets follow a day in the life! Daily Routine MMS doors open at

Na Naviga vigate te th then en Gr Grad aduate uate Cla lass of f 2020 2020 Mr.

Cla lass s of 2022 2022 NFHS S Principa ipal, Mr. Jeff f Chene ney Raider U Caseload

6/9/2017 Using deep learning to delineate pathological correlates of mammographic breast density

The Poisson Voronoi tessellation in hyperbolic space Elliot Paquette Weizmann Institute of

Financial results presentation Q2 FY04: quarter ended 30 September 2003 6 November 2003 Forward

Q1 2017 Delivering on our Commitments Today and Tomorrow Cautionary notes CAUTIONARY NOTE

Coping with the COVID 19 Pandemic HINDA DUBIN, M.D. Objectives At the conclusion of this

Outline Workforce Scheduling DMP204 SCHEDULING, TIMETABLING AND ROUTING 1. Transportation

Lecture: Image Resizing Juan Carlos Niebles and Ranjay Krishna 15-Oct-2019 Stanford Vision and

Kocsi = = Coa Coach National Youth Sport Institute Singapore Effective Youth Coaching &amp;

Sambuz

Useful Links

Newsletter

Mail Us

Kocsi = = Coa Coach National Youth Sport Institute Singapore Effective Youth Coaching &