Fast Rates for a k-NN Classifier Robust to Unknown Asymmetric Label Noise
Henry W. J. Reeve and Ata Kabán, University of Birmingham, United Kingdom
International Conference on Machine Learning 2019
Pacific Ballroom #187
Suppose we have a distribution $P$ over $\mathcal{X} \times \{0,1\}$. Our goal is to obtain a classifier $\phi : \mathcal{X} \to \{0,1\}$ which minimizes the risk $\mathcal{R}(\phi) = P(\phi(X) \neq Y)$. We would like uncorrupted data $(X_1, Y_1), \dots, (X_n, Y_n)$ drawn i.i.d. from $P$. Instead, we have corrupted data $(X_1, \tilde{Y}_1), \dots, (X_n, \tilde{Y}_n)$ drawn i.i.d. from a corrupted distribution $P_{\mathrm{corr}}$.
There exist label noise probabilities $\pi_0, \pi_1 \in [0,1)$ with
1. $P(\tilde{Y} = 1 - y \mid Y = y, X = x) = \pi_y$ for $y \in \{0,1\}$ (class-conditional noise, independent of $x$);
2. $\pi_0 + \pi_1 < 1$.
Samples consist of a feature vector $X_i$ and a noisy label $\tilde{Y}_i$.
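The corruption model above is easy to simulate. The following is a minimal NumPy sketch (the function name `corrupt_labels` and the particular noise rates are illustrative, not from the paper):

```python
import numpy as np

def corrupt_labels(y, pi0, pi1, rng):
    """Flip clean labels y in {0, 1} with class-conditional probabilities:
    P(flip | y = 0) = pi0 and P(flip | y = 1) = pi1 (asymmetric when pi0 != pi1)."""
    flip_prob = np.where(y == 1, pi1, pi0)
    flips = rng.random(len(y)) < flip_prob
    return np.where(flips, 1 - y, y)

# Example: 10,000 clean labels, noise rates pi0 = 0.2, pi1 = 0.4 (so pi0 + pi1 < 1).
rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=10_000)
y_tilde = corrupt_labels(y, pi0=0.2, pi1=0.4, rng=rng)
```

Only the noisy pairs $(X_i, \tilde{Y}_i)$ are observed by the learner; the clean labels are used here solely to generate the corruption.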
Asymmetric class-conditional label noise occurs in numerous applications:
distinguishing neutrons from gamma rays (Blanchard et al., 2016)
learning with Positive and Unlabelled data (Elkan & Noto, 2009)
Let $\hat{\eta}_{\mathrm{corr}}$ be the k-nearest-neighbour regression estimator of the corrupted regression function $\tilde{\eta}(x) = P(\tilde{Y} = 1 \mid X = x) = (1 - \pi_1)\eta(x) + \pi_0(1 - \eta(x))$, based on the corrupted sample. The Robust k-NN classifier:
1) Estimate the label noise probabilities $\pi_0, \pi_1$.
2) Binary k-nearest-neighbour prediction with a label-noise-dependent threshold: predict $1$ whenever $\hat{\eta}_{\mathrm{corr}}(x) \geq (1 + \hat{\pi}_0 - \hat{\pi}_1)/2$.
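The two-step procedure can be sketched in NumPy as below. This is a simplified illustration, assuming the corrected threshold $(1 + \pi_0 - \pi_1)/2$ that recovers the clean Bayes classifier under the class-conditional noise model; the helper names are hypothetical:

```python
import numpy as np

def knn_regress(X_train, y_train, X_query, k):
    """k-NN regression estimate of the corrupted regression function
    eta_tilde(x) = P(noisy label = 1 | x): average of the k nearest noisy labels."""
    # Pairwise squared Euclidean distances, shape (n_query, n_train).
    d2 = ((X_query[:, None, :] - X_train[None, :, :]) ** 2).sum(-1)
    nn = np.argpartition(d2, k - 1, axis=1)[:, :k]  # indices of the k nearest points
    return y_train[nn].mean(axis=1)

def robust_knn_predict(X_train, y_noisy, X_query, k, pi0_hat, pi1_hat):
    """Robust k-NN: threshold the corrupted k-NN estimate at (1 + pi0 - pi1) / 2
    instead of 1/2, since eta_tilde = (1 - pi1) * eta + pi0 * (1 - eta) maps the
    clean threshold eta = 1/2 to eta_tilde = (1 + pi0 - pi1) / 2."""
    eta_tilde_hat = knn_regress(X_train, y_noisy, X_query, k)
    threshold = (1 + pi0_hat - pi1_hat) / 2
    return (eta_tilde_hat >= threshold).astype(int)
```

With known noise rates and points far from the decision boundary, thresholding at $(1 + \pi_0 - \pi_1)/2$ classifies correctly even though naive thresholding at $1/2$ would not (e.g. $\tilde{\eta} = 0.2 < 0.45$ on one side, $\tilde{\eta} = 0.7 > 0.45$ on the other when $\pi_0 = 0.2$, $\pi_1 = 0.3$).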
The Robust k-NN classifier was introduced by Gao et al. (2018), who:
1) Conducted a comprehensive empirical study demonstrating that the method typically outperforms a range of competitors.
2) Proved finite-sample bounds. However,
a) Fast rates (faster than $n^{-1/2}$) have not been established.
b) The bounds assume prior knowledge of the label noise probabilities $\pi_0, \pi_1$.
In our work the label noise probabilities are unknown!
We adopt the range assumption of Menon et al. (2015): the clean regression function $\eta(x) = P(Y = 1 \mid X = x)$ attains values arbitrarily close to both $0$ and $1$. Since $\tilde{\eta} = (1 - \pi_1)\eta + \pi_0(1 - \eta)$, this gives $\inf_x \tilde{\eta}(x) = \pi_0$ and $\sup_x \tilde{\eta}(x) = 1 - \pi_1$, so the noise probabilities are identifiable from the corrupted distribution alone.
We also adopt the following non-parametric assumptions:
A) Measure-smoothness assumption: $|\eta(x) - \eta(x')| \leq \lambda \cdot \mu\big(B(x, \rho(x, x'))\big)^{\gamma}$ for all $x, x'$, where $\mu$ is the marginal feature distribution and $B(x, r)$ is the metric ball of radius $r$.
B) Tsybakov's margin assumption: $\mu\big(\{x : 0 < |\eta(x) - 1/2| \leq t\}\big) \leq C_{\alpha}\, t^{\alpha}$ for all $t > 0$.
Main result (Reeve & Kabán, 2019). Suppose the distribution satisfies (1) the range assumption, (2) the measure-smoothness assumption, and (3) Tsybakov's margin assumption. Then, with probability at least $1 - \delta$ over the corrupted sample, the excess risk of the Robust k-Nearest Neighbour classifier matches the minimax optimal rate for the noise-free setting up to logarithmic factors (e.g. Audibert & Tsybakov, 2006).
A result of independent interest: estimating the label noise probabilities reduces to determining the maximum of a noisy function with minimal assumptions.