under Class Imbalance Aditya K. Menon 1 , Harikrishna Narasimhan 2 , - PowerPoint PPT Presentation

Jan 20, 2023 •23 likes •177 views

On the Statistical Consistency of Algorithms for Binary Classification under Class Imbalance Aditya K. Menon 1 , Harikrishna Narasimhan 2 , Shivani Agarwal 2 and Sanjay Chawla 3 1 University of California, San Diego 2 Indian Institute of Science,

On the Statistical Consistency of Algorithms for Binary Classification under Class Imbalance Aditya K. Menon 1 , Harikrishna Narasimhan 2 , Shivani Agarwal 2 and Sanjay Chawla 3 1 University of California, San Diego 2 Indian Institute of Science, Bangalore 3 University of Sydney and NICTA, Sydney
Class Imbalance • Medical Diagnosis • Text Retrieval • Credit Risk Minimization • Fraud Detection • ….
Class Imbalance • Medical Diagnosis • Text Retrieval • Credit Risk Minimization • Fraud Detection • …. Standard misclassification error ill-suited!
Class Imbalance • Medical Diagnosis • Text Retrieval • Credit Risk Minimization • Fraud Detection • …. Standard misclassification error ill-suited!
Class Imbalance • Medical Diagnosis • Text Retrieval • Credit Risk Minimization • Fraud Detection • …. Standard misclassification error ill-suited!
Algorithmic Approaches • Sampling: (Japkowicz & Stephen, 2002; Chawla et al., 2002, 2003; Van Hulse et al., 2007; He & Garcia, 2009) – Over-sample the minority class – Under-sample the majority class – SMOTE – … • Plug-in classifier (Elkan, 2001) • Balanced ERM (Liu & Chawla, 2011; Wallace et al., 2011)
Two Families of Algorithms Algorithm 1 Plug-in with Empirical Threshold • Learn a class probability estimator from training data S . • Apply a suitable empirical threshold on the class probability estimate. 1 0
Two Families of Algorithms Algorithm 1 Algorithm 2 Plug-in with Empirical Threshold Empirically Balanced ERM • Learn a class probability estimator • Learn a binary classifier by minimizing from training data S . a balanced surrogate loss. • Apply a suitable empirical threshold • Balancing terms estimated from on the class probability estimate: training data. 1 0
Main Consistency Results AM-regret
Main Consistency Results AM-regret AM-consistency
Main Consistency Results AM-regret AM-consistency Main Results: Under mild conditions on the underlying distribution and under certain assumptions on the surrogate loss function minimized, Algorithms 1 and 2 are AM-consistent.
Key Ingredients in Proofs • Balanced losses (Kotlowski et al, 2011) • Decomposition lemma: • Surrogate regret bounds for cost-sensitive classification (Scott, 2012) • Proper and strongly proper losses (Reid and Williamson, 2009, 2010; Agarwal, 2013) • Surrogate regret bounds for standard binary classification (Zhang, 2004; Bartlett et al, 2006)
Experiments Standard ERM Synthetic data Real data p = 0.05 p = 0.097
Experiments Standard ERM Synthetic data Real data p = 0.05 p = 0.097 AM performance of Plug-in and Balanced ERM comparable to that of the sampling techniques
Experiments Standard ERM Synthetic data Real data p = 0.05 p = 0.097 AM performance of Plug-in and Poster 794 Balanced ERM comparable to that of Today the sampling techniques

Recommend

Stakeholder telco on single balance single imbalance price model 12.3.2020 Erica Arberg,

Stakeholder telco on single balance single imbalance price model 12.3.2020 Erica Arberg, Energinet Agenda 1. Upcoming changes to Nordic imbalance settlement model 2. The European Methodology for the harmonisation of Imbalance settlement 3.

435 views • 17 slides

PCI Overview of Energy Imbalance Markets in West 1 Webinar Purpose Purpose of Webinar: Provide

PCI Overview of Energy Imbalance Markets in West 1 Webinar Purpose Purpose of Webinar: Provide high level overview of energy imbalance markets and address common questions that PCI hears from entities considering joining an energy imbalance

282 views • 27 slides

Equal Sum Sequences and Imbalance Sets of Tournaments Muhammad Ali Khan Center for Computational

Equal Sum Sequences and Imbalance Sets of Tournaments Muhammad Ali Khan Center for Computational and Discrete Geometry Department of Mathematics & Statistics University of Calgary November 29, 2013 1 / 25 Imbalance The imbalance t ( v )

362 views • 25 slides

Vandalism Detection on Wikipedia The class imbalance problem & new approaches Paul Gtze

Vandalism Detection on Wikipedia The class imbalance problem & new approaches Paul Gtze 13.10. 2014 Contents Vandalism detection The class imbalance problem Content based classifiers Wikipedia in Numbers 920 K 4.7 M 6 M Vandalism

511 views • 26 slides

Class Imbalance Learning in Software Defect Prediction Dr. Shuo Wang s.wang@cs.bham.ac.uk

Class Imbalance Learning in Software Defect Prediction Dr. Shuo Wang s.wang@cs.bham.ac.uk University of Birmingham Research keywords: ensemble learning, class imbalance learning, online learning Shuo Wang (University of Birmingham) Software

546 views • 27 slides

Improving Electric fraud detection using class imbalance strategies Eng. Federico Decia Eng.

Problem description Data Imbalance problem Strategy Proposed Results and Conclusions Improving Electric fraud detection using class imbalance strategies Eng. Federico Decia Eng. Matas Di Martino Eng. Juan Molinelli Prof. Alicia

677 views • 45 slides

Energy I y Imbalance Inf nformation n Sessi essions Novem ember 12 | 12 | Pho hoeni enix,

Energy I y Imbalance Inf nformation n Sessi essions Novem ember 12 | 12 | Pho hoeni enix, AZ , AZ Jim Kendrick VP of Power Marketing 1 AGENDA 1. Welcome & Introduction 2. Study Update 3. WALC Energy Imbalance Currently 4.

325 views • 14 slides

Western Energy Imbalance Market and Regional Energy Activities Valerie Fong and John Prescott

Western Energy Imbalance Market and Regional Energy Activities Valerie Fong and John Prescott Western Energy Imbalance Market Governing Body May 20, 2019 ISO PUBLIC ISO PUBLIC California ISO Western EIM Market Operator Uses advanced

452 views • 20 slides

KARMA KARMA P2P lending platform Investor presentation PREMISES Investment Imbalance

KARMA KARMA P2P lending platform Investor presentation PREMISES Investment Imbalance Geographical Imbalance There are more than 1 billion potential investors all In developed countries: loan interest rates are very over the world

392 views • 20 slides

Energy Imbalance Markets in the West Sierra Nevada Region June 27, 2019 Arun Sethi VP of Power

Energy Imbalance Markets in the West Sierra Nevada Region June 27, 2019 Arun Sethi VP of Power Marketing Markets are here Energy Imbalance in the West | 2 Why is WAPA interested in Markets? Interest not new. Besides the April 2019

330 views • 18 slides

Comments of Powerex Corp. on Energy Imbalance Market Year 1 Enhancements Submitted by Company

Comments of Powerex Corp. on Energy Imbalance Market Year 1 Enhancements Submitted by Company Date Submitted Mike Benn Powerex Corp. January 22, 2015 604.891.6074 Powerex appreciates the opportunity to comment on CAISOs Energy Imbalance

480 views • 12 slides

WTG Rotor Balancing Extending Lifetime of wind turbines 8.2 France damien.huchet@8p2.fr

WTG Rotor Balancing Extending Lifetime of wind turbines 8.2 France damien.huchet@8p2.fr 06 59 12 25 95 - www.8p2.com What is is rotor imbalance ? The rotor imbalance is the result of : - Mass Imbalance, when the center of gravity of

723 views • 8 slides

Detecting Application Load Imbalance on Cray Systems Heidi Poxon Technical Lead, Performance

Detecting Application Load Imbalance on Cray Systems Heidi Poxon Technical Lead, Performance Tools Cray Inc. Outline Cray Performance Tools Overview Motivation for Load Imbalance Analysis Metrics Offered by Cray Performance Tools Examples

477 views • 21 slides

Earth's Energy Imbalance: Natural Variability and SST patterns Cristian Proistosescu JISAO,

Earth's Energy Imbalance: Natural Variability and SST patterns Cristian Proistosescu JISAO, University of Washington The Earths Energy Imbalance and its implications Nov 15, 2018 Toulouse, France Collaborators: Yue Dong, Kyle Armour, Robb

567 views • 33 slides

Programming Abstraction in C++ Eric S. Roberts and Julie Zelenski Stanford University 2010

Vector Class Grid Class Stack Class Queue Class Map Class Lexicon Class Scanner Class Iterators Programming Abstraction in C++ Eric S. Roberts and Julie Zelenski Stanford University 2010 Vector Class Grid Class Stack Class Queue Class

586 views • 54 slides

BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class

BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class Introductory Class From here From here BIBLICAL

840 views • 69 slides

The Consistency Analysis of Secondary Index on Distributed

The Consistency Analysis of Secondary Index on Distributed Ordered Tables Houliang Qi, Xu Chang, Xingwu Liu, Li Zha Agenda Background MoEvaEon

701 views • 27 slides

Handling discovered inconsistencies not always possible semantics-dependent Distributed

Detection of mutual inconsistency in distributed systems (Parker, Popek, et. al.) Distributed system with replication for reliability (availability) efficient access Maintaining consistency of all copies hard to do efficiently

402 views • 13 slides

Scalable consistency for replicated data Anne3e Bieniusa Overview

Scalable consistency for replicated data Anne3e Bieniusa Overview Replica:on Scalable consistency Limita:ons and Outlook Anne3e Bieniusa - Scalable

727 views • 45 slides

Persistency Programming 101 Why and What of memory persistency

Persistency Programming 101 Why and What of memory persistency Aasheesh Kolli* Steven Pelley Ali Saidi $ Peter M. Chen*

369 views • 20 slides

Approaches to Voting Credit for several visuals: Ariel D. Procaccia CSC2556 - Nisarg Shah 1

CSC2556 Lecture 3 Approaches to Voting Credit for several visuals: Ariel D. Procaccia CSC2556 - Nisarg Shah 1 Approaches to Voting What does an approach give us? A way to compare voting rules Hopefully a uniquely optimal voting

636 views • 32 slides

8. Ordinary Differential Equations Indispensable for many technical applications! 8. Ordinary

Introduction Approximation of IVP with FD Consistency and Convergence Multistep Methods Stiff Problems Applications 8. Ordinary Differential Equations Indispensable for many technical applications! 8. Ordinary Differential Equations

575 views • 30 slides

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation Eva

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation Eva Martnez Garcia Carles Creus Cristina Espaa-Bonet Llus Mrquez EAMT 2017 May 30th Prague Outline Motivation 1 Lexical Consistency

808 views • 47 slides

Consistency, Completeness, and Classicality Adam P renosil Institute of Computer Science,

Consistency, Completeness, and Classicality Adam P renosil Institute of Computer Science, Academy of Sciences of the Czech Republic 18 June 2015, Hejnice Logica 2015 Adam P renosil Consistency, Completeness, and Classicality 1 / 20

571 views • 32 slides