A Pseudo-Boolean Set Covering Machine Pascal Germain, S ebastien - PowerPoint PPT Presentation

A Pseudo-Boolean Set Covering Machine Pascal Germain, S´ ebastien Gigu` ere, Jean-Francis Roy, Brice Zirakiza, Fran¸ cois Laviolette, and Claude-Guy Quimper GRAAL (Universit´ e Laval, Qu´ ebec city) October 9, 2012 Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 1 / 10

Plan 1 Binary classification and Machine learning (ML) 2 Set covering machines (SCM) 3 Using a CP approach to answer a ML question 4 Empirical results Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 2 / 10

Binary Classification and Machine Learning (ML) Example Each example ( x , y ) is a description-label pair : The description x ∈ R n is a feature vector. The label y ∈ { 0 , 1 } is a boolean value. Dataset A dataset S is a collection of several examples . def S = { ( x 1 , y 1 ) , ( x 2 , y 2 ) , . . . , ( x m , y m ) } Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 3 / 10

Binary Classification and Machine Learning (ML) Learning Algorithm A ( S ) → h The goal of a learning algorithm is to study a dataset and build a classifier . Classifier h ( x ) → y A classifier is a function that takes a description of an example as input, and outputs a label prediction. Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 4 / 10

Set Covering Machines (SCM) [ Marchand and Shawe-Taylor, 2002 ] Data-Dependent Ball A ball g i , j is defined by a center ( x i , y i ) ∈ S and a border ( x j , y j ) ∈ S . � y i if � x − x i � ≤ � x i − x j � def g i , j ( x ) = ¬ y i otherwise. Conjunction of Data-Dependent Balls def � Given a set of balls B , the SCM classifier is h B ( x ) = g i , j ( x ) . g i , j ∈B Positive ball Negative ball Conjunction of balls Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 5 / 10

Sample Compression Theory The theory suggests to minimize the following cost function : def f ( B ) = 2 × number of balls + number of training errors SCM is a Greedy Algorithm The SCM is a fast algorithm driven by a parameterized heuristic . At each greedy step, the heuristic chooses a ball to add to the conjunction B . The search is restarted several times with different heuristic parameters. The cost function f ( B ) selects the best conjunction among all restarts. f ( B ) = 2 × 1 + 2 = 4 f ( B ) = 2 × 1 + 8 = 10 f ( B ) = 2 × 2 + 1 = 5 Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 6 / 10

Using a CP approach to answer a ML question How Good is the Greedy Strategy? How far to the optimal f ( B ∗ ) is the solution found by the SCM? Finding the global minimum is hard Finding the optimal f ( B ∗ ) is a combinatorial NP-hard problem . CP to the rescue! We designed a Pseudo-Boolean program that directly minimizes f ( B ) and compare the solution to the one obtained by the SCM. Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 7 / 10

Pseudo-Boolean Set Covering Machine Given a dataset S = { ( x 1 , y 1 ) , ( x 2 , y 2 ) , . . . , ( x m , y m ) } of m examples. m f ( B ∗ ) = min � ( r i + s i ) subject to 5 × m linear constraints. i =1 ∼ m 2 Program Variables For every i , j ∈ { 1 , . . . , m } : s i is equal to 1 iff the example x i belongs to a ball. r i is equal to 1 iff h B ∗ misclassifies the example x i . b i , j is equal to 1 iff the ball g i , j belongs to B ∗ . We compare the original SCM to three pseudo-Boolean solvers: PWBO, Lynce (2011) BSOLO, Vasco Manquinho and Marques-Silva (2006) SCIP, Achterberg (2004) Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 8 / 10

Empirical results (common benchmarks in Machine Learning community) Dataset SCM PWBO SCIP BSOLO name size F time F time F time F time 25 2 0.04 2 0.03 2 0.71 2 0.05 breastw 50 2 0.07 2 0.06 2 3.7 2 0.64 100 2 0.16 2 0.43 2 0.05 2 20 25 8 0.31 7 0.31 7 4.1 7 0.64 bupa 50 14 1.32 12 589 12 47 12 989 100 27 11 32 T/O 30 T/O 34 T/O 25 4 0.11 4 0.08 4 2 4 0.22 credit 50 6 0.25 5 9.3 5 21 5 30.1 100 12 1.3 11 T/O 10 798 18 T/O 25 5 0.11 5 0.03 5 12 5 0.2 glass 50 9 0.49 8 10.3 8 35 8 28 100 18 2.9 17 T/O 17 T/O 22 T/O 25 5 0.17 5 0.03 5 3.6 5 0.18 haberman 50 10 0.94 10 34 10 30 10 65 100 21 4.5 20 T/O 20 T/O 23 T/O 25 8 0.33 8 0.36 8 4 8 0.94 pima 50 15 0.9 13 2204 13 37 13 1985 100 25 7.4 26 T/O 23 T/O 30 T/O 25 3 0.07 3 0.011 3 0.21 3 0.08 USvotes 50 5 0.17 4 0.141 4 2.4 4 1.1 100 6 0.35 4 1.21 4 100 4 80 Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 9 / 10

Conclusion Thanks to pseudo-Boolean techniques For the first time, we show empirically the effectiveness of the SCM . This is a very surprising result given the simplicity and the low complexity of the greedy algorithm. Final word from Anonymous Reviewer #3 This is one of those disconcerting results that show that simple, low-complexity algorithms can be enough to solve combinatorially hard problems that appear to need heavier-weight approaches. Germain et al. (GRAAL, Universit´ e Laval) A Pseudo-Boolean Set Covering Machine October 9, 2012 10 / 10

A Pseudo-Boolean Set Covering Machine Pascal Germain, S ebastien - PowerPoint PPT Presentation

A Pseudo-Boolean Set Covering Machine Pascal Germain, S ebastien Gigu` ere, Jean-Francis Roy, Brice Zirakiza, Fran cois Laviolette, and Claude-Guy Quimper GRAAL (Universit e Laval, Qu ebec city) October 9, 2012 Germain et al.

Boolean Algebra Chapter 3 Boolean Values Introduction Boolean Operations Fundamental Operators

1 Boolean Algebra 1. Boolean Algebra Verification Technology Content 1.1 Boolean algebra basics

Digital Design Discussion: Boolean Algebra Boolean Expression Equivalence Boolean Function

Boolean Logic 01-1 Boolean values Are TRUE and FALSE 01-2 Boolean values Are TRUE and

CHAPTER III BOOLEAN ALGEBRA R.M. Dansereau; v.1.0 BOOLEAN VALUES INTRO. TO COMP. ENG.

Improved efficiency for covering codes matching the sphere- covering bound Aditya Potukuchi and

On Modelling Virtual Machine Consolidation to Pseudo-Boolean Constraints Bruno Cesar Ribas 1 , 3 ,

The boolean type and boolean operators Recall that Java provides a data type boolean which can

Boolean Functions Boolean Expressions Let B = { 0 , 1 } . 1 ... true, 0 ... false Let x 1 , x 2 ,

1. Boolean Algebra 1.1 Boolean Algebra Basics Verification Technology AND-operation

Pseudo-Boolean Constraints: Reasoning and Compilation Romain Wallon (Advisors: Daniel Le Berre,

Sixth Pseudo-Boolean Competition PB11 Vasco M ANQUINHO and Olivier R OUSSEL 14th International

From clauses to pseudo-Boolean constraints in a Boolean solver Daniel Le Berre joint work with

Piecewise Boolean algebra Chris Heunen 1 / 33 Boolean algebra: example , ,

Boolean Algebras Mongi BLEL King Saud University August 30, 2019 Mongi BLEL Boolean Algebras

Definitions Boolean set: B 0 , 1 Boolean constants: 0, 1 Boolean variable: x 0

Advanced Rendering II, Sampling Week 9, Wed Mar 9 http://www.ugrad.cs.ubc.ca/~cs314/Vjan2005

Implem lementing enting the e Continuu ntinuum m of Car are e for r Su Subst stance

Physics Beyond Colliders Annual Workshop Chris Quigg Fermi National Accelerator Laboratory

Detecting Protagonists in German Plays around 1800 as a Classification Task Disclaimer We have

The AEgIS Experiment Measuring the Gravitational Interaction of Antimatter Andreas Knecht / CERN

Optical Communications Telecommunication Engineering School of Engineering University of Rome La

PyMTL:'A'Unified'Framework'for'Ver8cally'Integrated' Computer'Architecture'Research'

Role Of Soil Behavior On The Initial Kinematics Of Tsunamigenic Slides Chapter December 2006