Nature–inspired and deep methods for feature selection

Pavel Krömer, Jan Platoš¹
Data Science Summer School @ Uni Vienna

¹Dept. of Computer Science,
VŠB - Technical University of Ostrava, Ostrava, Czech Republic
{pavel.kromer,jan.platos}@vsb.cz


Outline

  • Introduction
  • Feature subset selection
  • Nature–inspired feature subset selection
    • Genetic algorithms
    • Differential evolution
  • Compression–based data entropy estimation
  • Compression–based evolutionary feature subset selection
  • Experiments
  • Lesson learned
  • Deep feature selection
  • Summary


Introduction

Problem statement

  • Modern datasets comprise millions of records and many thousands of features.
  • Feature (subset) selection is an established procedure to reduce data dimensionality, which benefits both the performance and the accuracy of subsequent tasks (e.g. classification).
  • Nature–inspired feature selection methods, based on the principles of evolutionary computation, have shown the potential to efficiently process very high–dimensional datasets.

Feature subset selection

Feature subset selection (FSS) is a high–level search for an optimum subset of data features, selected according to a particular set of criteria. Given a data set Y = {A ∪ Z}, where A = {a1, a2, . . . , an} is the set of input features, find B ⊂ A so that feval(B) is maximized. FSS can be formulated as an optimization or search problem. The definition of the evaluation criteria is a paramount aspect of evolutionary feature selection and depends strongly on the purpose of the FSS.

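To make this formulation concrete, the sketch below enumerates all fixed-size subsets of a small feature matrix and keeps the one maximizing a user-supplied feval. This is a minimal illustration, not the method used later in the talk; the toy data and the variance-based criterion are hypothetical placeholders.

```python
from itertools import combinations

import numpy as np

def best_subset(X, k, feval):
    """Exhaustive fixed-length FSS: return the k-subset of columns
    of X that maximizes the evaluation criterion feval."""
    n = X.shape[1]
    best, best_score = None, -np.inf
    for subset in combinations(range(n), k):
        score = feval(X[:, subset])
        if score > best_score:
            best, best_score = subset, score
    return best, best_score

# Toy usage: 6 random features, pick the 2 with the largest total variance.
rng = np.random.default_rng(42)
X = rng.normal(size=(100, 6)) * rng.uniform(0.1, 2.0, size=6)
print(best_subset(X, 2, feval=lambda Xs: Xs.var(axis=0).sum()))
```

Exhaustive enumeration is only feasible for small n and k, which is exactly why the talk turns to evolutionary search.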

Nature–inspired feature subset selection


Evolutionary computation

Evolutionary computation is a group of iterative stochastic search and optimization methods based on the programmatic emulation of successful optimization strategies observed in nature.

Evolutionary algorithms use Darwinian evolution and Mendelian inheritance to model the survival of the fittest through the processes of selection and heredity.


Genetic algorithms

The Genetic Algorithm (GA) is a population-based, meta-heuristic, soft optimization method. GAs solve complex optimization problems by evolving a population of encoded candidate solutions. The solutions are ranked using a problem-specific fitness function. Artificial evolution, implemented by the iterative application of genetic and selection operators, leads to the discovery of solutions with above-average fitness.


Basic principles of GA

Encoding

Problem encoding is an important part of a GA. It translates candidate solutions from the problem domain (the phenotype) to the encoded search space (the genotype) of the algorithm. The representation specifies the chromosome data structure and the decoding function.

Genetic operators

Crossover recombines two or more chromosomes. It propagates so-called building blocks (solution patterns with above-average fitness) from one generation to the next and creates new, better-performing building blocks. In contrast, mutation inserts new material into the population by randomly perturbing the chromosome structure. In this way, new building blocks can be created and old ones disrupted.

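A minimal sketch (assuming chromosomes are fixed-length lists of feature indices; not the authors' exact implementation) of crossover and mutation for fixed-length subset selection that, by construction, never produce invalid individuals with duplicate features:

```python
import random

def crossover(p1, p2, k):
    """Recombine two fixed-length subset chromosomes; sampling from the
    union of the parents keeps the child valid (k unique genes)."""
    pool = list(set(p1) | set(p2))
    return random.sample(pool, k)

def mutate(chrom, n_features, rate=0.1):
    """Replace each gene, with probability `rate`, by a feature
    not already present in the chromosome."""
    chrom = list(chrom)
    for i in range(len(chrom)):
        if random.random() < rate:
            candidates = set(range(n_features)) - set(chrom)
            chrom[i] = random.choice(sorted(candidates))
    return chrom

# Toy usage: two parents selecting 3 of 10 features.
random.seed(1)
child = crossover([0, 2, 5], [2, 7, 9], k=3)
print(child, mutate(child, n_features=10))
```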

Differential evolution

Differential evolution (DE) is a versatile stochastic evolutionary optimization algorithm for real-valued problems. It uses differential mutation,

    vᵢ = vᵣ₁ + F · (vᵣ₂ − vᵣ₃),                      (1)

and a crossover operator,

    l = rand(1, N),                                   (2)

    vᵢ,ⱼ = { vᵢ,ⱼ,  if rand(0, 1) < C or j = l
           { xᵢ,ⱼ,  otherwise                         (3)

to evolve a population of parameter vectors.

[Figure: vector diagram of differential mutation: the difference vᵣ₂ − vᵣ₃ is scaled by F and added to vᵣ₁, giving vᵣ₁ + F · (vᵣ₂ − vᵣ₃).]

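A compact sketch of one DE/rand/1/bin generation implementing Eqs. (1)-(3); the population size, F, and C values are illustrative, not the tuned parameters used in the experiments:

```python
import numpy as np

def de_step(pop, fitness, F=0.8, C=0.9, rng=np.random.default_rng(0)):
    """One generation of DE/rand/1 with binomial crossover (Eqs. 1-3)."""
    NP, N = pop.shape
    new_pop = pop.copy()
    for i in range(NP):
        r1, r2, r3 = rng.choice([j for j in range(NP) if j != i], 3, replace=False)
        v = pop[r1] + F * (pop[r2] - pop[r3])          # differential mutation (1)
        l = rng.integers(N)                            # guaranteed crossover point (2)
        cross = rng.random(N) < C
        cross[l] = True
        u = np.where(cross, v, pop[i])                 # binomial crossover (3)
        if fitness(u) >= fitness(pop[i]):              # greedy selection
            new_pop[i] = u
    return new_pop

# Toy usage: minimize the sphere function (fitness = negative cost).
pop = np.random.default_rng(1).uniform(-5, 5, size=(20, 4))
for _ in range(50):
    pop = de_step(pop, fitness=lambda x: -np.sum(x**2))
print(pop[np.argmax([-np.sum(x**2) for x in pop])])
```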

Evolutionary feature subset selection

Evolutionary FSS types

  • Wrapper–based approaches look for subsets of features for which a particular classification algorithm reaches the highest accuracy.
  • Filter–based approaches are classifier-independent and utilize various indirect feature subset evaluation measures (e.g. statistical, geometric, information-theoretic).

Here, we use two evolutionary methods for fixed–length subset selection and a fitness function based on compression–based data entropy estimation to establish a novel filter–based evolutionary FSS.


Compression–based data entropy estimation

Entropy is a general concept that expresses the amount of information contained in a message. The entropy of a random variable X, consisting of a sequence of values x1, x2, . . . , xn, is defined by

    H(X) = − ∑ᵢ P(xᵢ) log₂ P(xᵢ)                      (4)

Entropy is used as the basis of a number of derived measures, including conditional entropy, H(X|Y), and information gain. It is the basis of several feature selection methods, but is generally hard to evaluate in practical settings. Computationally efficient entropy estimators are used in place of exact measures.

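For concreteness, the plug-in estimator of Eq. (4), with probabilities replaced by observed relative frequencies, can be written as follows (a standard empirical estimator, not the compression-based one introduced on the next slides):

```python
from collections import Counter
import math

def empirical_entropy(values):
    """Plug-in Shannon entropy (Eq. 4): H = -sum p(x) log2 p(x),
    with p(x) estimated by observed relative frequencies."""
    counts = Counter(values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

print(empirical_entropy("abracadabra"))  # ~2.04 bits per symbol
```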

Compression–based data entropy estimation (cont.)

Compression–based data entropy estimation is a computationally feasible approach to entropy estimation for real–world applications, with a solid theoretical background (Shannon entropy ≈ Kolmogorov complexity).

  • Kolmogorov complexity (of a binary string), K(x), is the length of the shortest program that can produce x.
  • Conditional Kolmogorov complexity, K(x|y), is analogous to conditional entropy.
  • Kolmogorov complexity is non–computable, but has been associated with data compression (Li et al., 2004; Cilibrasi and Vitányi, 2005):

    K(x|y) ≈ C(x · y),                                 (5)

given C(x) ≈ C(x · x).

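A hedged sketch of Eq. (5) in code, with zlib standing in for the compressor C(·) (an assumption made for illustration; the talk itself uses the FPC compressor):

```python
import zlib

def C(data: bytes) -> int:
    """Approximate Kolmogorov complexity by compressed size."""
    return len(zlib.compress(data, 9))

def cond_complexity(x: bytes, y: bytes) -> int:
    """Eq. (5): K(x|y) approximated by the compressed size of the
    concatenation x . y."""
    return C(x + y)

x = b"the quick brown fox jumps over the lazy dog " * 20
y = b"the quick brown fox " * 20
print(C(x), cond_complexity(x, y))  # related data compresses well together
```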

Compression–based evolutionary feature subset selection

Objective

Develop a filter–based evolutionary feature subset selection method with entropy (compression) as the basis for feature subset evaluation, i.e. solve a specific fixed–length subset selection problem.

Methods

  • Genetic algorithm (GA) – a GA for fixed–length subset selection with compact chromosomes, crossover, and mutation, without the creation of invalid individuals.
  • Differential evolution (DE) – a no–frills DE for fixed–length subset selection, to see how a continuous algorithm fares.
  • FPC, a fast lossless compression algorithm for double-precision floating-point data (Burtscher and Ratanaworabhan, 2009), as the fitness function.

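A minimal sketch of such a compression-based fitness function: the bytes of the selected feature columns are compressed, and the compressed size becomes the subset's score. zlib again stands in for FPC, which is a specialized floating-point compressor:

```python
import zlib

import numpy as np

def compression_fitness(X, subset):
    """Score a feature subset by the compressed size of its columns
    (zlib standing in for the FPC floating-point compressor)."""
    data = np.ascontiguousarray(X[:, list(subset)], dtype=np.float64)
    return len(zlib.compress(data.tobytes(), 9))

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))
X[:, 3] = X[:, 0]                     # a redundant (duplicate) feature
print(compression_fitness(X, (0, 3)), compression_fitness(X, (0, 1)))
```

Maximizing this score favors subsets whose columns carry the most information; in the example, the subset containing the duplicated column compresses better and therefore scores lower.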

Experiments

  • An in-house implementation of GA (steady-state, with generation gap 2) and DE (DE/rand/1), with FPC as the fitness function.
  • Two data sets from the UCI Machine Learning Repository (Hepatitis, Spambase).
  • A battery of well-known classification methods (CART, Naive Bayes, k-Nearest Neighbours).

Data set properties and the number of classification errors for full data sets.

                                   Classification errors
Dataset      Attrs.   Records    CART    NB    kNN(1)   kNN(3)
Hepatitis      20         80       11     8       13
Spambase       58       4601        3   513        3      216
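Error counts like those in the table could be reproduced with scikit-learn along these lines (assuming scikit-learn is available; the bundled breast-cancer data and the train/test split are illustrative stand-ins for the UCI sets and the talk's exact protocol):

```python
from sklearn.datasets import load_breast_cancer  # stand-in for the UCI sets
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for name, clf in [("CART", DecisionTreeClassifier(random_state=0)),
                  ("NB", GaussianNB()),
                  ("kNN(1)", KNeighborsClassifier(n_neighbors=1)),
                  ("kNN(3)", KNeighborsClassifier(n_neighbors=3))]:
    errors = (clf.fit(X_tr, y_tr).predict(X_te) != y_te).sum()
    print(f"{name}: {errors} classification errors")
```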


Experiments (cont.)

FPC as a feature subset selection criterion

All possible subsets of 2, 3, and 4 features were analyzed for the test data sets. FPC and the classification error were computed for each subset.

Rank correlation (Spearman's ρ) between FPC and the number of classification errors (p-value shown in parentheses).

                                      Classifier
Dataset      CART               NB                kNN(1)             kNN(3)
Hepatitis    −0.786 (3.9E−7)    −0.039 (0.6)      −0.781 (2.2E−36)   −0.688 (2.5E−25)
Spambase     −0.840 (0.0)       −0.300 (1.2E−34)  −0.534 (1.7E−118)  −0.530 (4.6E−116)
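The correlation itself is a one-liner with SciPy (assumed available). Given per-subset FPC values and error counts, hypothetical numbers here:

```python
from scipy.stats import spearmanr

# Hypothetical per-subset results: FPC value and classification errors.
fpc = [1195, 1180, 1102, 990, 870, 640, 510, 230]
errors = [9, 10, 12, 14, 15, 19, 22, 30]

rho, p = spearmanr(fpc, errors)
print(f"Spearman's rho = {rho:.3f}, p = {p:.2g}")  # strongly negative here
```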


Experiments (cont.)

[Figure: FPC vs. classification errors in the Hepatitis data set.]


Experiments (cont.)

[Figure: FPC vs. classification errors in the Spambase data set.]


Experiments (cont.)

GA and DE as feature subset selection metaheuristics

Both methods were executed with the best parameters found in trial-and-error runs: a total of 10,000 fitness function evaluations each, over 50 independent runs.

The percentage of feature subsets with FPC lower than the best, average, and worst subsets found by the investigated methods:

                       GA percentile                  DE percentile
Dataset     k     best     average    worst      best     average    worst
Hepatitis   2     99.42     57.89      2.34      99.42     99.42      99.42
            3    100.00     94.22     24.10     100.00    100.00     100.00
            4     99.96     97.81     33.13      99.96     99.96      99.96
Spambase    2    100.00     99.81     47.99     100.00    100.00     100.00
            3    100.00     99.99      4.97     100.00    100.00     100.00
            4    100.00    100.00    100.00     100.00     99.99      99.99

Note: all of the best solutions found feature subsets with the maximum possible FPC.


Experiments (cont.)

[Figure: CART and kNN(1) classification errors of 2-feature subsets evolved by GA and DE on the Hepatitis (1st row) and Spambase (2nd row) data sets.]


Experiments (cont.)

The final FPC of feature subsets evolved by the GA and the DE.

             FPC of GA-evolved feature subsets          FPC of DE-evolved feature subsets
Dataset    k      best    average (σ)          worst       best    average (σ)          worst
Hepatitis  2      1195     939.52 (331.43)       230       1195      1195 (0)            1195
           3      1796    1694.94 (255.15)       646       1796      1796 (0)            1796
           4      2380    2274.38 (294.86)      1238       2380      2380 (0)            2380
           5      2972    2887.30 (303.79)      1317       2972      2972 (0)            2972
           10     4728    4677.40 (277.75)      2743       4728    4727.90 (0.30)        4727
           15     5544    5261.40 (457.61)      3989       5544    5518.04 (32.31)       5452
Spambase   2     66064   63203.02 (11328.08)   16671      66064     66064 (0)           66064
           3     97466   95822.56 (11504.08)   15294      97466     97466 (0)           97466
           4    122431  122431 (0)            122431     122431  122318.92 (549.08)    119629
           5    142234  142234 (0)            142234     142234  142110.56 (604.73)    139148
           10   228155  221059.80 (5283.04)   210622     217278  206335.58 (4840.57)   198413
           15   287258  276567.86 (7387.49)   258259     274438  260328.52 (5225.25)   251003


Lesson learned

  • An efficient feature subset evaluation criterion based on a fast approximation of feature subset entropy was proposed and evaluated.
  • The results suggest that the FPC-based fitness function is reasonable: feature subsets with high FPC values correspond to feature subsets that yield low classification errors for the test classifiers.
  • DE performs better on small data and/or low–dimensional feature subsets, while the GA seems more suitable for large data and larger feature subsets.


Deep feature selection


Deep learning

Deep learning is a high–level approach that solves the representation learning problem by introducing representations that are expressed in terms of other, simpler representations. It creates a hierarchy of representations: more complex concepts are defined as compositions of simpler ones, and a clear interpretation of the learned filters is desired.

[Diagram: deep learning as a subset of representation learning, which is in turn a subset of machine learning.]


Example: Convolutional neural network (visualization)


Example: Representation learning (autoencoder/Diabolo network)

[Figure: autoencoder, Input → H1 … Hk … Hn → Output; the encoder maps the input to the code (the representation), and the decoder reconstructs the input from it.]

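A minimal Keras sketch of such an autoencoder trained to reproduce its input (assuming TensorFlow is available; the layer sizes are illustrative):

```python
import numpy as np
import tensorflow as tf

n_features, code_dim = 30, 4

# Encoder -> code -> decoder; the training target equals the input.
autoencoder = tf.keras.Sequential([
    tf.keras.Input(shape=(n_features,)),
    tf.keras.layers.Dense(16, activation="relu"),                      # H1
    tf.keras.layers.Dense(code_dim, activation="relu", name="code"),   # Hk
    tf.keras.layers.Dense(16, activation="relu"),                      # Hn
    tf.keras.layers.Dense(n_features),                                 # reconstruction
])
autoencoder.compile(optimizer="adam", loss="mse")

X = np.random.default_rng(0).normal(size=(512, n_features)).astype("float32")
autoencoder.fit(X, X, epochs=5, batch_size=32, verbose=0)
print(autoencoder.evaluate(X, X, verbose=0))  # reconstruction MSE
```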

Deep feature selection

Deep feature selection (DFS) is a family of methods that use the principles of deep learning for feature selection. They seek a higher–level representation of features (HLF) and try to utilize it, directly or indirectly, in the feature selection process.

Example: Sentiment is a higher–level feature of texts (e.g. reviews). It can be learned in a semi–supervised manner via an algorithm (Active Deep Network) based on Restricted Boltzmann Machines (Ruangkanokmas et al., 2016).

Three approaches are discussed next:
  • DFS with HLF
  • DFS as a reconstruction problem
  • DFS via weight learning


Feature selection with higher–level features

HLF as an input for feature selection

Higher–level features can be used in place of the continuous input features as the input of a standard feature selection algorithm (Nezhad et al. 2016).


Feature selection with higher–level features (cont.)

HLF in a data transformation pipeline

  1. The data dimensionality is first reduced by PCA.
  2. A deep sparse encoding of the data is obtained (via stacked autoencoders).
  3. Finally, the learned higher–level features are used together with the original features (raw data) for classification (Fakoor et al., 2013).

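A simplified scikit-learn rendering of this pipeline, with PCA standing in for both the PCA step and the stacked sparse autoencoders (an assumption made to keep the sketch self-contained; the data and labels are hypothetical):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 50))            # hypothetical raw data
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # hypothetical labels

# Steps 1-2: dimensionality reduction (PCA standing in for PCA plus the
# stacked sparse autoencoders of Fakoor et al., 2013).
codes = PCA(n_components=10, random_state=0).fit_transform(X)

# Step 3: learned features concatenated with the raw features for classification.
X_aug = np.hstack([X, codes])
clf = LogisticRegression(max_iter=1000).fit(X_aug, y)
print(clf.score(X_aug, y))
```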

Deep feature selection as a reconstruction problem

An autoencoder/deep belief network is used to learn a sparse representation (code) of the input features. Features with a low reconstruction error are selected.

[Figure: autoencoder, Input → H1 … Hk … Hn → Output.]

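Given any trained reconstruction model (e.g. the autoencoder sketched earlier), the selection rule stated above reduces to ranking features by per-column reconstruction error. A hedged sketch, with a hypothetical reconstruction X_hat:

```python
import numpy as np

def select_by_reconstruction(X, X_hat, k):
    """Rank features by mean squared reconstruction error per column
    and keep the k best-reconstructed ones."""
    per_feature_mse = ((X - X_hat) ** 2).mean(axis=0)
    return np.argsort(per_feature_mse)[:k]

# Toy usage with a hypothetical reconstruction X_hat.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))
X_hat = X + rng.normal(scale=[0.1, 1, 0.2, 1, 0.1, 1, 0.3, 1], size=(100, 8))
print(select_by_reconstruction(X, X_hat, k=4))   # -> the low-noise columns
```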

Feature selection via weight learning

A neural model can be augmented with an additional layer that serves as a sparse one-to-one linear connection between the input and the first hidden layer (Li et al., 2016). The most important features are those with high weights after training.

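A hedged Keras sketch of such a layer: every input feature is scaled by its own trainable weight, with an L1 penalty encouraging sparsity (the penalty strength and architecture are illustrative, not the exact setup of Li et al., 2016):

```python
import tensorflow as tf

class OneToOne(tf.keras.layers.Layer):
    """Elementwise trainable scaling of the input features; the learned
    weight magnitudes serve as feature importance scores."""
    def build(self, input_shape):
        self.w = self.add_weight(
            name="w", shape=(input_shape[-1],), initializer="ones",
            regularizer=tf.keras.regularizers.l1(1e-3))
    def call(self, inputs):
        return inputs * self.w

one_to_one = OneToOne()
model = tf.keras.Sequential([
    tf.keras.Input(shape=(30,)),
    one_to_one,                                     # sparse one-to-one layer
    tf.keras.layers.Dense(16, activation="relu"),   # first hidden layer
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")
# After model.fit(...), rank the features by abs(one_to_one.w.numpy()).
```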

Summary


  • Feature selection is an important data (pre)processing step.
  • A variety of nature–inspired methods can be used to implement efficient feature selection schemes.
  • Deep feature selection is one of the hot topics in this area, bringing new opportunities and research challenges.


Data scientist’s life is full of wonderful options!


References

1. Pavel Krömer, Jan Platoš, Jana Nowaková, Václav Snášel: Optimal column subset selection for image classification by genetic algorithms. Annals of Operations Research 265(2): 205-222 (2018)
2. Pavel Krömer, Jan Platoš: Genetic algorithm for entropy-based feature subset selection. CEC 2016: 4486-4493
3. Pavel Krömer, Jan Platoš: Evolutionary Feature Subset Selection with Compression-based Entropy Estimation. GECCO 2016: 933-940
4. Y. Li, C.Y. Chen, and W. Wasserman: Deep Feature Selection: Theory and Application to Identify Enhancers and Promoters. Journal of Computational Biology, vol. 23, pp. 322-336, 2016
5. P. Ruangkanokmas, T. Achalakul, and K. Akkarajitsakul: Deep Belief Networks with Feature Selection for Sentiment Classification. 7th International Conference on Intelligent Systems, Modeling and Simulation, 2016
6. R. Fakoor, F. Ladhak, A. Nazi, and M. Huber: Using Deep Learning to Enhance Cancer Diagnosis and Classification. 30th International Conference on Machine Learning, 2013