An Ensemble-based Feature Selection Methodology for Case-Based - PowerPoint PPT Presentation

An Ensemble-based Feature Selection Methodology for Case-Based Learning PhD. Dissertation Presentation Maqbool Ali 1,2 1 Department of Computer Science and Engineering, Kyung Hee University, South Korea Email: maqbool.ali@oslab.khu.a.c.kr 2 School of Engineering and ICT, University of Tasmania, Australia Advisor: Advisor: Email: maqbool.ali@utas.edu.ac ( Prof. Sungyoung Lee ) ( Prof. Byeong Ho Kang ) 04 th May, 2018

Agenda • Introduction • Background • Motivation • Problem statement • Research Taxonomy • Related work • Proposed methodology • Overview • Workflow • Experiment & results • Dataset • Experimental setup • Results & discussion • Conclusion • Contribution & Uniqueness • Future work • Publications • References 04/05/2018 2

I ntroduction Experiment & results References Related work Conclusion Proposed methodology Publications Background • In medical education domain, Case-Based Learning (CBL) is known to be an effective learning approach for medical students at undergraduate level education as well as for professional development [1-3]. ─ CBL is a shared learning approach in which small-groups of medical students are involved in discussion to identify and solve the patient’s problem [1]. • In CBL practice, ─ the clinical case is a key component in learning activities, which includes basic , social , and clinical studies of the patient [1]. It provides a foundation to understand the situation of a disease. An example of a clinical case  To interact with the patients  To deal with a variety of cases during his/her clinical practical life  Better learning can play an important role in actual practice Medical Students Goal  Human can not  Structured knowledge  perform fast reasoning For better learning  accomplish complex computation can be:  Better decision decision  Queried making CBL Declarative knowledge is a type of knowledge,  Analyzed which tells us facts: what things are. Domain Knowledge  Visualized  “Blood disease is a symptom of diabetes” (i.e. Structured Declarative Knowledge) 04/05/2018 3

I ntroduction Experiment & results References Related work Conclusion Proposed methodology Publications Background and Motivation Feature Selection Knowledge Construction Methods [6] Domain Text Preprocessing Documents Text Transformation Feature Selection Filter Methods Wrapper Methods Embedded Methods Domain Knowledge Terms Extraction + Performs simple and + Conducts a subset search + Requires less Relations Extraction fast computation with an optimal algorithm computation than + Not dependent on the + Better classification wrapper method Model Construction classification algorithm − Specific to a learning accuracy − Decreases classification − Higher risk of over fitting machine − High computational cost performance  Examples: Information  Text Mining is the process of deriving high-quality information  Examples: Information  Examples: Sequential Gain + Genetic from an unstructured text [4]. It involves the application of Gain, Chi-Squared, Forward or Backward Algorithm etc. techniques from information retrieval , natural language ReliefF etc. Selection, Genetic processing , information extraction , and data mining . Algorithm etc.  For constructing domain knowledge, Methodology Comprehensive Not dependent on the ─ Feature selection is an important and critical step in text O A classification algorithm evaluation of feature set mining [5]. R Large number of features selection methods available Ensemble feature Better accuracy O A selection Each method has capabilities and limitations R R Reasons O Characteristics A Advantages 04/05/2018 4

I ntroduction Experiment & results References Related work Conclusion Proposed methodology Publications Problem Statement For an automated CBL, a reliable structured knowledge construction is a challenging task [7]. The key challenge in this regard is to select the relevant features for the following reasons: ─ The irrelevant input features induces greater computational cost [6, 8]. ─ Finding an optimal cut-off value to select important features is problematic [9]. Goal ─ Innovate students’ learning by transforming the unstructured text into structured knowledge with the support of an efficient feature selection methodology. Objectives 1. To design and develop an efficient feature selection methodology to filter out the irrelevant input features for structured knowledge construction process. 2. To innovate the case-based learning approach for better clinical proficiency. • Challenges Challenges 1. How to compute the ranks of features without any individual statistical biases of state-of-the- art feature ranking methods? [10] (e.g., information gain is biased towards choosing feature with large number of value. Similarly, chi square, symmetric uncertainty, and gain ratio are sensitive to sample size. 2. How to provide an empirical method to specify a minimum threshold value for retaining important features? [11] 3. How to design the case-based learning approach to make it interactive and effective? [12] 04/05/2018 5

I ntroduction Experiment & results References Related work Conclusion Proposed methodology Publications Research Taxonomy [13, 14] Chosen  Ranking approach is  The filter methods [15-17]: considered an attractive (i) are generally much faster and approach due to its have less computational costs simplicity, scalability, than wrapper and embedded and good empirical methods, success [14, 18]. (ii) are better suited to high dimensional datasets.  Statistical measures provides good performance in various domains [19].  Information theoretic measures such as entropy are good measures to quantify the uncertainty of features and provides good performance in various domains [13, 19]. Figure: Dimensionality reduction and different categories of feature ranking methods. 04/05/2018 6

Introduction Experiment & results References Related work Conclusion Proposed methodology Publications Related Work Reference Features Limitations [20] Onan and Korukoğlu , A feature selection • Presented an ensemble approach for feature selection, which aggregates • Genetic algorithm (GA) was used for producing an aggregate model based on genetic rank aggregation for text the several individual feature lists obtained by the different feature ranked list, which is relatively more expensive technique than a sentiment classification, 2017. selection methods such as Information gain , Gain ratio , Chi-squared, weighted aggregate technique. Pearson Correlation, ReliefF . • Experiments were primarily performed a binary-class problem . • Used Naïve Bayes and kNN classifiers Hence, it is not clear how would the proposed method will deal with more complex datasets? [11] Osanaiye et al., Ensemble-based multi-filter • Presented an ensemble-based multi-filter feature selection method that • A fixed threshold value i.e. 1/3 of a feature set, was defined a priori Feature Selection feature selection method for DDoS detection in combines the output of Information gain , Gain ratio , Chi-squared and irrespective of the characteristics of the dataset. cloud computing, 2016. ReliefF to select important features. [10] Sarkar et al., Robust feature selection • Proposed a technique that aggregates the Information gain , Chi-Square , • This technique is not comprehensive enough to provide a final technique using rank aggregation, 2014. and Symmetric Uncertainty feature selection methods to develop an subset of features. Hence, a domain expert would still needed to optimal solution. make an educated guess regarding the final subset. [13] Sadeghi and Beigy, A new ensemble method • Proposed a heterogeneous ensemble-based algorithm for feature ranking • This method requires user to specify a θ value. for feature ranking in text mining, 2013. using Information gain, Relief, and DRB-FS features ranking methods . • Moreover, user is given an additional task of defining the notion of • Adopted borda method for features voting relevancy and redundancy of a feature. • Determined the threshold using genetic algorithm. • The proposed wrapper-based method is tightly coupled with the performance evaluation of a single classifier i.e. SVM, hence losing the generality of the method. [21] University of Texas Medical Branch UTMB, • Provides facility to develop case(s) • This approach does not provide domain knowledge support for CBL Case-Based Learning Design a case (DAC), 2017. • Delivers virtual patient encounters to students on any health related topic practice • Support of anywhere accessible [22] The University of New Mexico, Extension for • Provides services for remote patient care • Lacks of an interactive case authoring and its formulation support community healthcare outcomes (ECHO), 2016. • Conducts virtual clinics using multi-point videoconferencing • Lacks of domain knowledge support for CBL practice [23] Chen et al., Applications of a time sequence • Developed a web-based learning system that followed the development • Lacks of feedback support mechanism in the simulation cases of a web-based of the real-world clinical situation • Lacks of domain knowledge support for CBL practice medical problem-based learning system, 2009. 04/05/2018 7

An Ensemble-based Feature Selection Methodology for Case-Based - PowerPoint PPT Presentation

An Ensemble-based Feature Selection Methodology for Case-Based Learning PhD. Dissertation Presentation Maqbool Ali 1,2 1 Department of Computer Science and Engineering, Kyung Hee University, South Korea Email: maqbool.ali@oslab.khu.a.c.kr 2

Variable selection bias Bias in Ensemble Bias in Ensemble Methods Methods Variable selection

Outline Reducing Dimensionality Feature Selection 1 Steven J Zeil Feature Extraction 2

Reducing Dimensionality Steven J Zeil Old Dominion Univ. Fall 2010 1 Feature Selection

Decision Tree Prof. Seungchul Lee Industrial AI Lab. Feature Test Feature 1 Feature 2 Feature

Feature Selection: ROC and Subset Selection Theodoridis 5.5-5.7 Using ROC for Feature Selection

Boosting (ensemble) Module 4 - Ensemble classifiers - Objectives module 4: boosting (ensemble

A Distinctive Feature of A Distinctive Feature of A Distinctive Feature of A Distinctive Feature

Week 3 Video 4 Automated Feature Generation Automated Feature Selection Automated Feature

ERP Selection KIRTANE & PANDIT Suhas Deshpande Why ERP Selection is important ?

Mutual Information an Adequate Tool for Feature Selection ? Benot Frnay November 15, 2013

Earth: The Feature Presentation - feature, landscape, topography Earth: The Feature Presentation

SELECTION Deterministic Stochastic Proportionate selection: Roulette Wheel Selection

Evaluating the Spread of Climate Model Ensembles Based on Computing Environment Selection Tom

PCA & ICA CE-717: Machine Learning Sharif University of Technology Spring 2018 Soleymani

Feature Point Feature-based approach: Detect and match feature Detec.on and Matching points

SECONDHAND SELECTION Sales Price - 275,000.00 EU SECONDHAND SELECTION INTERNAL VIEWS SECONDHAND

A Micro-Systemic Approach for Dependable Natural Language Processing Sylviane CARDEY & Peter

Interoperability Presentations Assistant Director, Maewyn Cumming New Directions in Government ,

Software Development, Variables, Working with User Input, Math CSCI-UA.0002-005 Operators &

A Solid Foundation of A Solid Foundation of Semantic Computing Semantic Computing toward Web

Some of the Team Facebook AI Intelligent Dialogue tests Tests 5 99.5% represents an error in the

Mixed-effects regression and eye-tracking data Lecture 2 of advanced regression methods for

FEE SHIFTING IN PATENT LITIGATION Sughrue Mion, PLLC Abraham J. Rosner May 2014 I.

United States Court of Appeals for the Federal Circuit 2007-1262 BALDWIN GRAPHIC SYSTEMS, INC.,

An Ensemble-based Feature Selection Methodology for Case-Based - PowerPoint PPT Presentation

An Ensemble-based Feature Selection Methodology for Case-Based Learning PhD. Dissertation Presentation Maqbool Ali 1,2 1 Department of Computer Science and Engineering, Kyung Hee University, South Korea Email: maqbool.ali@oslab.khu.a.c.kr 2

Variable selection bias Bias in Ensemble Bias in Ensemble Methods Methods Variable selection

Outline Reducing Dimensionality Feature Selection 1 Steven J Zeil Feature Extraction 2

Reducing Dimensionality Steven J Zeil Old Dominion Univ. Fall 2010 1 Feature Selection

Decision Tree Prof. Seungchul Lee Industrial AI Lab. Feature Test Feature 1 Feature 2 Feature

Feature Selection: ROC and Subset Selection Theodoridis 5.5-5.7 Using ROC for Feature Selection

Boosting (ensemble) Module 4 - Ensemble classifiers - Objectives module 4: boosting (ensemble

A Distinctive Feature of A Distinctive Feature of A Distinctive Feature of A Distinctive Feature

Week 3 Video 4 Automated Feature Generation Automated Feature Selection Automated Feature

ERP Selection KIRTANE &amp; PANDIT Suhas Deshpande Why ERP Selection is important ?

Mutual Information an Adequate Tool for Feature Selection ? Benot Frnay November 15, 2013

Earth: The Feature Presentation - feature, landscape, topography Earth: The Feature Presentation

SELECTION Deterministic Stochastic Proportionate selection: Roulette Wheel Selection

Evaluating the Spread of Climate Model Ensembles Based on Computing Environment Selection Tom

PCA &amp; ICA CE-717: Machine Learning Sharif University of Technology Spring 2018 Soleymani

Feature Point Feature-based approach: Detect and match feature Detec.on and Matching points

SECONDHAND SELECTION Sales Price - 275,000.00 EU SECONDHAND SELECTION INTERNAL VIEWS SECONDHAND

A Micro-Systemic Approach for Dependable Natural Language Processing Sylviane CARDEY &amp; Peter

Interoperability Presentations Assistant Director, Maewyn Cumming New Directions in Government ,

Software Development, Variables, Working with User Input, Math CSCI-UA.0002-005 Operators &amp;

A Solid Foundation of A Solid Foundation of Semantic Computing Semantic Computing toward Web

Some of the Team Facebook AI Intelligent Dialogue tests Tests 5 99.5% represents an error in the

Mixed-effects regression and eye-tracking data Lecture 2 of advanced regression methods for

FEE SHIFTING IN PATENT LITIGATION Sughrue Mion, PLLC Abraham J. Rosner May 2014 I.

United States Court of Appeals for the Federal Circuit 2007-1262 BALDWIN GRAPHIC SYSTEMS, INC.,

ERP Selection KIRTANE & PANDIT Suhas Deshpande Why ERP Selection is important ?

PCA & ICA CE-717: Machine Learning Sharif University of Technology Spring 2018 Soleymani

A Micro-Systemic Approach for Dependable Natural Language Processing Sylviane CARDEY & Peter

Software Development, Variables, Working with User Input, Math CSCI-UA.0002-005 Operators &