Comparative Evaluation of Approaches to Propositionalization

Comparative Evaluation of Approaches to Propositionalization
Mark-A. Krogel, Otto-von-Guericke-Universität Magdeburg
Simon Rawles, University of Bristol
Filip Železný, Czech Technical University and University of Wisconsin, Madison
Peter A. Flach, University of Bristol
Nada Lavrač, Institute Jozef Stefan, Ljubljana
Stefan Wrobel, Friedrich-Wilhelms-Universität Bonn and Fraunhofer-Institut AiS
Krogel, Rawles, Železný, Flach, Lavrač, Wrobel: Comparative Evaluation of Approaches to Propositionalization

Introduction
• Propositionalization: largely automatic transformation of relational data into a single-table representation, followed by application of propositional learners
• In principle less powerful than searching the full first-order hypothesis space
• In practice often sufficient, efficient, and flexible
• Here: first comparative study using representatives of logic-oriented approaches (RSD, SINUS) and database-oriented approaches (RELAGGS)

Propositionalization
• An ILP learning task: given ground facts of the target predicate (examples) and clauses of background predicates, find a hypothesis that, together with the background theory, explains some properties of the examples
• Complete vs. partial approaches, general-purpose vs. special-purpose approaches
• Clauses are constructed from relational background knowledge and structural properties of individuals; calling these clauses for each individual produces the feature values

RSD
• Declarative bias similar to Progol/Aleph, e.g. :- modeb(3, hasCar(+train, -car)).
• Step 1: identification of all closed feature definitions (Prolog queries) corresponding to the declarations, e.g. hasCar(Train,Car), shape(Car,Shape), instantiate(Shape)
• Step 2: instantiation of variables plus feature filtering, e.g. hasCar(Train,Car), shape(Car,bucket)
• Step 3: creation of the propositionalized representation
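As a rough illustration of what the instantiated feature in Step 2 computes, the following Python sketch evaluates "hasCar(Train,Car), shape(Car,value)" for each train and collects the truth values into one table. The toy data and the names has_car, shape, make_shape_feature, and propositionalize are assumptions made for this example; RSD itself works on Prolog queries, so this is only an analogy, not its implementation.

```python
# Toy relational data (hypothetical): which cars belong to which train,
# and the shape of each car.
has_car = {"t1": ["c1", "c2"], "t2": ["c3"], "t3": ["c4", "c5"]}
shape = {"c1": "bucket", "c2": "rectangle", "c3": "rectangle",
         "c4": "ellipse", "c5": "bucket"}


def make_shape_feature(value):
    """Build the feature: hasCar(Train, Car), shape(Car, value)."""
    def feature(train):
        # True if at least one car of the train has the requested shape.
        return any(shape[car] == value for car in has_car.get(train, []))
    return feature


def propositionalize(trains, features):
    """Return one row per train with one boolean column per feature."""
    return {t: {name: f(t) for name, f in features.items()} for t in trains}


# Step 2 analogue: instantiate the shape variable with each observed constant.
features = {f"has_{v}_car": make_shape_feature(v)
            for v in sorted(set(shape.values()))}

# Step 3 analogue: build the single-table representation.
table = propositionalize(sorted(has_car), features)
for train, row in table.items():
    print(train, row)
```

Each row of the resulting table can then be handed to a propositional learner together with the class label of the train.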

RSD: Constraints & Pruning
• Language
  - argument modes & types, predicate recall
  - max feature length & variable depth
  - undecomposability: f1 <> f2 & f3
• Evaluation
  - non-triviality: |cov(f)| < |Data|
  - relevance: |cov(f)| > min
  - uniqueness: if cov(f1) = cov(f2) then discard the longer
• Pruning
  - large subspaces identified that contain only decomposable features
  - e.g. EW Trains: SearchTime -> +inf as MaxLength -> +inf
  - with pruning: SearchTime -> const as MaxLength -> +inf
  - if |cov(f)| < min then don’t refine f
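The three evaluation constraints can be read as a filter over the coverage sets of candidate features. The sketch below is a minimal Python rendering under assumptions: coverage is given as a set of example ids per feature, feature length as a literal count, and the function name filter_features and the toy values are invented for illustration. It does not reproduce RSD's actual search or pruning.

```python
def filter_features(coverage, length, n_examples, min_cov):
    """Keep features that pass non-triviality, relevance, and uniqueness.

    coverage:   feature name -> set of covered example ids (assumed input)
    length:     feature name -> number of literals in the feature
    n_examples: |Data|
    min_cov:    the relevance threshold called `min` on the slide
    """
    kept = {}
    seen = set()
    # Visit shorter features first, so that among features with identical
    # coverage the shorter one survives and the longer one is discarded.
    for name in sorted(coverage, key=lambda n: (length[n], n)):
        cov = frozenset(coverage[name])
        if len(cov) >= n_examples:   # violates |cov(f)| < |Data|
            continue
        if len(cov) <= min_cov:      # violates |cov(f)| > min
            continue
        if cov in seen:              # same coverage as a kept feature
            continue
        seen.add(cov)
        kept[name] = cov
    return kept


coverage = {
    "f1": {"t1", "t3"},
    "f2": {"t1", "t2", "t3"},   # covers everything: trivial
    "f3": {"t1", "t3"},         # same coverage as f1 but longer
    "f4": {"t2"},               # too little coverage when min_cov = 1
}
length = {"f1": 2, "f2": 1, "f3": 3, "f4": 2}

kept = filter_features(coverage, length, n_examples=3, min_cov=1)
print(sorted(kept))
```

The last pruning rule on the slide goes one step further than this filter: a feature below the coverage threshold is not refined at all, since refinements of a conjunctive feature cannot cover more examples than the feature itself.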

SINUS: Overview
• Developed from LINUS and its feature generation extension
• A modular transformational ILP experimentation platform
• Automated type construction
• Feature reduction
• Invocation of learner and back-translation of induced theory to first-order form
• Data as flattened Prolog facts + data definition
• Declarative bias similar to 1BC, e.g.
    train 1 train cwa
    train2car 2 1:train *:#car * cwa
    cshape 2 car #shape * cwa

SINUS: Step by step
• Step 1: construction of instantiated feature definitions, e.g. f_aaaa(A) :- train(A), hasCar(A,B), shape(B,bucket).
  - Recursive, left to right, considering current variable types and bindings
  - Constraints on the maximum number of literals, variables, and values in a type, and on the nature of variable reuse
• Step 2: feature set reduction (REDUCE)
• Step 3: creation of the propositionalized representation
• After learning: transformation of the result into a first-order hypothesis

RELAGGS
• Declarative bias from foreign key relationships in the relational database schema
• After example identifier propagation to non-target relations:
  - Step 1: summarize each non-target relation by example id: avg, max, min, sum, stdev, range, and quartiles for numeric data; counts of possible values for nominal attributes; plus some two-column aggregates
  - Step 2: creation of the propositionalized representation by concatenating aggregate function values to the target relation
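The two RELAGGS steps can be sketched in a few lines of Python. This is a minimal sketch under assumptions: the loans and transactions relations, their column names, and the helpers aggregate and join are hypothetical, only a subset of the listed aggregates is computed, and RELAGGS itself operates on a relational database rather than on in-memory lists.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical target relation and one non-target relation to which the
# example identifier (loan_id) has already been propagated.
loans = [{"loan_id": 1, "status": "ok"},
         {"loan_id": 2, "status": "problematic"}]
transactions = [
    {"loan_id": 1, "amount": 100.0, "type": "credit"},
    {"loan_id": 1, "amount": 300.0, "type": "debit"},
    {"loan_id": 2, "amount": 50.0, "type": "credit"},
]


def aggregate(rows, key, numeric, nominal):
    """Step 1: summarize a non-target relation per example id."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    nominal_values = sorted({row[nominal] for row in rows})
    summary = {}
    for example_id, group in groups.items():
        values = [row[numeric] for row in group]
        feats = {
            "count": len(group),
            f"{numeric}_avg": mean(values),
            f"{numeric}_max": max(values),
            f"{numeric}_min": min(values),
            f"{numeric}_sum": sum(values),
        }
        # One count column per possible value of the nominal attribute.
        for value in nominal_values:
            feats[f"{nominal}_{value}_count"] = sum(
                1 for row in group if row[nominal] == value)
        summary[example_id] = feats
    return summary


def join(target, summary, key):
    """Step 2: concatenate the aggregate values to the target relation."""
    return [{**row, **summary.get(row[key], {})} for row in target]


table = join(loans, aggregate(transactions, "loan_id", "amount", "type"),
             "loan_id")
for row in table:
    print(row)
```

Target rows without any related rows would receive no aggregate columns in this sketch; a fuller version would need to decide how to fill them.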

Learning Tasks
• Trains: 20 trains east- or west-bound?
• King-Rook-King: 1000 board states legal or not?
• Mutagenesis: 188 molecules mutagenic or not?
• PKDD Challenges 1999/2000: 682 loans problematic or not?
• KDD Cup 2001: 862 genes/proteins with certain function or not and with certain localization or not?
• Numbers of predicates/relations depend on modeling issues.

Procedure
• Usual starting point: Prolog representation of target predicate facts and background predicate definitions, with SQL scripts generated from those if necessary
• Manual construction of declarations, propagation of ids if necessary
• Application of RSD, SINUS, and RELAGGS to produce single-table representations of the relational input data, with different parameter settings to produce feature sets of different sizes
• Application of WEKA's J48 (10-fold stratified cross-validation) to those tables
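To make the evaluation protocol concrete, the sketch below shows one simple way to form stratified folds in plain Python: examples are shuffled within each class and then dealt round-robin into k folds, so every fold keeps roughly the overall class proportions. The function name stratified_folds, the seed, and the 10/10 label split are assumptions for illustration; WEKA's own fold construction may differ in detail, and the J48 learner itself is not reproduced here.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k=10, seed=0):
    """Split example indices into k folds with similar class proportions."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    position = 0
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal the examples of this class round-robin, continuing where the
        # previous class stopped so that fold sizes stay balanced.
        for idx in indices:
            folds[position % k].append(idx)
            position += 1
    return folds


# Illustrative labels for a 20-example task (assumed 10/10 class split).
labels = ["east"] * 10 + ["west"] * 10
folds = stratified_folds(labels, k=10)

for i, test_idx in enumerate(folds):
    held_out = set(test_idx)
    train_idx = [j for j in range(len(labels)) if j not in held_out]
    # A learner would be trained on train_idx and evaluated on test_idx here.
    print(i, sorted(labels[j] for j in test_idx), len(train_idx))
```

Accuracy is then averaged over the k held-out folds.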

Results: Accuracies (1)
[Slide content (accuracy results) not preserved in this transcript]

Results: Runtimes
• Different platforms, hence times are only indicators

Task             RSD       SINUS        RELAGGS
Trains           < 1 sec   2 - 10 min   < 1 sec
King-Rook-King   < 1 sec   2 - 6 min    n. a.
Mutagenesis      5 min     6 - 15 min   30 sec
PKDD99-00        5 sec     2 - 30 min   30 sec
KDD01 fct        3 min     30 min       1 min
KDD01 loc        3 min     30 min       1 min

Discussion
• Not generally conclusive in favor of any approach: each approach wins on two tasks
• Aggregation is strong in some domains, where counting features are relevant (Trains) or many numeric attributes exist in the original data
• Differences between RSD and SINUS are mainly due to differences in constraining the language bias
• RELAGGS is the most efficient for many tasks; runtime differences between RSD and SINUS are possibly caused by pruning or by the Prolog systems used

Related Work
• LINUS/DINUS (Lavrač and Džeroski 1994)
• Stochastic propositionalization (Kramer et al. 1998)
• Bottom-up propositionalization (Kramer 2000)
• Lazy propositionalization (Alphonse and Rouveirol 2000)
• ...

Future Work and Conclusion
• General:
  - Completion of formal framework
  - Comparison to other ILP approaches such as Progol and Tilde
  - Extension of feature subset selection mechanisms
  - Experiments with other propositional learners such as SVMs
  - Combination of the features produced by the approaches here
• RSD: construction of first-order hypotheses
• SINUS: improvements of feature elimination, bias control
• RELAGGS: integration with dynamic relational databases
• Promising approaches with many questions left open!
