Logics in Machine Learning and Data Mining: Achievements and Open - PowerPoint PPT Presentation

Logics in Machine Learning and Data Mining: Achievements and Open Issues Francesca A. Lisi University of Bari “Aldo Moro” Department of Computer Science Lab of Knowledge Acquisition and Machine Learning (LACAM) francesca.lisi@uniba.it June 19, 2019 F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 1 / 28

Overview Introduction 1 Three cases for Logics in ML/DM 2 Combining rules and ontologies Dealing with imprecision and granularity Modeling and metamodeling Final remarks 3 References 4 F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 2 / 28

Introduction I Model-free AI vs. Model-based AI Current hype about AI due to many successful ML applications Deep learning follows the model-free approach ML tasks are function-fitting problems! Need to construct and use models (“good old-fashioned AI”) Logics and probability as main tools Is there any ML algorithm following the model-based approach? F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 3 / 28

Introduction II Inductive Logic Programming in a nutshell Major logic-based approach to rule learning [Muggleton, 1990] Concept Learning within the LP framework Use of background knowledge (BK) Bunch of techniques for structuring, searching and bounding the hypothesis space [Nienhuys-Cheng and de Wolf, 1997] Two representative ILP algorithms: Foil [Quinlan, 1990] 1 Warmr [Dehaspe and Toivonen, 1999] 2 Several extensions, e.g. , towards statistical learning and other probabilistic approaches (see [Riguzzi et al., 2014] for a survey). F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 4 / 28

Three cases for Logics in ML/DM 1 Combining rules and ontologies 2 Dealing with imprecision and granularity 3 Modeling and metamodeling F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 5 / 28

Combining rules and ontologies I Logic Programming vs. Descripion Logics 1 CWA vs OWA 2 Single vs. multiple models 3 Negation as failure vs. classical negation 4 Strong negation vs. classical negation 5 Treatment of equality Unique Names Assumption (UNA) [Reiter, 1980] might not hold in DLs 6 Existential quantification Recent work on Datalog ± [Cal` ı et al., 2009] 7 Decidability F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 6 / 28

Combining rules and ontologies II Exploiting the power of combination Combination is more than the sum of the parts Very expressive FOL languages as an outcome Solutions to the semantic mismatch between LP and DLs Approaches Non-hybrid Combination within a homogeneous semantic framework e.g. , description logic programs [Grosof et al., 2003] Hybrid Combination within a heterogeneous semantic framework e.g. , AL - log [Donini et al., 1998] F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 7 / 28

Combining rules and ontologies III Learning hybrid rules with ILP Learning Carin - ALN rules Learning AL - log rules Learning SHIQ + log [Rouveirol and Ventos, 2000, [Lisi, 2008] rules [Lisi, 2010] Kietz, 2003] prior knowledge Carin - ALN KB AL - log KB SHIQ + log KB ontology language ALN ALC SHIQ rule language HCL Datalog Datalog hypothesis language Carin - ALN non-recursive AL - log non-recursive rules SHIQ + log non- rules recursive rules target predicate Horn predicate Datalog predicate SHIQ / Datalog predicate logical setting interpretations interpretations/entailment entailment scope of induction prediction prediction/description prediction/description generality order generalized subsumption generalized subsumption generalized subsumption DL + log ∨ query answer- coverage test Carin query answering AL - log query answering ing ref. operators n.a. downward downward/upward implementation unknown yes, see [Lisi, 2011] no application no yes, see no [Lisi and Malerba, 2004] F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 8 / 28

Combining rules and ontologies IV AL - QuIn : general features [Lisi, 2011] Task: multi-level association rule mining Method: levelwise search BK: hybrid (relational DB + ontology) Knowledge representation formalism: AL - log . Upgrade from: Warmr . F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 9 / 28

Combining rules and ontologies V AL - QuIn : problem statement Given a relational data set Π a taxonomic ontology Σ a multi-grained language L = {L l } 1 ≤ l ≤ maxG a set { minsup l } 1 ≤ l ≤ maxG of support thresholds the problem of frequent pattern discovery in Π at l levels of description granularity w.r.t. Σ, 1 ≤ l ≤ maxG , is to find the set F of all the patterns P ∈ L l frequent in B = (Π , Σ), namely P ’s with support s such that (i) s ≥ minsup l and (ii) all ancestors of P w.r.t. Σ are frequent. F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 10 / 28

Combining rules and ontologies VI AL - QuIn : An example Q 1 = q(X) ← believes(X,Y) & MiddleEastCountry(X) Q 3 = q(X) ← believes(X,Y) & MiddleEastCountry(X) , Religion(Y) Q 4 = q(X) ← believes(X,Y) & MiddleEastCountry(X) , MonotheisticReligion(Y) Q 5 = q(X) ← believes(X,Y) , speaks(X,Z) & MiddleEastCountry(X) , MonotheisticReligion(Y), IndoEuropeanLanguage(Z) F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 11 / 28

Dealing with imprecision and granularity I Fuzzy Description Logics Several ways of extending DLs with fuzzy logic [Straccia, 2013] Some ad-hoc reasoners already available ( e.g. , FuzzyDL [Bobillo and Straccia, 2008]) Fuzzy quantifiers in Fuzzy DLs [Sanchez and Tettamanzi, 2006] Proposal of Fuzzy OWL 2 [Bobillo and Straccia, 2010] F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 12 / 28

Dealing with imprecision and granularity II Fuzzy EL ( D ) [Straccia, 2005] Complex concepts built according to the following syntactic rules: C → ⊤ | ⊥ | A | C 1 ⊓ C 2 | ∃ R . C | ∃ T . d where d can be one of the membership functions of fuzzy sets ⊓ and ∃ are interpreted as truth combination functions Concepts are interpreted as fuzzy sets Axioms are graded, i.e. have a truth degree α (if omitted, α = 1) e.g., I satisfies an axiom � a : C , α � if C I ( a I ) ≥ α The best entailment degree of an axiom τ w.r.t. K is defined as bed ( K , τ ) = sup { α | K | = � τ, α �} . (1) For a crisp axiom τ , we also write K | = + τ iff bed ( K , τ ) > 0. F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 13 / 28

Dealing with imprecision and granularity III Fuzzy DL Learning [Konstantopoulos and Charalambidis, 2010] propose an ad-hoc translation of fuzzy � Lukasiewicz ALC DL constructs into LP in order to apply a conventional ILP method for rule learning. Unsound method [Iglesias and Lehmann, 2011] propose to interface CELOE with the fuzzyDL reasoner Uncomparable method [Lisi and Straccia, 2013] present a method for learning fuzzy EL GCI axioms from fuzzy DL-Lite KBs. Unimplemented method F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 14 / 28

Dealing with imprecision and granularity IV Foil - DL : general features [Lisi and Straccia, 2014] Task: classification rule mining Method: sequential covering BK: DL KB Hypothesis description language: fuzzy EL ( D ). Upgrade from: Foil . F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 15 / 28

Dealing with imprecision and granularity V Foil - DL : problem statement Given: a consistent DL KB K = �T , A� (the background theory ); an atomic concept A t (the target concept ); a set E = E + ∪ E − of crisp DL concept assertions labelled as either positive or negative examples for H (the training set ); a set L H of fuzzy EL ( D ) GCIs of the form C ⊑ A t (the language of hypotheses ) where C is a complex concept Find: a set H ⊂ L H (a hypothesis ) such that: Completeness. ∀ e ∈ E + , K ∪ H | = + e , and Consistency. ∀ e ∈ E − , K ∪ H �| = + e . F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 16 / 28

Dealing with imprecision and granularity VI Foil - DL : An example Confidence Axiom 1,000 Hostel subclass of Good_Hotel 1,000 hasPrice_veryhigh subclass of Good_Hotel 0,739 hasDistance some (isDistanceFor some (Bus_Station) and hasValue_low) and hasDistance some (isDistanceFor some (Town_Hall) and hasValue_fair) and hasRank some (Rank) and hasPrice_verylow subclass of Good_Hotel 0,569 hasPrice_high subclass of Good_Hotel 0,289 Hotel_3_Stars and hasDistance some (isDistanceFor some (Train_Station) and hasValue_verylow) and hasPrice_fair subclass of Good_Hotel 0,198 Hotel_4_Stars and hasDistance some (isDistanceFor some (Square) and hasValue_high) and hasRank some (Rank) and hasPrice_fair subclass of Good_Hotel F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 17 / 28

Dealing with imprecision and granularity VII Information granulation and DLs [Lisi and Mencar, 2017, Lisi and Mencar, 2018] Support to both coarser-granularity fuzzy quantified sentences such as “Many hotels have a low distance from attractions” and finer-granularity fuzzy quantified sentences such as “Hotel Verdi has a low distance from many attractions” F.A. Lisi (Univ. Bari) Logics in ML/DM June 19, 2019 18 / 28

Logics in Machine Learning and Data Mining: Achievements and Open - PowerPoint PPT Presentation

Logics in Machine Learning and Data Mining: Achievements and Open Issues Francesca A. Lisi University of Bari Aldo Moro Department of Computer Science Lab of Knowledge Acquisition and Machine Learning (LACAM) francesca.lisi@uniba.it

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Description Logics Description Logics and Logics Enrico Franconi franconi@cs.man.ac.uk

Applications of Description Logics IJCAR 2001: Description Logics tutorial p.1/9 Application

Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 1 of Data Mining by

Introduction What is data mining? to Data mining functionalities Data Mining Major

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Chapter 1 Logics Course Model checking Volker Stolz, Martin Steffen Autumn 2019 Chapter

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

Ensemble Asteroseimology of 16,000 Kepler red giants Jie Yu 1,2 , Tim Bedding 1,2 , Daniel Huber

MEANING, CHOICE and ALGEBRAIC SEMANTICS of SIMILARITY BASED ROUGH SET THEORY A. Mani Member,

PROBING STELLAR DYNAMICS WITH SPACE PHOTOMETRY RESULTS FROM COROT AND KEPLER Rafael A. Garca

ASTR 1120 ASTR 1120 General Astronomy: General Astronomy: Stars & Galaxies Stars &

Equidistribution for groups of toral automorphisms J. Bourgain A. Furman E. Lindenstrauss S.

Advanced Space Weather Modeling Ward Manchester, Gabor Toth & Yuxi Chen (University of

Absolute Solar Abundances Absolute Solar Abundances From Helioseismology Helioseismology From

Radial Velocity Limitations To The Detection Of Giant Planets 8 6 4 2 0 2 4 6

Sambuz

Useful Links

Newsletter

Mail Us

Logics in Machine Learning and Data Mining: Achievements and Open - PowerPoint PPT Presentation

Logics in Machine Learning and Data Mining: Achievements and Open Issues Francesca A. Lisi University of Bari Aldo Moro Department of Computer Science Lab of Knowledge Acquisition and Machine Learning (LACAM) francesca.lisi@uniba.it

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Description Logics Description Logics and Logics Enrico Franconi franconi@cs.man.ac.uk

Applications of Description Logics IJCAR 2001: Description Logics tutorial p.1/9 Application

Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 1 of Data Mining by

Introduction What is data mining? to Data mining functionalities Data Mining Major

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Chapter 1 Logics Course Model checking Volker Stolz, Martin Steffen Autumn 2019 Chapter

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

Ensemble Asteroseimology of 16,000 Kepler red giants Jie Yu 1,2 , Tim Bedding 1,2 , Daniel Huber

MEANING, CHOICE and ALGEBRAIC SEMANTICS of SIMILARITY BASED ROUGH SET THEORY A. Mani Member,

PROBING STELLAR DYNAMICS WITH SPACE PHOTOMETRY RESULTS FROM COROT AND KEPLER Rafael A. Garca

ASTR 1120 ASTR 1120 General Astronomy: General Astronomy: Stars &amp; Galaxies Stars &amp;

Equidistribution for groups of toral automorphisms J. Bourgain A. Furman E. Lindenstrauss S.

Advanced Space Weather Modeling Ward Manchester, Gabor Toth &amp; Yuxi Chen (University of

Absolute Solar Abundances Absolute Solar Abundances From Helioseismology Helioseismology From

Radial Velocity Limitations To The Detection Of Giant Planets 8 6 4 2 0 2 4 6

Sambuz

Useful Links

Newsletter

Mail Us

ASTR 1120 ASTR 1120 General Astronomy: General Astronomy: Stars & Galaxies Stars &

Advanced Space Weather Modeling Ward Manchester, Gabor Toth & Yuxi Chen (University of