CSI5180. MachineLearningfor BioinformaticsApplications Rule Learning - PowerPoint PPT Presentation

CSI5180. MachineLearningfor BioinformaticsApplications Rule Learning by Marcel Turcotte Version November 21, 2019

Preamble Preamble 2/49

Preamble Rule Learning Chances are that you have never heard the term rule learning despite the fact that it is one of the oldest paradigms in machine learning . Particularly now, the emphasis is on developing machine learning algorithms with exceptionally high “accuracy”. We have deep learning algorithms with superhuman powers classifying images, detecting cancer from medical images, or defeating the world champions of Go, one of the most challenging games. In this lecture, we focus on a set of methods putting the emphasis on interpretability rather than numerical performance. General objective : Explain rule learning in your own words Preamble 3/49

Learning objectives Justify the need (or not) for interpretability Explain rule learning in your own words Reading: Fürnkranz, D. Gamberger, and N. Lavrač. Foundations of Rule Learning. Cognitive Technologies. Springer Berlin Heidelberg, 2012. King, R. D. et al. The automation of science. Science 324 , 8589 (2009). Sparkes, A. et al. Towards Robot Scientists for autonomous scientific discovery. Autom Exp 2 , 1 (2010). King, R. D., Schuler Costa, V., Mellingwood, C. & Soldatova, L. N. Automating Sciences: Philosophical and Social Dimensions. IEEE Technology and Society Magazine 37 , 4046 (2018). Preamble 4/49

Plan 1. Preamble 2. Introduction 3. Building blocks 4. Science (fiction) 5. Current research 6. Prologue Preamble 5/49

2020 Make this the last lecture of the term. Preamble 6/49

Introduction Introduction 7/49

Rule Learning , a vast and diverse continent that you may never have heard of. Introduction 8/49

Globin-like f o l d ( ’ Globin − l i k e ’ , X) : − adjacent (X, A, B, 1 , h , h ) , has_pro (B) . Introduction 9/49

Flavodoxin, Rossman-fold, TIM-barrel f o l d ( ’ Flavodoxin − l i k e ’ ,A) : − nb_alpha (A, B) , nb_beta (A, B) , i n t e r v a l _ l (B ≤ 6 ) . f o l d ( ’NAD(P) − binding Rossmann − f o l d domains ’ ,A) : − nb_alpha (A, B) , nb_beta (A, B) , i n t e r v a l (5 ≤ B ≤ 7 ) . f o l d ( ’ beta / alpha (TIM) − b a r r e l ’ ,A) : − nb_alpha (A, B) , nb_beta (A, B) , i n t e r v a l (8 ≤ B ≤ 16 ) . The number of strands is the same as the number of helices, however, that number is variable. Introduction 10/49

Beta-grasp f o l d ( ’ beta − Grasp ’ ,A) : − adjacent (A,B, C ,2 , e , h ) , adjacent (A, C ,D, 1 , h , e ) , c o i l (C,D, 3 ) . This rule effectively describes a relation involving three secondary structure elements, β 2 - α 1 - β 3 , although no triple relationship was explicitly encoded in the background knowledge. Introduction 11/49

SH3 f o l d (A, ’SH3 − l i k e b a r r e l ’ ) : − number_strands (4 = < A = < 7) , sheet (A, B, a n t i ) , has_n_strands (B, 5) , strand (A, C, B, 1) , strand (A, D, B, − 1), a n t i p a r a l l e l (C, D) . The first and the last are anti-parallel! Introduction 12/49

SH3 (1bia) (d1bb) (d1pht) (2ahj) Introduction 13/49

“Inductive” Logic Programming Examples: Phycocyanin adopts a globin fold. Hemoglobin adopts a globin fold. Oct-1 POU Homeodomain is not a globin. + Background: The second helix in phycocyanin contains a proline. To calculate the hydrophobic moment . . . ⇓ Hypothesis: The first helix is followed by another one that contains a proline. Introduction 14/49

Keywords Knowledge discovery Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Background knowledge Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Background knowledge How can we make effective use of accumulated knowledge ? Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Background knowledge How can we make effective use of accumulated knowledge ? Relational information Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Background knowledge How can we make effective use of accumulated knowledge ? Relational information Can we learn complex interactions between sub-structures? Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Background knowledge How can we make effective use of accumulated knowledge ? Relational information Can we learn complex interactions between sub-structures? Interpretability Introduction 15/49

Keywords Knowledge discovery Can expert-like knowledge be discovered automatically? Background knowledge How can we make effective use of accumulated knowledge ? Relational information Can we learn complex interactions between sub-structures? Interpretability How can we make hypotheses easily amenable to human interpretation ? Introduction 15/49

Buildingblocks Building blocks 16/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Building blocks 17/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Propositional (zero-order) logic Building blocks 17/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Propositional (zero-order) logic “If it’s raining then it’s cloudy” Building blocks 17/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Propositional (zero-order) logic “If it’s raining then it’s cloudy” First-order (predicate) logic Building blocks 17/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Propositional (zero-order) logic “If it’s raining then it’s cloudy” First-order (predicate) logic “there exists x such that x is Socrates and x is a man” Building blocks 17/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Propositional (zero-order) logic “If it’s raining then it’s cloudy” First-order (predicate) logic “there exists x such that x is Socrates and x is a man” J.W. Lloyd, Logic for learning: Learning comprehensible theories from structured data , Cognitive Technologies, Springer Berlin Heidelberg, 2003. Building blocks 17/49

Foundation These algorithms are based on formal logic , a sub-branch of mathematics. Propositional (zero-order) logic “If it’s raining then it’s cloudy” First-order (predicate) logic “there exists x such that x is Socrates and x is a man” J.W. Lloyd, Logic for learning: Learning comprehensible theories from structured data , Cognitive Technologies, Springer Berlin Heidelberg, 2003. Fürnkranz, D. Gamberger, and N. Lavrač. Foundations of Rule Learning . Cognitive Technologies. Springer Berlin Heidelberg, 2012. Building blocks 17/49

Task - concept (classification) Given: A data description language A target concept A hypothesis description language A coverage function , covered ( r , e ) A class attribute , C A set of positive examples , P A set of negative examples , N Find: A hypothesis which is: complete , covers all the examples, and consistent , predicts the correct class for all the examples. Adapted from [Fürnkranz et al., 2012] Figure 2.2. Building blocks 18/49

Completeness and consistency Source: [Fürnkranz et al., 2012] Figure 2.3. Building blocks 19/49

Definitions An instance is covered by a rule , if the description of the instance satisfies the conditions of the rule. Building blocks 20/49

Definitions An instance is covered by a rule , if the description of the instance satisfies the conditions of the rule. An example is correctly covered by a rule , if it is covered and the class of the rule is the same as the class of the example . Building blocks 20/49

Representation Propositional (attribute-value) rules . Building blocks 21/49

Representation Propositional (attribute-value) rules . The rules have the form: Building blocks 21/49

Representation Propositional (attribute-value) rules . The rules have the form: IF Conditions THEN c Building blocks 21/49

Representation Propositional (attribute-value) rules . The rules have the form: IF Conditions THEN c where Conditions is a conjunction (and) of simple tests (properties of the instance) and c is a class. Building blocks 21/49

Representation Propositional (attribute-value) rules . The rules have the form: IF Conditions THEN c where Conditions is a conjunction (and) of simple tests (properties of the instance) and c is a class. Corresponds to the implication in propositional logic , c ← Conditions . SportsCar ← HasChildren = No ∧ Sex = Male Building blocks 21/49

Representation Propositional (attribute-value) rules . The rules have the form: IF Conditions THEN c where Conditions is a conjunction (and) of simple tests (properties of the instance) and c is a class. Corresponds to the implication in propositional logic , c ← Conditions . SportsCar ← HasChildren = No ∧ Sex = Male Alternatively, first-order logic can be used to represent the data, the background knowledge, and the hypotheses. Building blocks 21/49

CSI5180. MachineLearningfor BioinformaticsApplications Rule Learning - PowerPoint PPT Presentation

CSI5180. MachineLearningfor BioinformaticsApplications Rule Learning by Marcel Turcotte Version November 21, 2019 Preamble Preamble 2/49 Preamble Rule Learning Chances are that you have never heard the term rule learning despite the fact

CSI5180. MachineLearningfor BioinformaticsApplications Fundamentals of Machine Learning tasks

CSI5180. MachineLearningfor BioinformaticsApplications Essential Cellular Biology (continued) by

CSI5180. MachineLearningfor BioinformaticsApplications Essential Cellular Biology by Marcel

CSI5180. MachineLearningfor BioinformaticsApplications Ensemble Learning by Marcel Turcotte

CSI5180. MachineLearningfor BioinformaticsApplications Hidden Markov Models by Marcel Turcotte

CSI5180. MachineLearningfor BioinformaticsApplications Essential Bioinformatics Skills by Marcel

CSI5180. MachineLearningfor BioinformaticsApplications Course overview by Marcel Turcotte

CSI5180. MachineLearningfor BioinformaticsApplications Fundamentals of Machine Learning

CSI5180. MachineLearningfor BioinformaticsApplications Fundamentals of Machine Learning

CSI5180. MachineLearningfor BioinformaticsApplications Deep learning encoding and transfer

CSI5180. MachineLearningfor BioinformaticsApplications Deep learning practical issues by

CSI5180. MachineLearningfor BioinformaticsApplications Regularized Linear Models by Marcel

Data Mining in Bioinformatics Day 8: Clustering in Bioinformatics Clustering Gene Expression Data

Composite repetition-aware text indexing Djamal Belazzougui Fabio Cunial Travis Gagie Nicola

Mo#f discovery Morgane Thomas-Chollier Computa)onal systems

CSE 527 Lecture 10 More on the Gibbs Sampler Projects see web Implementation or

Pattern Matching in Protein-Protein Interaction Graphs Ga elle Brevier ( Universit e de

Membrane Systems in Algebraic Biology: From a Toy to a Tool Thomas Hinze Friedrich Schiller

Timing from Stochasticity Scott Yang Nick Rhind (UMass Med) John Bechhoefer (SFU) CMMT TGIF

chromozoom.org rethinking the UI of genome browsers Ted Pak Roth Laboratory Donnelly Centre,

Protein-Protein interactions Reducing the complexity Why are protein-protein interactions

Selective Integration of Multiple Biological Data for Supervised Network Inference Koji Tsuda

GENOMIC SELECTION WORKSHOP: Hands on Practical Sessions (GBLUP-RR) Paulino Prez 1 Jos Crossa 2

Genomic Analysis Hoon Cho (MIT) and David Wu (Stanford) March, 2015 Homomorphic Encryption