Scalable Uncertainty Management 06 Markov Logic Rainer Gemulla - PowerPoint PPT Presentation

Scalable Uncertainty Management 06 – Markov Logic Rainer Gemulla July 13, 2012

Overview In this lecture Statistical relational learning (SRL) Introduction to probabilistic graphical models (PGM) Basics of undirected models (called Markov networks ) Markov logic as a template for undirected models Basics of inference in Markov logic networks Not in this lecture Directed models (called Bayesian networks ) Other SRL approaches (such as probabilistic relational models ) High coverage and in-depth discussion of inference Learning Markov logic networks 2 / 78

Outline Introduction to Markov Logic Networks 1 Probabilistic Graphical Models 2 Introduction Preliminaries Markov Networks 3 Markov Logic Networks 4 Grounding Markov logic networks Log-Linear Models Inference in MLNs 5 Basics Exact Inference Approximate Inference Summary 6 3 / 78

Correlations in probabilistic databases Simple probabilistic models ◮ Tuple-independent databases ◮ Block-disjoint independent databases ◮ Key/foreign key constraints, . . . Correlations (mainly) through RA queries/views ◮ Any discrete probability distribution can be modeled ◮ Queries describe precisely how result is derived Example (Nell) NellExtraction NellSource Subject Pattern Object Source Source P P Sony produces Walkman 1 0.96 1 0.99 IBM produces PC 1 0.96 2 0.1 IBM produces PC 2 1 Microsoft produces MacOS 2 0.9 AlbertEinstein bornIn Ulm 1 0.9 Produces Subject Object P Produces( x , y ) ← NellExtraction( x , ’produces’ , y , s ) , Sony Walkman 0.9504 IBM PC 0.95536 NellSource( s ) Microsoft MacOS 0.09 4 / 78

Statistical relational learning (I) Does John smoke? Learn correlations from structured data, then apply to new data. 5 / 78

Statistical relational learning (II) Goal: Declarative modelling of correlations in structured data Idea: Use (subsets of) first-order logic ◮ Very expressive formalism; lots of knowledge bases use it ◮ Symmetry: ∀ x . ∀ y . Friends( x , y ) ⇐ ⇒ Friends( y , x ) ◮ Everybody has a friend: ∀ x . ∃ y . Friends( x , y ) ◮ Transitivity: ∀ x . ∀ y . ∀ z . Friends( x , y ) ∧ Friends( y , z ) = ⇒ Friends( x , z ) ◮ Smoking causes cancer: ∀ x . Smokes( x ) = ⇒ Cancer( x ) ◮ Friends have similar smoking habits: ∀ x . ∀ y . Friends( x , y ) = ⇒ (Smokes( x ) ⇐ ⇒ Smokes( y )) Problem: Real-world knowledge is incomplete, contradictory, complex → Above rules do not generally hold, but they are “likely” to hold! Approach: Combine first-order logic with probability theory ◮ Expressiveness of first-order logic ◮ Principled treatment of uncertainty using probability theory There are many approaches of this kind. Our focus is on Markov logic , a recent and very successful language. 6 / 78

Markov logic networks Definition A Markov logic network is a set of pairs ( F i , w i ), where F i is a formula in first-order logic and the weight w i is a real number. Example � Smoking causes cancer 1.5 ∀ x . Smokes( x ) = ⇒ Cancer( x ) � Friends have similar smoking habits 1.1 ∀ x . ∀ y . Friends( x , y ) = ⇒ (Smokes( x ) ⇐ ⇒ Smokes( y )) Formulas may or may not hold Weights express confidence ◮ High positive weight → confident that formula holds ◮ High negative weight → confident that formula does not hold ◮ But careful: weights actually express confidence of certain “groundings” of a formula and not the formula as a whole (more later) Formulas may introduce complex correlations 7 / 78

Simple MLN for entity resolution Which citations refer to the same publication? author Richardson, Matt M. Richardson and Domingos, Pedro and and Domingos, P. Domingos Richardson, Matthew Pedro title Markov Logic Markov logic Markov Logic: A Unifying Networks networks Framework for Statistical Relational Learning year 2006 2006 2007 // predicates HasToken(token, field, citation ) // e.g., HasToken(’Logic’, ’ title ’, C1) SameField(field, citation , citation ) // Semantic equality of values in a field SameCitation(citation, citation ) // Semantic equality of citations // formulas HasToken(+t, +f, c1) ˆ HasToken(+t, +f, c2) = > SameField(+f, c1, c2) SameField(+f, c1, c2) = > SameCitation(c1, c2) SameCitation(c1, c2) ˆ SameCitation(c2, c3) = > SameCitation(c1, c3) Rule weights are usually learned from data. The same rule may have different weights for different constants (indicated by “+”). 8 / 78

Alchemy Alchemy is well-known software package for Markov logic Developed at University of Washington Supports a wide range of tasks ◮ Structure learning ◮ Weight learning ◮ Probabilistic inference Has been used for wide range of applications ◮ Information extraction ◮ Social network modeling ◮ Entity resolution ◮ Collective classification ◮ Link prediction Check out http://alchemy.cs.washington.edu/ ◮ Code ◮ Real-world datasets ◮ Real-world Markov logic networks ◮ Literature 9 / 78

From Markov logic to graphical models (example) Friends Smokes Cancer Name1 Name2 Value Name Value Name Value Anna Bob Yes Anna Yes Anna No Bob Anna Yes Smoking causes cancer � 1.5 Anna Anna Yes ∀ x . Smokes( x ) = ⇒ Cancer( x ) Bob Bob Yes Friends have similar smoking habits � 1.1 ∀ x . ∀ y . Friends( x , y ) = ⇒ (Smokes( x ) ⇐ ⇒ Smokes( y )) Inference (conceptual) Inference result S(B) C(B) #R1 #R2 � w P P ( Bob smokes ) = 84 . 6% No No 1 1 2.6 7.7% P ( Bob has cancer ) = 76 . 9% No Yes 1 1 2.6 7.7% Friends(A,B) Yes No 0 3 3.3 15.4% Yes Yes 1 3 4.8 69.2% Friends(A,A) Smokes(A) Smokes(B) Friends(B,B) Example is simplified; actual semantics are Cancer(A) Friends(B,A) Cancer(B) slightly different. 10 / 78

Probabilistic databases and graphical models Probabilistic databases Graphical models Simple Complex Probabilistic (disjoint-independent tuples) (independencies given by graph) model Query Complex Simple (e.g., ∃ x . ∃ y . R ( x , y ) ∧ S ( x )) (e.g., P ( X 1 , X 2 | Z 1 , Z 2 , Z 3 )) Network Dynamic Static (database + query) (Bayesian or Markov network) Complexity Database Network measured in size of Complexity Query Treewidth parameter System Extension to RDBMS Stand-alone Hybrid approaches have many potential applications and are under active research. 11 / 78

Reasoning with uncertainty Goal: Automated reasoning system ◮ Take all available information (e.g., patient information: symptoms, test results, personal data) ◮ Reach conclusions (e.g., which diseases the patient has, which medication to give) Desiderata Separation of knowledge and reasoning 1 ⋆ Declarative, model-based representation of knowledge ⋆ General suite of reasoning algorithms, applicable to many domains Principled treatment of uncertainty 2 ⋆ Partially observed data ⋆ Noisy observations ⋆ Non-deterministic relationships Lots of applications ◮ medical diagnosis, fault diagnosis, analysis of genetic and genomic data, communication and coding, analysis of marketing data, speech recognition, natural language understanding , segmenting and denoising images, social network analysis, . . . 14 / 78

Probabilistic models Multiple interrelated aspects may relate to the reasoning task ◮ Possible diseases ◮ Hundreds of symptoms and diagnostic tests ◮ Personal characteristics 1 Characterize data by a set of random variables ◮ Flu (yes / no) ◮ Hayfever (yes / no) ◮ Season (Spring / Sommer / Autumn / Winter) ◮ Congestion (yes / no) ◮ MusclePain (yes / no) → Variables and their domain are important design decision 2 Model dependencies by a joint distribution ◮ Diseases, season, and symptoms are correlated ◮ Probabilistic models construct joint probability space → 2 · 2 · 4 · 2 · 2 outcomes (64 values, 63 non-redundant) ◮ Given joint probability space, interesting questions can be answered P ( Flu | Season=Spring , Congestion , ¬ MusclePain ) Specifying a joint distribution is infeasible in general! 15 / 78

Probabilistic graphical models A graph-based representation of direct probabilistic interactions A break-down of high-dimensional distributions into smaller factors (here: 63 vs. 17 non-redundant parameters) A compact representation of a set of (conditional) independencies Example (directed graphical model) Graph representation Season Hayfever Flu MusclePain Congestion Factorization P ( S , F , H , M , C ) = P ( S ) P ( F | S ) P ( H | S ) P ( C | F , H ) P ( M | F ) Independencies ( F ⊥ H | S ) , ( C ⊥ S | F , H ) , ( M ⊥ H , C , S | F ) 16 / 78

Scalable Uncertainty Management 06 Markov Logic Rainer Gemulla - PowerPoint PPT Presentation

Scalable Uncertainty Management 06 Markov Logic Rainer Gemulla July 13, 2012 Overview In this lecture Statistical relational learning (SRL) Introduction to probabilistic graphical models (PGM) Basics of undirected models (called Markov

Uncertainty AIMA Chapter 13 Outline Uncertainty Uncertainty Probability Syntax and

Cache Coherence in Scalable Machines Scalable Cache Coherent Systems Scalable, distributed

UNCERTAINTY IN KNOWLEDGE Ch. 9 Uncertainty in Knowledge 1 Sources of Uncertainty

Scalable String Matching on the Scalable String Matching on the Scalable String Matching on the

7 Modelling Uncertainty Bayes theorem 7 Modelling Uncertainty Bayes theorem

Uncertainty and its Representa/on @kordinglab Uncertainty ma7ers

Decision Making Privacy-Motivated . . . under Uncertainty: Uncertainty Leads to . . .

CPSC 875 CPSC 875 John D McGregor John D. McGregor C10 Error Design Uncertainty Uncertainty

Decision Making Under Uncertainty Making Decisions Under Uncertainty AI C LASS 10 (C H .

Handling Probabilistic Data Ander de Keijzer Scalable Uncertainty Management Medical domain

Scalable Uncertainty Management 01 Introduction Rainer Gemulla April 20, 2012 Information

The Scalable Commutativity Rule: Designing Scalable Software for Multicore Processors Austin T.

Dyninst Scalable Tools Workshop Granlibakken Resort Lake Tahoe, California Dyninst Scalable

Scalable Distributed Lineage Authentication Ashish Gehani Scalable Distributed Lineage

VISUALIZING UNCERTAINTY Fall 2017 Mac Hill VISUALIZING UNCERTAINTY 2 DEVELOPING A VISUAL

The Role of Expert Knowledge in Uncertainty Quantification (Are We Adding More Uncertainty (Are

CDA-Compliant Section Annotation of German-Language Discharge Summaries: Guideline Development,

NTCIR13 MedWeb Task: Multi-label Classification of Tweets using an Ensemble of Neural Networks.

Mold Contamination Friday, May 8 th 10:00am 11:30am PDT Webinar provided by the Tribal Healthy

Human-Oriented Robotics Basics of Probabilistic Reasoning Kai Arras Social Robotics Lab,

to Fit Different Communities and Populations Presenters: Lynn Knox, State Health Care Liaison,

1/16/2020 COPE WEBINAR SERIES FOR HEALTH PROFESSIONALS FINDING SLIDES FOR TODAYS WEBINAR

Applying the Data Wrangling Process Nicole G Weiskopf, 8/21/18 Wrangling diabetes Research

Primary Care Kent Lewandrowski MD Associate Chief Pathology Director of Clinical Laboratories