SLIDE 1

Whatʼs the probability that the next candy is lime?

1

CS440/ECE448: Intro AI

  • What is P(di+1 | d1, …, di) = P(X | D)?

We donʼt know which bag of candy we got, so we have to assume it could be any one of them:

P(X | D) = ∑i P(X | D, hi) P(hi | D) = ∑i P(X | hi) P(hi | D)

(The second step holds because, given the bag hi, the next candy X is independent of the previously observed data D.)

SLIDE 2

Lecture 19
 Learning graphical models

  • Prof. Julia Hockenmaier

juliahmr@illinois.edu

  • http://cs.illinois.edu/fa11/cs440
  • CS440/ECE448: Intro to Artificial Intelligence
SLIDE 3

The Burglary example


Network structure: Burglary → Alarm ← Earthquake; Alarm → JohnCalls, Alarm → MaryCalls.

P(B):          B=t 0.001    B=f 0.999
P(E):          E=t 0.002    E=f 0.998

P(A | B, E):
  B=t, E=t:    A=t 0.95     A=f 0.05
  B=t, E=f:    A=t 0.94     A=f 0.06
  B=f, E=t:    A=t 0.29     A=f 0.71
  B=f, E=f:    A=t 0.001    A=f 0.999

P(J | A):
  A=t:         J=t 0.9      J=f 0.1
  A=f:         J=t 0.05     J=f 0.95

P(M | A):
  A=t:         M=t 0.7      M=f 0.3
  A=f:         M=t 0.01     M=f 0.99

What is the probability of a burglary if John and Mary call?
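
As a sanity check, here is a minimal Python sketch (my own illustration, not from the slides; the function name joint is mine) that answers this query by enumerating the full joint distribution defined by the CPTs above:

    import itertools

    # CPTs from the slide (True = the event occurs).
    P_B = {True: 0.001, False: 0.999}
    P_E = {True: 0.002, False: 0.998}
    P_A = {(True, True): 0.95, (True, False): 0.94,
           (False, True): 0.29, (False, False): 0.001}   # P(A=t | B, E)
    P_J = {True: 0.90, False: 0.05}                      # P(J=t | A)
    P_M = {True: 0.70, False: 0.01}                      # P(M=t | A)

    def joint(b, e, a, j, m):
        """P(B=b, E=e, A=a, J=j, M=m) via the chain rule of the network."""
        pa = P_A[(b, e)] if a else 1 - P_A[(b, e)]
        pj = P_J[a] if j else 1 - P_J[a]
        pm = P_M[a] if m else 1 - P_M[a]
        return P_B[b] * P_E[e] * pa * pj * pm

    # P(B=t | J=t, M=t): sum out E and A, then normalize.
    num = sum(joint(True, e, a, True, True)
              for e, a in itertools.product([True, False], repeat=2))
    den = sum(joint(b, e, a, True, True)
              for b, e, a in itertools.product([True, False], repeat=3))
    print(num / den)   # ≈ 0.284

Enumeration is exponential in the number of variables, but for five Boolean variables it is instant.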

SLIDE 4

Learning Bayes Nets

SLIDE 5

How do we know the parameters of a Bayes Net?

  • We want to estimate the parameters based on data D.
  • Data = instantiations of some or all random variables in the Bayes Net.
  • The data are our evidence.
SLIDE 6

Surprise Candy

There are two flavors of Surprise Candy: cherry and lime. Both have the same wrapper.

  • There are five different types of bags (which all look the same) that Surprise Candy is sold in:
    – h1: 100% cherry
    – h2: 75% cherry + 25% lime
    – h3: 50% cherry + 50% lime
    – h4: 25% cherry + 75% lime
    – h5: 100% lime

SLIDE 7

Surprise Candy

You just bought a bag of Surprise Candy. Which kind of bag did you get?

  • There are five different hypotheses: h1-h5
  • You start eating your candy. This is your data: D1 = cherry, D2 = lime, …, DN = ….

  • What is the most likely hypothesis given your data (evidence)?


SLIDE 8

Conditional probability refresher


P(X | Y) = P(X, Y) / P(Y)
P(X | Y) P(Y) = P(X, Y)
P(X | Y) P(Y) = P(Y | X) P(X)

SLIDE 9

Bayes Rule

  • P(cause): prior probability of cause
  • P(cause | effect): posterior probability of cause
  • P(effect | cause): likelihood of effect

  • Posterior ∝ likelihood × prior


P(cause | effect) ∝ P(effect | cause) P(cause)
P(cause | effect) = P(effect | cause) P(cause) / P(effect)

SLIDE 10

Bayes Rule

  • P(h): prior probability of hypothesis
  • P(h | D): posterior probability of hypothesis
  • P(D | h): likelihood of data, given hypothesis

  • Posterior ∝ likelihood × prior


P(h | D) ∝ P(D | h) P(h)
P(h | D) = P(D | h) P(h) / P(D)

SLIDE 11

Bayes Rule

  • P(h): prior probability of hypothesis
  • P(h | D): posterior probability of hypothesis
  • P(D | h): likelihood of data, given hypothesis


argmaxh P(h | D) = argmaxh P(D | h) P(h) / P(D)
                 = argmaxh P(D | h) P(h)

(P(D) does not depend on h, so it can be dropped from the argmax.)

SLIDE 12

Bayesian learning

Use Bayes rule to calculate the probability of each hypothesis given the data.

  • How do we know the prior and the likelihood?


P(h | D) = P(D | h) P(h) / P(D)

SLIDE 13

The prior P(h)

Sometimes we know P(h) in advance.

– Surprise Candy: (P(h1), …, P(h5)) = (0.1, 0.2, 0.4, 0.2, 0.1)

  • Sometimes we have to make an assumption (e.g. a uniform prior, when we donʼt know anything).


SLIDE 14

The likelihood P(D|h)

We typically assume that each observation di is drawn "i.i.d.": independently, from the same (identical) distribution.

  • Therefore:

P(D | h) = ∏i=1…N P(di | h)
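
For the candy bags, this product is easy to compute directly. Below is a small Python sketch (my own illustration, not from the lecture; the dictionary theta, holding P(cherry) under each hypothesis, and the function likelihood are names I chose):

    # P(cherry) under each of the five bag hypotheses.
    theta = {'h1': 1.00, 'h2': 0.75, 'h3': 0.50, 'h4': 0.25, 'h5': 0.00}

    def likelihood(data, h):
        """P(D | h) for an i.i.d. sequence of 'cherry'/'lime' observations."""
        p = 1.0
        for d in data:
            p *= theta[h] if d == 'cherry' else 1 - theta[h]
        return p

    print(likelihood(['cherry', 'lime', 'lime'], 'h2'))   # 0.75 * 0.25 * 0.25 ≈ 0.047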

SLIDE 15

The posterior P(h|D)

Assume weʼve seen 10 lime candies:


[Figure: posterior probability of each hypothesis, P(h1 | d) … P(h5 | d), vs. the number of observations in d (1-10 lime candies)]
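
Continuing the sketch above (theta and likelihood as defined earlier, and the prior from the earlier slide), the plotted curves can be reproduced like this:

    prior = {'h1': 0.1, 'h2': 0.2, 'h3': 0.4, 'h4': 0.2, 'h5': 0.1}

    def posterior(data):
        """P(h | D) ∝ P(D | h) P(h), normalized over the five hypotheses."""
        unnorm = {h: likelihood(data, h) * prior[h] for h in prior}
        z = sum(unnorm.values())
        return {h: p / z for h, p in unnorm.items()}

    for n in range(1, 11):            # posterior after 1..10 lime candies
        print(n, posterior(['lime'] * n))

After 10 limes the posterior mass has shifted almost entirely to h5; h1 is ruled out after the very first lime.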

SLIDE 16

Whatʼs the probability that the next candy is lime?

In the limit of infinitely many observations, this prediction agrees with the prediction of the true hypothesis.


[Figure: probability that the next candy is lime vs. the number of observations in d (1-10)]

SLIDE 17

Bayes optimal prediction

We donʼt know which hypothesis is true, so we marginalize them out:

  • P(X | D) = ∑i P(X | hi) P(hi | D)

This is guaranteed to converge to the true hypothesis.
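
In the running sketch (posterior and theta as above; p_next_lime is my name), the Bayes-optimal prediction is a posterior-weighted average:

    def p_next_lime(data):
        """P(next = lime | D) = Σi P(lime | hi) P(hi | D)."""
        post = posterior(data)
        return sum((1 - theta[h]) * post[h] for h in post)

    print(p_next_lime(['lime'] * 10))   # ≈ 0.97, approaching 1.0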


SLIDE 18

Maximum a-posteriori (MAP)

We assume the hypothesis with the maximum posterior probability, hMAP = argmaxh P(h | D), is true:

  • P(X | D) = P(X | hMAP)
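
In the running sketch, MAP prediction just replaces the weighted sum with the single most probable hypothesis:

    def p_next_lime_map(data):
        """P(X | D) ≈ P(X | hMAP), where hMAP = argmaxh P(h | D)."""
        post = posterior(data)
        h_map = max(post, key=post.get)
        return 1 - theta[h_map]

After 10 limes this returns 1.0, since hMAP = h5.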


SLIDE 19

Maximum likelihood (ML)

We assume a uniform prior P(h). We then choose the hypothesis that assigns the highest likelihood to the data:

  • hML = argmaxh P(D | h)
  • P(X | D) = P(X | hML)

This is commonly used in machine learning.
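
The corresponding step in the running sketch (likelihood and theta as before) ignores the prior entirely:

    def p_next_lime_ml(data):
        """P(X | D) ≈ P(X | hML), where hML = argmaxh P(D | h)."""
        h_ml = max(theta, key=lambda h: likelihood(data, h))
        return 1 - theta[h_ml]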


SLIDE 20

Surprise candy again

Now the manufacturer has been bought up by another company.

  • Now we donʼt know the lime-cherry proportions θ (= P(cherry)) anymore.

  • Can we estimate θ from data?


[Bayes net with a single node, flavor: P(cherry) = θ, P(lime) = 1 − θ]

SLIDE 21

Maximum likelihood learning

Given data D, we want to find the parameters that maximize P(D | θ).

  • We have a data set with N candies: c candies are cherry, l = (N − c) candies are lime.


SLIDE 22

Maximum likelihood learning

Out of N candies, c are cherry, (N-c) lime.

  • The likelihood of our data set:

P(D | θ) = ∏j=1…N P(dj | θ) = θ^c (1 − θ)^l

SLIDE 23

Log likelihood

Itʼs actually easier to work with the log-likelihood:


L(D | θ) = log P(D | θ) = ∑j=1…N log P(dj | θ) = c log θ + l log(1 − θ)

SLIDE 24

Maximizing Log-likelihood


dL(D | θ)/dθ = c/θ − l/(1 − θ) = 0
⟹ θ = c/(c + l) = c/N
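
A quick numerical check of this closed-form solution, as a sketch with made-up counts (c = 30 cherry out of N = 100 is my example, not from the slides):

    import math

    c, l = 30, 70                  # 30 cherry, 70 lime, N = 100
    N = c + l

    def log_lik(t):
        """c log θ + l log(1 − θ), the log-likelihood above."""
        return c * math.log(t) + l * math.log(1 - t)

    # A grid search over θ recovers the analytic maximizer c/N = 0.3.
    best = max((i / 1000 for i in range(1, 1000)), key=log_lik)
    print(best, c / N)             # 0.3 0.3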

SLIDE 25

Maximum likelihood estimation

We can simply count how many cherry candies we see.

  • This is also called the relative frequency estimate.
  • It is appropriate when we have complete data (i.e. we know the flavor of each candy).


SLIDE 26

Todayʼs reading

Chapter 13.5, Chapter 20.1 and 20.2.1
