

  1. CSCI 446: Artificial Intelligence Markov Models Instructor: Michele Van Dyne [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.]

  2. Today
   Probability Revisited
   Independence
   Conditional Independence
   Markov Models

  3. Independence
   Two variables X and Y are independent in a joint distribution if, for all x and y: P(x, y) = P(x) P(y)
   Says the joint distribution factors into a product of two simple ones
   Usually variables aren't independent!
   Can use independence as a modeling assumption
   Independence can be a simplifying assumption
   Empirical joint distributions: at best "close" to independent
   What could we assume for {Weather, Traffic, Cavity}?
   Independence is like something from CSPs: what?

  4. Example: Independence?
   Marginals:

      T     P          W     P
      hot   0.5        sun   0.6
      cold  0.5        rain  0.4

   Actual joint P1(T, W) vs. the product of marginals P2(T, W) = P(T) P(W):

      T     W     P1(T, W)        T     W     P2(T, W)
      hot   sun   0.4             hot   sun   0.3
      hot   rain  0.1             hot   rain  0.2
      cold  sun   0.2             cold  sun   0.3
      cold  rain  0.3             cold  rain  0.2

   Since P1 and P2 differ (e.g., 0.4 vs. 0.3 for hot, sun), T and W are not independent in this joint distribution
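
As a quick check (not part of the original slides), here is a minimal Python sketch that tests the definition against the table above by marginalizing the joint and comparing it to the product of its marginals:

```python
# Joint distribution P1(T, W) from the slide
joint = {
    ("hot", "sun"): 0.4, ("hot", "rain"): 0.1,
    ("cold", "sun"): 0.2, ("cold", "rain"): 0.3,
}

# Marginalize out each variable to get P(T) and P(W)
p_t, p_w = {}, {}
for (t, w), p in joint.items():
    p_t[t] = p_t.get(t, 0.0) + p
    p_w[w] = p_w.get(w, 0.0) + p

# Independent iff P(t, w) == P(t) * P(w) for every entry
independent = all(abs(joint[(t, w)] - p_t[t] * p_w[w]) < 1e-9
                  for t, w in joint)
print(p_t, p_w, independent)  # {'hot': 0.5, 'cold': 0.5} {'sun': 0.6, 'rain': 0.4} False
```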

  5. Example: Independence
   N fair, independent coin flips; each flip has the same distribution:

      X_i   P
      H     0.5
      T     0.5

   The full joint over all N flips has 2^N entries, but independence lets it factor into N copies of this one small table

  6. Conditional Independence

  7. Conditional Independence
   P(Toothache, Cavity, Catch)
   If I have a cavity, the probability that the probe catches in it doesn't depend on whether I have a toothache:
     P(+catch | +toothache, +cavity) = P(+catch | +cavity)
   The same independence holds if I don't have a cavity:
     P(+catch | +toothache, -cavity) = P(+catch | -cavity)
   Catch is conditionally independent of Toothache given Cavity:
     P(Catch | Toothache, Cavity) = P(Catch | Cavity)
   Equivalent statements:
     P(Toothache | Catch, Cavity) = P(Toothache | Cavity)
     P(Toothache, Catch | Cavity) = P(Toothache | Cavity) P(Catch | Cavity)
   One can be derived from the others easily

  8. Conditional Independence
   Unconditional (absolute) independence is very rare (why?)
   Conditional independence is our most basic and robust form of knowledge about uncertain environments
   X is conditionally independent of Y given Z if and only if, for all x, y, z:
     P(x, y | z) = P(x | z) P(y | z)
   or, equivalently, if and only if, for all x, y, z:
     P(x | z, y) = P(x | z)
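
A minimal sketch of the definition in code (the numbers below are made up, not from the slides): a joint P(X, Y, Z) is built so that X and Y are conditionally independent given Z, and the identity is then verified numerically over every assignment:

```python
# Hypothetical conditional distributions; the joint is built so X ⊥ Y | Z holds
P_Z = {0: 0.5, 1: 0.5}
P_X_Z = {0: {0: 0.8, 1: 0.2}, 1: {0: 0.3, 1: 0.7}}  # P(x | z)
P_Y_Z = {0: {0: 0.5, 1: 0.5}, 1: {0: 0.1, 1: 0.9}}  # P(y | z)

joint = {(x, y, z): P_Z[z] * P_X_Z[z][x] * P_Y_Z[z][y]
         for x in (0, 1) for y in (0, 1) for z in (0, 1)}

# Check P(x, y | z) = P(x | z) P(y | z) for all x, y, z
for z in (0, 1):
    p_z = sum(p for (_, _, zz), p in joint.items() if zz == z)
    for x in (0, 1):
        for y in (0, 1):
            p_xy_z = joint[(x, y, z)] / p_z                        # P(x, y | z)
            p_x_z = sum(joint[(x, yy, z)] for yy in (0, 1)) / p_z  # P(x | z)
            p_y_z = sum(joint[(xx, y, z)] for xx in (0, 1)) / p_z  # P(y | z)
            assert abs(p_xy_z - p_x_z * p_y_z) < 1e-12
print("X is conditionally independent of Y given Z")
```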

  9. Conditional Independence
   What about this domain:
     Traffic
     Umbrella
     Raining

  10. Conditional Independence
   What about this domain:
     Fire
     Smoke
     Alarm

  11. Probability Recap
   Conditional probability: P(x | y) = P(x, y) / P(y)
   Product rule: P(x, y) = P(x | y) P(y)
   Chain rule: P(x_1, x_2, ..., x_n) = P(x_1) P(x_2 | x_1) P(x_3 | x_1, x_2) ... = ∏_{i=1}^{n} P(x_i | x_1, ..., x_{i-1})
   X, Y independent if and only if, for all x, y: P(x, y) = P(x) P(y)
   X and Y are conditionally independent given Z if and only if, for all x, y, z: P(x, y | z) = P(x | z) P(y | z)

  12. Markov Models

  13. Reasoning over Time or Space
   Often, we want to reason about a sequence of observations
     Speech recognition
     Robot localization
     User attention
     Medical monitoring
   Need to introduce time (or space) into our models

  14. Markov Models
   Value of X at a given time is called the state

      X_1 → X_2 → X_3 → X_4

   Parameters: called transition probabilities or dynamics; P(X_t | X_{t-1}) specifies how the state evolves over time (also, initial state probabilities P(X_1))
   Stationarity assumption: transition probabilities are the same at all times
   Same as an MDP transition model, but with no choice of action

  15. Joint Distribution of a Markov Model

      X_1 → X_2 → X_3 → X_4

   Joint distribution: P(X_1, X_2, X_3, X_4) = P(X_1) P(X_2 | X_1) P(X_3 | X_2) P(X_4 | X_3)
   More generally: P(X_1, X_2, ..., X_T) = P(X_1) ∏_{t=2}^{T} P(X_t | X_{t-1})
   Questions to be resolved:
     Does this indeed define a joint distribution?
     Can every joint distribution be factored this way, or are we making some assumptions about the joint distribution by using this factorization?
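
To make the factorization concrete, a short sketch (function name and structure are mine, not from the slides), using the weather chain that appears later in this deck:

```python
# Score a state sequence under the Markov factorization
# P(x_1, ..., x_T) = P(x_1) * prod_t P(x_t | x_{t-1})
initial = {"sun": 1.0, "rain": 0.0}
transition = {("sun", "sun"): 0.9, ("sun", "rain"): 0.1,
              ("rain", "sun"): 0.3, ("rain", "rain"): 0.7}  # (x_{t-1}, x_t) -> P

def sequence_probability(states):
    p = initial[states[0]]
    for prev, cur in zip(states, states[1:]):  # consecutive pairs
        p *= transition[(prev, cur)]
    return p

print(sequence_probability(["sun", "sun", "rain", "rain"]))  # 1.0 * 0.9 * 0.1 * 0.7 = 0.063
```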

  16. Chain Rule and Markov Models

      X_1 → X_2 → X_3 → X_4

   From the chain rule, every joint distribution over X_1, X_2, X_3, X_4 can be written as:
     P(X_1, X_2, X_3, X_4) = P(X_1) P(X_2 | X_1) P(X_3 | X_1, X_2) P(X_4 | X_1, X_2, X_3)
   Assuming that P(X_3 | X_1, X_2) = P(X_3 | X_2) and P(X_4 | X_1, X_2, X_3) = P(X_4 | X_3) results in the expression posited on the previous slide:
     P(X_1, X_2, X_3, X_4) = P(X_1) P(X_2 | X_1) P(X_3 | X_2) P(X_4 | X_3)

  17. Chain Rule and Markov Models

      X_1 → X_2 → X_3 → X_4

   From the chain rule, every joint distribution over X_1, X_2, ..., X_T can be written as:
     P(X_1, X_2, ..., X_T) = ∏_{t=1}^{T} P(X_t | X_1, X_2, ..., X_{t-1})
   Assuming that for all t: P(X_t | X_1, ..., X_{t-1}) = P(X_t | X_{t-1}) gives us the expression posited on the earlier slide:
     P(X_1, X_2, ..., X_T) = P(X_1) ∏_{t=2}^{T} P(X_t | X_{t-1})

  18. Implied Conditional Independencies

      X_1 → X_2 → X_3 → X_4

   We assumed: X_3 ⊥ X_1 | X_2 and X_4 ⊥ X_1, X_2 | X_3
   Do we also have X_1 ⊥ X_4 | X_3 ?
   Yes!
   Proof:
     P(x_1 | x_3, x_4) = P(x_1, x_3, x_4) / P(x_3, x_4)
       = Σ_{x_2} P(x_1) P(x_2 | x_1) P(x_3 | x_2) P(x_4 | x_3) / Σ_{x_1', x_2'} P(x_1') P(x_2' | x_1') P(x_3 | x_2') P(x_4 | x_3)
       = P(x_4 | x_3) P(x_1, x_3) / [ P(x_4 | x_3) P(x_3) ]
       = P(x_1 | x_3)

  19. Markov Models Recap
   Explicit assumption for all t: P(X_t | X_1, ..., X_{t-1}) = P(X_t | X_{t-1})
   Consequence: the joint distribution can be written as:
     P(X_1, X_2, ..., X_T) = P(X_1) ∏_{t=2}^{T} P(X_t | X_{t-1})
   Implied conditional independencies: (try to prove this!)
     Past variables are independent of future variables given the present, i.e., if t_1 < t_2 < t_3 or t_1 > t_2 > t_3, then: X_{t_1} ⊥ X_{t_3} | X_{t_2}
   Additional explicit assumption: P(X_t | X_{t-1}) is the same for all t

  20. Example Markov Chain: Weather
   States: X = {rain, sun}
   Initial distribution: 1.0 sun
   CPT P(X_t | X_{t-1}), shown two equivalent ways: as a table and as a state-transition diagram

      X_{t-1}   X_t    P(X_t | X_{t-1})
      sun       sun    0.9
      sun       rain   0.1
      rain      sun    0.3
      rain      rain   0.7

   [Diagram: sun loops to itself with 0.9 and goes to rain with 0.1; rain loops with 0.7 and goes to sun with 0.3]

  21. Example Markov Chain: Weather
   Initial distribution: 1.0 sun
   [Diagram: sun → sun 0.9, sun → rain 0.1, rain → sun 0.3, rain → rain 0.7]
   What is the probability distribution after one step?
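
As a worked step (computed directly from the CPT above):

```latex
P(X_2 = \text{sun}) = \sum_{x_1} P(\text{sun} \mid x_1)\, P(X_1 = x_1)
                    = 0.9 \cdot 1.0 + 0.3 \cdot 0.0 = 0.9,
\qquad P(X_2 = \text{rain}) = 0.1
```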

  22. Mini-Forward Algorithm
   Question: What's P(X_t) on some day t?

      X_1 → X_2 → X_3 → X_4

   Forward simulation: push the state distribution through the transition model one step at a time:
     P(x_t) = Σ_{x_{t-1}} P(x_t | x_{t-1}) P(x_{t-1})
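
A minimal sketch of the mini-forward algorithm for the weather chain above (function names are mine, not from the slides):

```python
initial = {"sun": 1.0, "rain": 0.0}
transition = {("sun", "sun"): 0.9, ("sun", "rain"): 0.1,
              ("rain", "sun"): 0.3, ("rain", "rain"): 0.7}

def step(dist):
    """One forward step: P(x_t) = sum over x_{t-1} of P(x_t | x_{t-1}) P(x_{t-1})."""
    return {x: sum(transition[(prev, x)] * p for prev, p in dist.items())
            for x in ("sun", "rain")}

def mini_forward(dist, t):
    """Distribution over X_t, starting from P(X_1) = dist."""
    for _ in range(t - 1):
        dist = step(dist)
    return dist

print(mini_forward(initial, 2))    # {'sun': 0.9, 'rain': 0.1}
print(mini_forward(initial, 100))  # converges toward {'sun': 0.75, 'rain': 0.25}
```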

  23. Example Run of Mini-Forward Algorithm
   From initial observation of sun: P(X_1), P(X_2), P(X_3), P(X_4), ..., P(X_∞)
   From initial observation of rain: P(X_1), P(X_2), P(X_3), P(X_4), ..., P(X_∞)
   From yet another initial distribution P(X_1): ..., P(X_∞)
   In each case, the sequence of distributions converges to the same P(X_∞)
   [Demo: L13D1,2,3]

  24. Stationary Distributions
   For most chains:
     The influence of the initial distribution gets less and less over time
     The distribution we end up in is independent of the initial distribution
   Stationary distribution:
     The distribution we end up with is called the stationary distribution P_∞ of the chain
     It satisfies P_∞(X) = P_{∞+1}(X) = Σ_x P(X | x) P_∞(x)

  25. Example: Stationary Distributions
   Question: What's P(X) at time t = infinity?

      X_1 → X_2 → X_3 → X_4

      X_{t-1}   X_t    P(X_t | X_{t-1})
      sun       sun    0.9
      sun       rain   0.1
      rain      sun    0.3
      rain      rain   0.7

   P_∞(sun) = P(sun | sun) P_∞(sun) + P(sun | rain) P_∞(rain) = 0.9 P_∞(sun) + 0.3 P_∞(rain)
   P_∞(rain) = P(rain | sun) P_∞(sun) + P(rain | rain) P_∞(rain) = 0.1 P_∞(sun) + 0.7 P_∞(rain)
   So: P_∞(sun) = 3 P_∞(rain)
   Also: P_∞(sun) + P_∞(rain) = 1
   Hence: P_∞(sun) = 3/4 and P_∞(rain) = 1/4
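
The same answer can be found numerically; a sketch (assuming numpy is available) that extracts the stationary distribution as the eigenvector of the transition matrix with eigenvalue 1:

```python
import numpy as np

# Columns are "from" states, rows are "to" states: T[i, j] = P(X_t = i | X_{t-1} = j)
T = np.array([[0.9, 0.3],   # to sun
              [0.1, 0.7]])  # to rain

eigvals, eigvecs = np.linalg.eig(T)
v = eigvecs[:, np.argmax(np.isclose(eigvals, 1.0))].real  # eigenvector for eigenvalue 1
print(v / v.sum())  # [0.75 0.25] -> P_inf(sun) = 3/4, P_inf(rain) = 1/4
```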

  26. Application of Stationary Distribution: Web Link Analysis
   PageRank over a web graph
     Each web page is a state
     Initial distribution: uniform over pages
     Transitions:
       With prob. c, uniform jump to a random page (dotted lines, not all shown)
       With prob. 1-c, follow a random outlink (solid lines)
   Stationary distribution
     Will spend more time on highly reachable pages
     E.g., many ways to get to the Acrobat Reader download page
     Somewhat robust to link spam
     Google 1.0 returned the set of pages containing all your keywords in decreasing rank; now all search engines use link analysis along with many other factors (rank actually getting less important over time)
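
A toy PageRank sketch (the 4-page link graph below is made up for illustration): power iteration drives the surfer's distribution to the stationary distribution of the jump-or-follow chain described above:

```python
# Hypothetical link graph; every page here has at least one outlink
links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
pages = list(links)
c = 0.15  # random-jump probability

ranks = {p: 1.0 / len(pages) for p in pages}  # initial distribution: uniform
for _ in range(100):                          # power iteration toward the stationary distribution
    new = {p: c / len(pages) for p in pages}  # with prob. c: uniform jump
    for p, out in links.items():
        for q in out:                         # with prob. 1-c: follow a random outlink
            new[q] += (1 - c) * ranks[p] / len(out)
    ranks = new

print({p: round(r, 3) for p, r in ranks.items()})  # C is most reachable, so it ranks highest
```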

  27. Today
   Probability Revisited
   Independence
   Conditional Independence
   Markov Models
