SLIDE 1

“The mind is a neural computer, fitted by natural selection with combinatorial algorithms for causal and probabilistic reasoning about plants, animals, objects, and people.

“In a universe with any regularities at all, decisions informed about the past are better than decisions made at random. That has always been true, and we would expect organisms, especially informavores such as humans, to have evolved acute intuitions about probability. The founders of probability, like the founders of logic, assumed they were just formalizing common sense.” Steven Pinker, How the Mind Works, 1997, pp. 524, 343.

© D. Poole and A. Mackworth 2010, Artificial Intelligence, Lecture 6.1
SLIDE 2

Learning Objectives

At the end of the class you should be able to:
◮ justify the use and semantics of probability
◮ know how to compute marginals and apply Bayes’ theorem
◮ build a belief network for a domain
◮ predict the inferences for a belief network
◮ explain the predictions of a causal model

SLIDE 3

Using Uncertain Knowledge

Agents don’t have complete knowledge about the world. Agents need to make decisions based on their uncertainty. It isn’t enough to assume what the world is like. Example: wearing a seat belt. An agent needs to reason about its uncertainty.

SLIDE 4

Why Probability?

There is lots of uncertainty about the world, but agents still need to act. Predictions are needed to decide what to do:
◮ definitive predictions: you will be run over tomorrow
◮ point probabilities: the probability you will be run over tomorrow is 0.002
◮ probability ranges: you will be run over with probability in the range [0.001, 0.34]
Acting is gambling: agents who don’t use probabilities will lose to those who do — Dutch books. Probabilities can be learned from data. Bayes’ rule specifies how to combine data and prior knowledge.

SLIDE 5

Probability

Probability is an agent’s measure of belief in some proposition — subjective probability. An agent’s belief depends on its prior assumptions and what the agent observes.

SLIDE 6

Numerical Measures of Belief

Belief in a proposition, f, can be measured in terms of a number between 0 and 1 — this is the probability of f.
◮ The probability of f being 0 means that f is believed to be definitely false.
◮ The probability of f being 1 means that f is believed to be definitely true.
Using 0 and 1 is purely a convention. A probability of f strictly between 0 and 1 means the agent is ignorant of its truth value. Probability is a measure of an agent’s ignorance. Probability is not a measure of degree of truth.

SLIDE 7

Random Variables

A random variable is a term in a language that can take one of a number of different values. The range of a variable X, written range(X), is the set of values X can take. A tuple of random variables X1, . . . , Xn is a complex random variable with range range(X1) × · · · × range(Xn). Often the tuple is written as X1, . . . , Xn. The assignment X = x means variable X has value x. A proposition is a Boolean formula made from assignments of values to variables.

SLIDE 8

Possible World Semantics

A possible world specifies an assignment of one value to each random variable. A random variable is a function from possible worlds into the range of the random variable. ω ⊨ X = x means variable X is assigned value x in world ω. Logical connectives have their standard meaning:
ω ⊨ α ∧ β if ω ⊨ α and ω ⊨ β
ω ⊨ α ∨ β if ω ⊨ α or ω ⊨ β
ω ⊨ ¬α if ω ⊭ α
Let Ω be the set of all possible worlds.

SLIDE 9

Semantics of Probability

For a finite number of possible worlds: define a nonnegative measure µ(ω) for each world ω so that the measures of the possible worlds sum to 1. The probability of proposition f is defined by:

P(f) = Σ_{ω ⊨ f} µ(ω)
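The sum-over-worlds definition can be sketched directly in code. The encoding below (worlds as assignment/measure pairs) and the particular measures are illustrative choices of mine, not from the slides:

```python
# A finite set of possible worlds: (assignment, mu) pairs whose
# measures are nonnegative and sum to 1 (illustrative numbers).
worlds = [
    ({"Flu": True,  "Sneeze": True},  0.2),
    ({"Flu": True,  "Sneeze": False}, 0.1),
    ({"Flu": False, "Sneeze": True},  0.3),
    ({"Flu": False, "Sneeze": False}, 0.4),
]

def prob(f):
    """P(f) = sum of mu(w) over the worlds w with w |= f."""
    return sum(mu for w, mu in worlds if f(w))

print(prob(lambda w: w["Flu"]))  # P(Flu) = 0.2 + 0.1
```

A proposition is just a predicate on worlds, so any Boolean formula over the assignments can be queried the same way.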

SLIDE 10

Axioms of Probability: finite case

Three axioms define what follows from a set of probabilities:
Axiom 1: 0 ≤ P(a) for any proposition a.
Axiom 2: P(true) = 1.
Axiom 3: P(a ∨ b) = P(a) + P(b) if a and b cannot both be true.
These axioms are sound and complete with respect to the semantics.

SLIDE 11

Semantics of Probability: general case

In the general case, probability defines a measure on sets of possible worlds. We define µ(S) for some sets S ⊆ Ω satisfying:
µ(S) ≥ 0
µ(Ω) = 1
µ(S1 ∪ S2) = µ(S1) + µ(S2) if S1 ∩ S2 = {}.
Or sometimes σ-additivity: µ(∪_i Si) = Σ_i µ(Si) if Si ∩ Sj = {} for i ≠ j.
Then P(α) = µ({ω | ω ⊨ α}).

SLIDE 12

Probability Distributions

A probability distribution on a random variable X is a function range(X) → [0, 1] such that x ↦ P(X = x). This is written as P(X). This also includes the case where we have tuples of variables. E.g., P(X, Y, Z) means the distribution on the tuple of variables X, Y, Z. When range(X) is infinite, sometimes we need a probability density function...

SLIDE 13

Conditioning

Probabilistic conditioning specifies how to revise beliefs based on new information. An agent builds a probabilistic model taking all background information into account. This gives the prior probability. All other information must be conditioned on. If evidence e is all of the information obtained subsequently, the conditional probability P(h|e) of h given e is the posterior probability of h.

SLIDE 16

Semantics of Conditional Probability

Evidence e rules out possible worlds incompatible with e. Evidence e induces a new measure, µe, over possible worlds:

µe(S) = c × µ(S) if ω ⊨ e for all ω ∈ S
µe(S) = 0 if ω ⊭ e for all ω ∈ S

We can show c = 1/P(e). The conditional probability of formula h given evidence e is:

P(h|e) = µe({ω : ω ⊨ h}) = P(h ∧ e) / P(e)
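This semantics, renormalizing the measure over the worlds consistent with e, can be sketched as follows. The worlds and measures are illustrative, not from the slides:

```python
# Illustrative (world, mu) pairs; the measures sum to 1.
worlds = [
    ({"Flu": True,  "Sneeze": True},  0.1),
    ({"Flu": True,  "Sneeze": False}, 0.1),
    ({"Flu": False, "Sneeze": True},  0.3),
    ({"Flu": False, "Sneeze": False}, 0.5),
]

def cond_prob(h, e):
    """P(h|e) = P(h and e) / P(e); undefined when P(e) = 0."""
    p_e = sum(mu for w, mu in worlds if e(w))
    if p_e == 0:
        raise ValueError("P(e) = 0: conditioning undefined")
    return sum(mu for w, mu in worlds if h(w) and e(w)) / p_e

print(cond_prob(lambda w: w["Flu"], lambda w: w["Sneeze"]))  # 0.1 / (0.1 + 0.3)
```

Dividing by P(e) is exactly the constant c = 1/P(e) above: the surviving worlds keep their relative measures and are rescaled to sum to 1.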

SLIDE 17

Conditioning

Possible Worlds:

SLIDE 18

Conditioning

Possible Worlds: Observe Color = orange:

SLIDE 19

Exercise

Flu    Sneeze  Snore   µ
true   true    true    0.064
true   true    false   0.096
true   false   true    0.016
true   false   false   0.024
false  true    true    0.096
false  true    false   0.144
false  false   true    0.224
false  false   false   0.336

What is:
(a) P(flu ∧ sneeze)
(b) P(flu ∧ ¬sneeze)
(c) P(flu)
(d) P(sneeze | flu)
(e) P(¬flu ∧ sneeze)
(f) P(flu | sneeze)
(g) P(sneeze | flu ∧ snore)
(h) P(flu | sneeze ∧ snore)
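One way to check your answers is to encode the table and sum measures, as in this sketch (the helper names are mine; only parts (a) and (f) are queried, the rest follow the same pattern):

```python
# The joint measure from the table: (flu, sneeze, snore) -> mu.
mu = {
    (True,  True,  True):  0.064, (True,  True,  False): 0.096,
    (True,  False, True):  0.016, (True,  False, False): 0.024,
    (False, True,  True):  0.096, (False, True,  False): 0.144,
    (False, False, True):  0.224, (False, False, False): 0.336,
}

def p(pred):
    """P of a proposition given as a predicate on (flu, sneeze, snore)."""
    return sum(m for world, m in mu.items() if pred(*world))

def p_cond(h, e):
    """P(h|e) = P(h and e) / P(e)."""
    return p(lambda *w: h(*w) and e(*w)) / p(e)

print(p(lambda flu, sneeze, snore: flu and sneeze))   # (a) P(flu ∧ sneeze)
print(p_cond(lambda flu, sneeze, snore: flu,
             lambda flu, sneeze, snore: sneeze))      # (f) P(flu | sneeze)
```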

SLIDE 22

Chain Rule

P(f1 ∧ f2 ∧ · · · ∧ fn)
= P(fn | f1 ∧ · · · ∧ fn−1) × P(f1 ∧ · · · ∧ fn−1)
= P(fn | f1 ∧ · · · ∧ fn−1) × P(fn−1 | f1 ∧ · · · ∧ fn−2) × P(f1 ∧ · · · ∧ fn−2)
= P(fn | f1 ∧ · · · ∧ fn−1) × P(fn−1 | f1 ∧ · · · ∧ fn−2) × · · · × P(f3 | f1 ∧ f2) × P(f2 | f1) × P(f1)
= ∏_{i=1}^{n} P(fi | f1 ∧ · · · ∧ fi−1)
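The factorization can be checked numerically on a concrete joint, here reusing the flu/sneeze/snore measures from the exercise (an illustrative check, not part of the slides):

```python
# Check the chain rule P(f1 ∧ f2 ∧ f3) = P(f1) × P(f2|f1) × P(f3|f1 ∧ f2)
# on the joint measure from the exercise slide.
mu = {
    (True,  True,  True):  0.064, (True,  True,  False): 0.096,
    (True,  False, True):  0.016, (True,  False, False): 0.024,
    (False, True,  True):  0.096, (False, True,  False): 0.144,
    (False, False, True):  0.224, (False, False, False): 0.336,
}

def p(pred):
    return sum(m for world, m in mu.items() if pred(*world))

lhs = p(lambda f, sn, sr: f and sn and sr)                        # P(f1 ∧ f2 ∧ f3)
rhs = (p(lambda f, sn, sr: f)                                     # P(f1)
       * p(lambda f, sn, sr: f and sn) / p(lambda f, sn, sr: f)   # P(f2|f1)
       * lhs / p(lambda f, sn, sr: f and sn))                     # P(f3|f1 ∧ f2)
print(abs(lhs - rhs) < 1e-12)  # True: the conditionals telescope
```

Each conditional's denominator cancels the previous factor's numerator, which is why the product collapses to the joint.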

SLIDE 27

Bayes’ theorem

The chain rule and commutativity of conjunction (h ∧ e is equivalent to e ∧ h) gives us:

P(h ∧ e) = P(h|e) × P(e) = P(e|h) × P(h).

If P(e) ≠ 0, divide the right-hand sides by P(e):

P(h|e) = P(e|h) × P(h) / P(e).

This is Bayes’ theorem.
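In code, Bayes’ theorem with P(e) expanded by the two cases h and ¬h looks like the following sketch. The disease/test numbers are invented for illustration, not from the slides:

```python
def posterior(p_e_given_h, p_h, p_e_given_not_h):
    """P(h|e) = P(e|h) P(h) / P(e), with P(e) expanded over h and not-h."""
    p_e = p_e_given_h * p_h + p_e_given_not_h * (1 - p_h)
    return p_e_given_h * p_h / p_e

# Illustrative numbers: a rare condition (prior 0.01) and a noisy test
# that reports positive with probability 0.9 if present, 0.1 if absent.
print(posterior(0.9, 0.01, 0.1))  # ≈ 0.083: the prior dominates the evidence
```

Even a fairly reliable test yields a small posterior when the prior is small, which is the point of the exercises that follow.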

SLIDE 28

Why is Bayes’ theorem interesting?

Often you have causal knowledge:
P(symptom | disease)
P(light is off | status of switches and switch positions)
P(alarm | fire)
P(image looks like | a tree is in front of a car)
and want to do evidential reasoning:
P(disease | symptom)
P(status of switches | light is off and switch positions)
P(fire | alarm)
P(a tree is in front of a car | image looks like)

SLIDE 30

Exercise

A cab was involved in a hit-and-run accident at night. Two cab companies, the Green and the Blue, operate in the city. You are given the following data: 85% of the cabs in the city are Green and 15% are Blue. A witness identified the cab as Blue. The court tested the reliability of the witness under the circumstances that existed on the night of the accident and concluded that the witness correctly identifies each of the two colours 80% of the time and fails 20% of the time. What is the probability that the cab involved in the accident was Blue? From D. Kahneman, Thinking, Fast and Slow, 2011, p. 166.
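Applying Bayes’ theorem from the earlier slide to the stated numbers can be written out as follows (variable names are mine):

```python
# P(Blue | says Blue) = P(says Blue | Blue) P(Blue) / P(says Blue),
# with P(says Blue) expanded over the two cab colours.
p_blue = 0.15
p_says_blue_given_blue = 0.80
p_says_blue_given_green = 0.20

p_says_blue = (p_says_blue_given_blue * p_blue
               + p_says_blue_given_green * (1 - p_blue))
p_blue_given_says_blue = p_says_blue_given_blue * p_blue / p_says_blue
print(p_blue_given_says_blue)  # 12/29 ≈ 0.41
```

So despite the witness's 80% reliability, the base rate keeps the posterior below one half.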

SLIDE 32

Exercise

A cab was involved in a hit-and-run accident at night. Two cab companies, the Green and the Blue, operate in the city. You are given the following data: The two companies operate the same number of cabs, but Green cabs are involved in 85% of the accidents. A witness identified the cab as Blue. The court tested the reliability of the witness under the circumstances that existed on the night of the accident and concluded that the witness correctly identifies each of the two colours 80% of the time and fails 20% of the time. What is the probability that the cab involved in the accident was Blue? From D. Kahneman, Thinking, Fast and Slow, 2011, p. 167, Chapter 16, “Causes trump statistics”.
