Formal Modeling in Cognitive Science Lecture 25: Entropy, Joint - PowerPoint PPT Presentation

Entropy Entropy Formal Modeling in Cognitive Science Lecture 25: Entropy, Joint Entropy, Conditional Entropy 1 Entropy Entropy and Information Joint Entropy Frank Keller Conditional Entropy School of Informatics University of Edinburgh keller@inf.ed.ac.uk March 6, 2006 Frank Keller Formal Modeling in Cognitive Science 1 Frank Keller Formal Modeling in Cognitive Science 2 Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Entropy and Information Entropy and Information Definition: Entropy Example: 8-sided die If X is a discrete random variable and f ( x ) is the value of its Suppose you are reporting the result of rolling a fair probability distribution at x , then the entropy of X is: eight-sided die. What is the entropy? 1 The probability distribution is f ( x ) = 8 for x = � H ( X ) = − f ( x ) log 2 f ( x ) 1 . . . 8. Therefore entropy is: x ∈ X 8 8 1 8 log 1 � � H ( X ) = − f ( x ) log f ( x ) = − Entropy is measured in bits (the log is log 2 ); 8 x =1 x =1 intuitively, it measures amount of information (or uncertainty) − log 1 = 8 = log 8 = 3 bits in random variable; it can also be interpreted as the length of message to transmit This means the average length of a message required to transmit an outcome of the random variable; the outcome of the roll of the die is 3 bits. note that H ( X ) ≥ 0 by definition. Frank Keller Formal Modeling in Cognitive Science 3 Frank Keller Formal Modeling in Cognitive Science 4

Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Entropy and Information Example: simplified Polynesian Example: simplified Polynesian Example: 8-sided die Polynesian languages are famous for their small alphabets. Assume Suppose you wish to send the result of rolling the die. What is the a language with the following letters and associated probabilities: most efficient way to encode the message? x p t k a i u The entropy of the random variable is 3 bits. That means the 1 1 1 1 1 1 f(x) 8 4 8 4 8 8 outcome of the random variable can be encoded as 3 digit binary What is the per-character entropy for this language? message: � 1 2 3 4 5 6 7 8 H ( X ) = − f ( x ) log f ( x ) 001 010 011 100 101 110 111 000 x ∈{ p , t , k , a , i , u } − (4 log 1 8 + 2 log 1 4) = 21 = 2 bits Frank Keller Formal Modeling in Cognitive Science 5 Frank Keller Formal Modeling in Cognitive Science 6 Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Example: simplified Polynesian Properties of Entropy Theorem: Entropy Example: simplified Polynesian If X is a binary random variable with the distribution f (0) = p and Now let’s design a code that takes 2 1 2 bits to transmit a letter: f (1) = 1 − p , then: H ( X ) = 0 if p = 0 or p = 1 p t k a i u 100 00 101 01 110 111 max H ( X ) for p = 1 2 Any code is suitable, as long as it uses two digits to encode the Intuitively, an entropy of 0 means that the outcome of the random high probability letters, and three digits to encode the low variable is determinate; it contains no information (or uncertainty). probability letters. If both outcomes are equally likely (p = 1 2 ), then we have maximal uncertainty. Frank Keller Formal Modeling in Cognitive Science 7 Frank Keller Formal Modeling in Cognitive Science 8

Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Properties of Entropy Joint Entropy Visualize the content of the previous theorem: 1 Definition: Joint Entropy If X and Y are discrete random variables and f ( x , y ) is the value 0.8 of their joint probability distribution at ( x , y ), then the joint entropy of X and Y is: 0.6 H(X) � � H ( X , Y ) = − f ( x , y ) log f ( x , y ) 0.4 x ∈ X y ∈ Y The joint entropy represents the amount of information needed on 0.2 average to specify the value of two discrete random variables. 0 0 0.2 0.4 0.6 0.8 1 p Frank Keller Formal Modeling in Cognitive Science 9 Frank Keller Formal Modeling in Cognitive Science 10 Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Conditional Entropy Conditional Entropy Example: simplified Polynesian Definition: Conditional Entropy Now assume that you have the joint probability of a vowel and a consonant occurring together in the same syllable: If X and Y are discrete random variables and f ( x , y ) and f ( y | x ) are the values of their joint and conditional probability f ( x , y ) p t k f ( y ) 1 3 1 1 distributions, then: a 16 8 16 2 1 3 1 i 0 16 16 4 � � 3 1 1 H ( Y | X ) = − f ( x , y ) log f ( y | x ) u 0 16 16 4 1 3 1 f ( x ) x ∈ X y ∈ Y 8 4 8 Compute the conditional probabilities; for example: is the conditional entropy of Y given X . 1 f ( a , p ) = 1 16 f ( a | p ) = = The conditional entropy indicates how much extra information you f ( p ) 1 2 8 still need to supply on average to communicate Y given that the 3 f ( a , t ) = 1 8 f ( a | t ) = = other party knows X . 3 f ( t ) 2 4 Frank Keller Formal Modeling in Cognitive Science 11 Frank Keller Formal Modeling in Cognitive Science 12

Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Conditional Entropy Conditional Entropy For probability distributions we defined: Example: simplified Polynesian Now compute the conditional entropy of a vowel given a consonant: f ( y | x ) = f ( x , y ) g ( x ) H ( V | C ) = − � � f ( x , y ) log f ( y | x ) x ∈ C y ∈ V A similar theorem holds for entropy: = − ( f ( a , p ) log f ( a | p ) + f ( a , t ) log f ( a | t ) + f ( a , k ) log f ( a | k )+ f ( i , p ) log f ( i | p ) + f ( i , t ) log f ( i | t ) + f ( i , k ) log f ( i | k )+ Theorem: Conditional Entropy f ( u , p ) log f ( u | p ) + f ( u , t ) log f ( u | t ) + f ( u , k ) log f ( u | k )) 1 3 1 If X and Y are discrete random variables with joint entropy − ( 1 8 + 3 4 + 1 = 16 log 8 log 16 log 8 + 16 8 16 1 3 1 H ( X , Y ) and the marginal entropy of X is H ( X ), then: 1 3 1 8 + 3 16 log 16 log 4 + 0+ 16 16 1 3 3 1 0 + 3 4 + 1 16 log 16 log 8 ) H ( Y | X ) = H ( X , Y ) − H ( X ) 16 16 3 1 11 = 8 = 1 . 375 bits Division instead of subtraction as entropy is defined on logarithms. Frank Keller Formal Modeling in Cognitive Science 13 Frank Keller Formal Modeling in Cognitive Science 14 Entropy and Information Entropy and Information Entropy Joint Entropy Entropy Joint Entropy Conditional Entropy Conditional Entropy Conditional Entropy Summary Example: simplified Polynesian Use the previous theorem to compute the joint entropy of a consonant and a vowel. First compute H ( C ): Entropy measures the amount of information in a random variable or the length of the message required to transmit the � H ( C ) = − f ( x ) log f ( x ) outcome; x ∈ C joint entropy is the amount of information in two (or more) = − ( f ( p ) log f ( p ) + f ( t ) log f ( t ) + f ( k ) log f ( k )) random variables; − (1 8 log 1 8 + 3 4 log 3 4 + 1 8 log 1 = 8) conditional entropy is the amount of information in one = 1 . 061 bits random variable given we already know the other. Then we can compute the joint entropy as: H ( V , C ) = H ( V | C ) + H ( C ) = 1 . 375 + 1 . 061 = 2 . 436 bits Frank Keller Formal Modeling in Cognitive Science 15 Frank Keller Formal Modeling in Cognitive Science 16

Formal Modeling in Cognitive Science Lecture 25: Entropy, Joint - PowerPoint PPT Presentation

Entropy Entropy Formal Modeling in Cognitive Science Lecture 25: Entropy, Joint Entropy, Conditional Entropy 1 Entropy Entropy and Information Joint Entropy Frank Keller Conditional Entropy School of Informatics University of Edinburgh

Cognitive Modeling Symbolic School Lecture 2: Approaches Symbolic Models 2 Symbolic

Formal Definition of a Finite Automaton Formal Definition of a Finite Automaton p.1/23 Why a

Cognitive Interviewing Debbie Collins What is cognitive interviewing? Cognitive interviewing

Formal Methods and Cryptography Lecture 25 Formal Methods Formal Methods Logical foundations

Formal Methods and Cryptography Lecture 24 1 Formal Methods 2 Formal Methods Logical

Cognitive Computing Venkat N Gudivada East Carolina University Greenville, North Carolina USA

Cognitive Load Effects Robert Whelan E19.2174 Cognitive Science & Educational Technology I

Formal Verification of RISC-V cores with riscv-formal Clifford Wolf CTO, Symbiotic EDA

1 PSYC530 wk01 handouts Cognitive Engineering Cognitive Science Applied to Human Factors

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

Virtual Reality Modeling Virtual Reality Modeling from http://www.okino.com/ Modeling Modeling

Formal Modeling in Cognitive Science Lecture 20: Joint, Marginal, and Conditional Distributions

Formal Modeling in Cognitive Science Lecture 17: Sample Spaces, Events, Probabilities Steve

Formal Modeling in Cognitive Science Lecture 18: Conditional Probability; Bayes Theorem Steve

Formal Modeling in Cognitive Science Lecture 19: Application of Bayes Theorem; Discrete Random

Formal Modeling in Cognitive Science Uniform Distribution Lecture 27: Special Distributions;

7. What Is Joint Action Shared Agency? butterfillS@ceu.hu butterfillS@ceu.hu Outline 1. The

SEQUENTIAL INTERCEPT MAPPING IMPROVING THE BEHAVIORAL HEALTH CRISIS SYSTEM IN YELLOWSTONE COUNTY

Transforming teaching and learning lessons learnt from the London Challenge Professor David

Notation Atkinson, A.B. and A. Brandolini:

Trans-dimensional What if the number of things you dont know is one of the things you

Down the Borel Hierarchy: Solving Muller Games via Safety Games Joint work with Daniel Neider and

The ubiquity of pseudofinite metric structures (joint work with Bradd Hart) Isaac Goldbring

ssts rs r r

Sambuz

Useful Links

Newsletter

Mail Us

Formal Modeling in Cognitive Science Lecture 25: Entropy, Joint - PowerPoint PPT Presentation

Entropy Entropy Formal Modeling in Cognitive Science Lecture 25: Entropy, Joint Entropy, Conditional Entropy 1 Entropy Entropy and Information Joint Entropy Frank Keller Conditional Entropy School of Informatics University of Edinburgh

Cognitive Modeling Symbolic School Lecture 2: Approaches Symbolic Models 2 Symbolic

Formal Definition of a Finite Automaton Formal Definition of a Finite Automaton p.1/23 Why a

Cognitive Interviewing Debbie Collins What is cognitive interviewing? Cognitive interviewing

Formal Methods and Cryptography Lecture 25 Formal Methods Formal Methods Logical foundations

Formal Methods and Cryptography Lecture 24 1 Formal Methods 2 Formal Methods Logical

Cognitive Computing Venkat N Gudivada East Carolina University Greenville, North Carolina USA

Cognitive Load Effects Robert Whelan E19.2174 Cognitive Science &amp; Educational Technology I

Formal Verification of RISC-V cores with riscv-formal Clifford Wolf CTO, Symbiotic EDA

1 PSYC530 wk01 handouts Cognitive Engineering Cognitive Science Applied to Human Factors

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

Virtual Reality Modeling Virtual Reality Modeling from http://www.okino.com/ Modeling Modeling

Formal Modeling in Cognitive Science Lecture 20: Joint, Marginal, and Conditional Distributions

Formal Modeling in Cognitive Science Lecture 17: Sample Spaces, Events, Probabilities Steve

Formal Modeling in Cognitive Science Lecture 18: Conditional Probability; Bayes Theorem Steve

Formal Modeling in Cognitive Science Lecture 19: Application of Bayes Theorem; Discrete Random

Formal Modeling in Cognitive Science Uniform Distribution Lecture 27: Special Distributions;

7. What Is Joint Action Shared Agency? butterfillS@ceu.hu butterfillS@ceu.hu Outline 1. The

SEQUENTIAL INTERCEPT MAPPING IMPROVING THE BEHAVIORAL HEALTH CRISIS SYSTEM IN YELLOWSTONE COUNTY

Transforming teaching and learning lessons learnt from the London Challenge Professor David

Notation Atkinson, A.B. and A. Brandolini:

Trans-dimensional What if the number of things you dont know is one of the things you

Down the Borel Hierarchy: Solving Muller Games via Safety Games Joint work with Daniel Neider and

The ubiquity of pseudofinite metric structures (joint work with Bradd Hart) Isaac Goldbring

ssts rs r r

Sambuz

Useful Links

Newsletter

Mail Us

Cognitive Load Effects Robert Whelan E19.2174 Cognitive Science & Educational Technology I