

SLIDE 1

Entropy, Relative Entropy, Cross Entropy

SLIDE 2

Entropy

Entropy, H(X), is a measure of the uncertainty of a discrete random variable (the defining formula is recalled below). Properties:

  • H(X) >= 0
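
The slide's formula is not preserved in this extraction; the standard Shannon definition, for a discrete random variable X with probability mass function p(x) (with log base 2, matching the die example on Slide 4), is:

    H(X) = -\sum_{x} p(x) \log_2 p(x)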
SLIDE 3

Entropy

SLIDE 4

Entropy

  • The lower the probability of an event, the larger the information its occurrence carries; entropy is the average of this information content.

The entropy of a six-sided fair die is log₂ 6, as computed below.
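As a quick check (worked out here, not on the slide), plugging the uniform pmf p(x) = 1/6 into the definition gives:

    H(X) = -\sum_{i=1}^{6} \frac{1}{6} \log_2 \frac{1}{6} = \log_2 6 \approx 2.585 bits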

SLIDE 5

Entropy : Properties

Primer on Probability Fundamentals (the last two items are recalled below)

  • Random Variable
  • Probability
  • Expectation
  • Linearity of Expectation
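
As a refresher (not on the original slide), the expectation of a discrete random variable X and the linearity property can be stated as:

    E[X] = \sum_{x} x \, p(x), \qquad E[aX + bY] = a\,E[X] + b\,E[Y]

Linearity holds even when X and Y are dependent.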
SLIDE 6

Entropy : Properties

Primer on Probability Fundamentals

  • Jensen’s Inequality

Ex: if f is a convex function and X is a random variable, then E[f(X)] >= f(E[X]); for instance, with f(x) = x² this gives E[X²] >= (E[X])².

SLIDE 7

Entropy : Properties

  • H(U) >= 0, where U takes values in {u1, u2, …, uM}
  • H(U) <= log(M), as sketched below
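
A short proof sketch for the upper bound (not on the slide; it uses the non-negativity of relative entropy, introduced on Slide 9): with u the uniform distribution assigning 1/M to each value,

    \log M - H(U) = \sum_{x} p(x) \log \frac{p(x)}{1/M} = D(p \| u) \ge 0,

with equality exactly when p is uniform.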
SLIDE 8

Entropy between a pair of R.V.s (standard definitions below)

  • Joint Entropy
  • Conditional Entropy
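
The slide's formulas are not preserved in this extraction; for a pair (X, Y) with joint pmf p(x, y), the standard definitions are:

    H(X, Y) = -\sum_{x}\sum_{y} p(x, y) \log p(x, y)

    H(Y \mid X) = -\sum_{x}\sum_{y} p(x, y) \log p(y \mid x)

They satisfy the chain rule H(X, Y) = H(X) + H(Y | X).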
SLIDE 9

Relative Entropy aka Kullback-Leibler Distance

D(p||q) is a measure of the inefficiency of assuming that the distribution is q, when the true distribution is p.

  • H(p) : average description length when coding with the true distribution p.
  • H(p) + D(p||q) : average description length when coding with the approximating distribution q.

If X is a random variable and p(x), q(x) are probability mass functions, then

    D(p \| q) = \sum_{x} p(x) \log \frac{p(x)}{q(x)}

SLIDE 10

Relative Entropy/ K-L Divergence : Properties

Properties of D(p||q):

  • Non-negative: D(p||q) >= 0.
  • D(p||q) = 0 if and only if p = q.
  • Non-symmetric, and does not satisfy the triangle inequality; it is therefore a divergence rather than a distance.

SLIDE 11

Relative Entropy/ K-L Divergence : Properties

Asymmetry: let X be a random variable taking values in {0, 1}, and consider two distributions p, q on X with p(0) = 1-r, p(1) = r and q(0) = 1-s, q(1) = s. If r = s, then D(p||q) = D(q||p) = 0; but for r != s, in general D(p||q) != D(q||p), as the numerical sketch below illustrates.
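
A minimal numerical sketch (not from the slides; the parameter values r = 0.5 and s = 0.25 are arbitrary choices for illustration):

    import math

    def kl(p, q):
        # Relative entropy D(p||q) in bits, for discrete distributions given
        # as lists of probabilities; assumes q > 0 wherever p > 0.
        return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

    r, s = 0.5, 0.25        # arbitrary Bernoulli parameters with r != s
    p = [1 - r, r]          # p(0), p(1)
    q = [1 - s, s]          # q(0), q(1)

    print(kl(p, q))         # ≈ 0.2075 bits
    print(kl(q, p))         # ≈ 0.1887 bits -- differs: D is not symmetric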

SLIDE 12

Relative Entropy/ K-L Divergence : Properties

Non-negativity:
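
The slide's derivation is not preserved; the standard argument (plausibly what was shown, since the deck introduced Jensen's inequality earlier) applies Jensen's inequality to the concave function log:

    -D(p \| q) = \sum_{x} p(x) \log \frac{q(x)}{p(x)} \le \log \sum_{x} p(x) \frac{q(x)}{p(x)} = \log \sum_{x} q(x) \le \log 1 = 0,

where the sums run over the support of p; hence D(p||q) >= 0, with equality iff p = q.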

SLIDE 13

Relative Entropy/ K-L Divergence : Properties

SLIDE 14

Relative Entropy of joint distributions as Mutual Information

Mutual Information is a measure of the amount of information that one random variable contains about another random variable: it is the reduction in the uncertainty of one random variable due to knowledge of the other. The defining formula is recalled below.

  • Unlike Relative Entropy, Mutual Information is symmetric: I(X;Y) = I(Y;X).
  • It is also non-negative: I(X;Y) >= 0.
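
The slide's formula is not preserved in this extraction; the standard definition, which exhibits mutual information as the relative entropy between the joint distribution and the product of the marginals (matching this slide's title), is:

    I(X; Y) = \sum_{x}\sum_{y} p(x, y) \log \frac{p(x, y)}{p(x)\,p(y)} = D\big(p(x, y) \,\big\|\, p(x)\,p(y)\big)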

SLIDE 15

Relationship between Entropy and Mutual Information

SLIDE 16
  • I(X;X) = H(X) - H(X|X) = H(X), since H(X|X) = 0

Mutual Information of a random variable with itself is the entropy of the random variable. This is the reason that entropy is sometimes referred to as self-information.

Relationship between Entropy and Mutual Information

Intuitively, the entropy of a random variable X with probability distribution p(x) reflects how much p(x) diverges from the uniform distribution on the support of X: the more p(x) diverges, the lower the entropy, and vice versa. The identity below makes this precise.
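
With u the uniform distribution on the support of X (of size |X|), the same calculation used for the H(U) <= log(M) bound gives (not on the slide):

    H(X) = \log |X| - D(p \| u)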

SLIDE 17

Relationship between Entropy and Mutual Information

[Venn diagram: H(X,Y) decomposed into H(X|Y), I(X;Y), and H(Y|X)]

Conditioning reduces entropy: H(X|Y) <= H(X), since 0 <= I(X;Y) = H(X) - H(X|Y). The diagram's decomposition corresponds to the standard identities below.
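
In full (standard identities, not preserved from the slide):

    I(X; Y) = H(X) - H(X \mid Y) = H(Y) - H(Y \mid X) = H(X) + H(Y) - H(X, Y)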

SLIDE 18

Cross Entropy vs K-L Divergence

SLIDE 19

Cross Entropy vs K-L Divergence

SLIDE 20

Cross Entropy vs K-L Divergence

Entropy: a random variable carries information about itself (self-information). Cross-Entropy: compares the true distribution A with an approximating distribution B.

Relative Entropy: compares the true distribution A with how the approximating distribution B differs from A at each sample point (a divergence, or difference).

Cross-entropy = divergence + entropy

[A random variable knows about itself (entropy) and, from its perspective, compares its true distribution with the approximated distribution through divergence.] Minimizing the divergence and minimizing the cross-entropy therefore have the same effect, as spelled out below.
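
In symbols (standard statement, not preserved from the slides; writing p for the true distribution A and q for the approximation B):

    H(p, q) = -\sum_{x} p(x) \log q(x) = H(p) + D(p \| q)

Since H(p) does not depend on q, minimizing H(p, q) over q is equivalent to minimizing D(p||q).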

[Annotated equation from the slide: the entropy term labeled "true distribution", the divergence term labeled "how B differs from A"]

SLIDE 21

Questions? Thank You