Probability Distribu.ons on Structured Objects September - PowerPoint PPT Presentation

Probability ¡Distribu.ons ¡on ¡ Structured ¡Objects ¡ September ¡17, ¡2013 ¡

Reminder ¡ • HW1 ¡is ¡due ¡at ¡11:59pm ¡tonight ¡ • There ¡was ¡some ¡ambiguity ¡in ¡this ¡assignment ¡ • The ¡TAs ¡gave ¡a ¡lot ¡of ¡help, ¡but ¡in ¡general, ¡ learning ¡to ¡work ¡from ¡incomplete ¡specs ¡is ¡ important ¡

Probability ¡Outline ¡ • Why ¡probability? ¡ • Probability ¡review ¡ • Mul.nomials ¡vs. ¡exponen.al ¡parameteriza.on ¡ • Locally ¡vs. ¡globally ¡normalized ¡models ¡& ¡ par..on ¡func.ons ¡ • Examples ¡

Why ¡Probability? ¡ • Probability ¡formalizes ¡ – The ¡concept ¡of ¡ models ¡ – The ¡concept ¡of ¡ data ¡ – The ¡concept ¡of ¡ learning ¡ – The ¡concept ¡of ¡ predic0on ¡(inference) ¡ Probability is expectation founded upon partial knowledge.

Why ¡Probability? ¡ • What ¡might ¡we ¡have ¡par.al ¡knowledge ¡ about? ¡ – The ¡state ¡of ¡the ¡world ¡(test ¡data) ¡ – The ¡reliability ¡of ¡our ¡training ¡data ¡ – The ¡correctness ¡of ¡our ¡model ¡ – The ¡values ¡of ¡our ¡parameters ¡ p ( x | partial knowledge)

What ¡is ¡a ¡Probability? ¡ • Limi0ng ¡(rela0ve) ¡frequency ¡of ¡events ¡ – in ¡repeated ¡(iden.cal) ¡experiments ¡ • Degree ¡of ¡belief ¡ – Subjec.ve ¡concep.on ¡ – 40% ¡chance ¡of ¡rain ¡tomorrow ¡in ¡PiXsburgh ¡ • Viewpoint ¡affects ¡ – interpreta.on ¡ – not ¡rules ¡of ¡probability ¡calculus ¡themselves ¡

Discrete ¡Distribu.ons ¡ Sample ¡space ¡ x 1 x 2 Ω Discrete ¡distribu.on: ¡ ¡ ¡ ¡ ¡ ¡ ¡is ¡ finite ¡or ¡ countable , ¡ Ω but ¡no ¡bigger ¡ ¡

Discrete ¡Distribu.ons ¡ f ( x ) ∈ [0 , 1] ∀ x ∈ Ω , X f ( x ) = 1 x ∈ Ω Probability ¡mass ¡func.on ¡ An ¡ event ¡is ¡a ¡subset ¡(maybe ¡one ¡element) ¡ of ¡the ¡sample ¡space, ¡ ¡ E ⊆ Ω X P ( E ) = f ( x ) x ∈ E

Random ¡Variables ¡ A ¡ random ¡variable ¡ is ¡a ¡func.on ¡from ¡a ¡random ¡event ¡ from ¡a ¡set ¡of ¡possible ¡outcomes ¡( ¡ ¡ ¡) ¡and ¡a ¡probability ¡ Ω distribu.on ¡( ¡ ¡), ¡a ¡func.on ¡from ¡outcomes ¡to ¡ ρ probabili.es. ¡ Ω = { 1 , 2 , 3 , 4 , 5 , 6 } X ( ω ) = ω ( 1 if x = 1 , 2 , 3 , 4 , 5 , 6 6 ρ X ( x ) = 0 otherwise

Random ¡Variables ¡ A ¡ random ¡variable ¡ is ¡a ¡func.on ¡from ¡a ¡random ¡event ¡ from ¡a ¡set ¡of ¡possible ¡outcomes ¡( ¡ ¡ ¡) ¡and ¡a ¡probability ¡ Ω distribu.on ¡( ¡ ¡), ¡a ¡func.on ¡from ¡outcomes ¡to ¡ ρ probabili.es. ¡ Ω = { 1 , 2 , 3 , 4 , 5 , 6 } ( 0 if ω ∈ { 2 , 4 , 6 } Y ( ω ) = 1 otherwise ( 1 if y = 0 , 1 2 ρ Y ( y ) = 0 otherwise

Sampling ¡Nota.on ¡ x = 4 × z + 1 . 7 Expression y ∼ Distribution( θ ) Variable Distribution Random variable Parameter

Sampling ¡Nota.on ¡ x = 4 × z + 1 . 7 y ∼ Distribution( θ ) Distribution Random variable Parameter

Sampling ¡Nota.on ¡ x = 4 × z + 1 . 7 y ∼ Distribution( θ ) y 0 = y × x Random variable

Joint ¡Probability ¡ • Probability ¡over ¡mul.ple ¡event ¡types ¡ • Tool ¡for ¡reasoning ¡about ¡dependent ¡ (correlated) ¡events ¡ A ¡ joint ¡probability ¡distribu0on ¡ is ¡a ¡probability ¡ distribu.on ¡over ¡r.v.’s ¡with ¡the ¡following ¡form: ¡  X ( ω ) � Z = Y ( ω ) ✓ �◆ ✓ �◆ x x X = 1 ≥ 0 ∀ x ∈ X , y ∈ Y ρ Z ρ Z y y x ∈ X ,y ∈ Y

Joint ¡Probability ¡ • Probability ¡over ¡mul.ple ¡event ¡types ¡ • Tool ¡for ¡reasoning ¡about ¡dependent ¡ (correlated) ¡events ¡ A ¡ joint ¡probability ¡distribu0on ¡ is ¡a ¡probability ¡ distribu.on ¡over ¡r.v.’s ¡with ¡the ¡following ¡form: ¡  X ( ω ) � Words ¡ Z = Y ( ω ) Tags ¡ ✓ �◆ ✓ �◆ x x X = 1 ≥ 0 ∀ x ∈ X , y ∈ Y ρ Z ρ Z y y x ∈ X ,y ∈ Y

Joint ¡Probability ¡ • Probability ¡over ¡mul.ple ¡event ¡types ¡ • Tool ¡for ¡reasoning ¡about ¡dependent ¡ (correlated) ¡events ¡ A ¡ joint ¡probability ¡distribu0on ¡ is ¡a ¡probability ¡ distribu.on ¡over ¡r.v.’s ¡with ¡the ¡following ¡form: ¡  X ( ω ) � Words ¡ Z = Y ( ω ) Trees ¡ ✓ �◆ ✓ �◆ x x X = 1 ≥ 0 ∀ x ∈ X , y ∈ Y ρ Z ρ Z y y x ∈ X ,y ∈ Y

Joint ¡Probability ¡ • Probability ¡over ¡mul.ple ¡event ¡types ¡ • Tool ¡for ¡reasoning ¡about ¡dependent ¡ (correlated) ¡events ¡ A ¡ joint ¡probability ¡distribu0on ¡ is ¡a ¡probability ¡ distribu.on ¡over ¡r.v.’s ¡with ¡the ¡following ¡form: ¡  X ( ω ) � DNA ¡sequence ¡ Z = Y ( ω ) Proteins ¡ ✓ �◆ ✓ �◆ x x X = 1 ≥ 0 ∀ x ∈ X , y ∈ Y ρ Z ρ Z y y x ∈ X ,y ∈ Y

Ω = { 1 , 2 , 3 , 4 , 5 , 6 } X ( ω ) = ω

Ω = { 1 , 2 , 3 , 4 , 5 , 6 } X ( ω ) = ω Ω = { (1 , 1) , (1 , 2) , (1 , 3) , (1 , 4) , (1 , 5) , (1 , 6) , (2 , 1) , (2 , 2) , (2 , 3) , (2 , 4) , (2 , 5) , (2 , 6) , (3 , 1) , (3 , 2) , (3 , 3) , (3 , 4) , (3 , 5) , (3 , 6) , (4 , 1) , (4 , 2) , (4 , 3) , (4 , 4) , (4 , 5) , (4 , 6) , (5 , 1) , (5 , 2) , (5 , 3) , (5 , 4) , (5 , 5) , (5 , 6) , (6 , 1) , (6 , 2) , (6 , 3) , (6 , 4) , (6 , 5) , (6 , 6) , } X ( ω ) = ω 1 Y ( ω ) = ω 2 ( 1 if ( x, y ) ∈ Ω 36 ρ X,Y ( x, y ) = 0 otherwise

Ω = { 1 , 2 , 3 , 4 , 5 , 6 } X ( ω ) = ω Ω = { (1 , 1) , (1 , 2) , (1 , 3) , (1 , 4) , (1 , 5) , (1 , 6) , (2 , 1) , (2 , 2) , (2 , 3) , (2 , 4) , (2 , 5) , (2 , 6) , (3 , 1) , (3 , 2) , (3 , 3) , (3 , 4) , (3 , 5) , (3 , 6) , (4 , 1) , (4 , 2) , (4 , 3) , (4 , 4) , (4 , 5) , (4 , 6) , (5 , 1) , (5 , 2) , (5 , 3) , (5 , 4) , (5 , 5) , (5 , 6) , (6 , 1) , (6 , 2) , (6 , 3) , (6 , 4) , (6 , 5) , (6 , 6) , } X ( ω ) = ω 1 Y ( ω ) = ω 2 ( x + y if ( x, y ) ∈ Ω 252 ρ X,Y ( x, y ) = 0 otherwise

Marginal ¡Probability ¡ p ( X = x, Y = y ) = ρ X,Y ( x, y ) X p ( X = x, Y = y 0 ) p ( X = x ) = y 0 2 Y X p ( X = x 0 , Y = y ) p ( Y = y ) = x 0 2 X Ω = { (1 , 1) , (1 , 2) , (1 , 3) , (1 , 4) , (1 , 5) , (1 , 6) , (2 , 1) , (2 , 2) , (2 , 3) , (2 , 4) , (2 , 5) , (2 , 6) , (3 , 1) , (3 , 2) , (3 , 3) , (3 , 4) , (3 , 5) , (3 , 6) , X p ( X = 4 , Y = y 0 ) p ( X = 4) = (4 , 1) , (4 , 2) , (4 , 3) , (4 , 4) , (4 , 5) , (4 , 6) , y 0 2 [1 , 6] (5 , 1) , (5 , 2) , (5 , 3) , (5 , 4) , (5 , 5) , (5 , 6) , (6 , 1) , (6 , 2) , (6 , 3) , (6 , 4) , (6 , 5) , (6 , 6) , } X p ( X = x 0 , Y = 3) p ( Y = 3) = x 0 2 [1 , 6]

Marginal ¡Probability ¡ Sample ¡space ¡ (NN , cat) (JJ , fuzzy) (NN , sloth) (VB , book) (NN , book) (RB , quickly)

Marginal ¡Probability ¡ Sample ¡space ¡ (NN , cat) (JJ , fuzzy) (NN , sloth) (NN , · ) (VB , book) (NN , book) (RB , quickly) p ( t = NN)

Marginal ¡Probability ¡ Sample ¡space ¡ (NN , cat) (JJ , fuzzy) (NN , sloth) (VB , book) (NN , book) (RB , quickly) p ( w = book)

Marginal ¡Probability ¡ Sample ¡space ¡ (NN , cat) (JJ , fuzzy) (NN , sloth) (VB , book) (NN , book) (RB , quickly)

Marginal ¡Probabili.es ¡ • In ¡a ¡joint ¡model ¡of ¡word ¡and ¡tag ¡sequences ¡ p( w , t ) ¡ – The ¡probability ¡of ¡a ¡word ¡sequence ¡p( w ) ¡ – The ¡probability ¡of ¡a ¡tag ¡sequence ¡p( t ) ¡ – The ¡probability ¡of ¡a ¡word ¡sequence ¡with ¡the ¡word ¡ “cat” ¡somewhere ¡in ¡it ¡ – The ¡probability ¡of ¡a ¡tag ¡sequence ¡containing ¡three ¡ verbs ¡in ¡a ¡row ¡

Condi.onal ¡Probability ¡ The ¡ condi0onal ¡probability ¡ is ¡defined ¡as ¡follows: ¡ p ( X = x | Y = y ) = p ( X = x, Y = y ) = joint probability p ( Y = y ) marginal This ¡assumes ¡ p ( Y = y ) 6 = 0 We ¡can ¡construct ¡joint ¡probability ¡distribu.ons ¡out ¡of ¡ condi.onal ¡distribu.ons: ¡ p ( x | y ) p ( y ) = p ( x, y ) = p ( y | x ) p ( x )

Condi.onal ¡Probability ¡Distribu.ons ¡ The ¡ condi0onal ¡probability ¡distribu0on ¡of ¡a ¡variable ¡X ¡ given ¡a ¡variable ¡Y ¡has ¡the ¡following ¡proper.es: ¡ X p ( X = x | Y = y ) = 1 ∀ y ∈ Y, x ∈ X

Condi.onal ¡Probability ¡ Sample ¡space ¡ (NN , cat) (JJ , fuzzy) (NN , sloth) (VB , book) (NN , book) (RB , quickly)

Condi.onal ¡Probability ¡ Sample ¡space ¡ (NN , cat) (JJ , fuzzy) (NN , sloth) (VB , book) (NN , book) (RB , quickly) p ( · | w = book)

Probability Distribu.ons on Structured Objects September - PowerPoint PPT Presentation

Probability Distribu.ons on Structured Objects September 17, 2013 Reminder HW1 is due at 11:59pm tonight There was some ambiguity in this assignment

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

Func+on applica+ons (calls, invoca+ons) lambda denotes a anonymous func+on To use a func+on, you

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Continuing Probability. Wrap up: Total Probability and Conditional Probability. Continuing

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Probability Basics Probability Background Martin Emms October 1, 2020 Probability Basics

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Study of Generalized Parton Distribu ons at Je ff erson Lab for LIGHT CONE 2016 September 05,

Counting and Probability Whats to come? Counting and Probability Whats to come?

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Introduction to SparkSQL Structured Data Processing in Spark 1 Structured Data Processing A

Greedy maximal independent sets via local limits Peleg Michaeli Tel Aviv University Workshop on

Self-similar growth-fragmentations as scaling limits of Markov branching processes Benjamin

The scaling limit of critical random graphs Christina Goldschmidt Joint work with Louigi

Combinatorial structures and algorithms: phase transition, enumeration and sampling Mihyun Kang

Branching Processes in Fluid Mechanics: An application to the Navier-Stokes and LANS-alpha

Quantum Chromodynamics Lecture 2: Leading order and showers Hadron Collider Physics Summer School

Weighted branching formulas for the hook lengths Matja Konvalinka Vanderbilt University (joint

Quasi-stationary distributions and diffusion models in population dynamics P. Cattiaux, P.

Probability Distribu.ons on Structured Objects September - PowerPoint PPT Presentation

Probability Distribu.ons on Structured Objects September 17, 2013 Reminder HW1 is due at 11:59pm tonight There was some ambiguity in this assignment

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

Func+on applica+ons (calls, invoca+ons) lambda denotes a anonymous func+on To use a func+on, you

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Continuing Probability. Wrap up: Total Probability and Conditional Probability. Continuing

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Probability Basics Probability Background Martin Emms October 1, 2020 Probability Basics

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Study of Generalized Parton Distribu ons at Je ff erson Lab for LIGHT CONE 2016 September 05,

Counting and Probability Whats to come? Counting and Probability Whats to come?

Objects &amp; Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Introduction to SparkSQL Structured Data Processing in Spark 1 Structured Data Processing A

Greedy maximal independent sets via local limits Peleg Michaeli Tel Aviv University Workshop on

Self-similar growth-fragmentations as scaling limits of Markov branching processes Benjamin

The scaling limit of critical random graphs Christina Goldschmidt Joint work with Louigi

Combinatorial structures and algorithms: phase transition, enumeration and sampling Mihyun Kang

Branching Processes in Fluid Mechanics: An application to the Navier-Stokes and LANS-alpha

Quantum Chromodynamics Lecture 2: Leading order and showers Hadron Collider Physics Summer School

Weighted branching formulas for the hook lengths Matja Konvalinka Vanderbilt University (joint

Quasi-stationary distributions and diffusion models in population dynamics P. Cattiaux, P.

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects: