Probabilistic Graphical Models with Scala

Andreas Bille, rcs systems GmbH

• From 1500: G. Cardano, games of chance
• From 1600: Fermat, Pascal, discrete spaces
• 1763: Bayes, Bayes' theorem
• 1902: Gibbs, graphs as distributions
• 1921: Wright, genetics
• 1930: Kolmogorov, modern formulation
• …
• 1972: de Dombal et al., naive Bayes model
• 1988: J. Pearl, Probabilistic Reasoning
• 1992: Heckerman et al., Pathfinder
• Today: medical diagnosis, market data, natural language, genetics, communication, social science, …

Resources
• http://probabilistic-programming.org/wiki/Home, "Existing probabilistic programming systems": a list of probabilistic programming systems including languages, implementations/compilers, as well as software libraries for constructing probabilistic models and toolkits for building probabilistic inference algorithms. Entries from that list:
• Anglican is a portable Turing-complete research probabilistic programming language that includes particle MCMC inference.
• BLOG, or Bayesian logic, is a probabilistic programming language with elements of first-order logic, as well as an MCMC-based inference algorithm. BLOG makes it relatively easy to represent uncertainty about the number of underlying objects explaining observed data.
• BUGS is a language for specifying finite graphical models and accompanying software for performing B(ayesian) I(nference) U(sing) G(ibbs) S(ampling), although modern implementations (such as WinBUGS, JAGS, and OpenBUGS) are based on Metropolis-Hastings. BiiPS is an implementation based on interacting particle systems methods like Sequential Monte Carlo.
• Church is a universal probabilistic programming language, extending Scheme with probabilistic semantics, and is well suited for describing infinite-dimensional stochastic processes and other recursively-defined generative processes (Goodman, Mansinghka, Roy, Bonawitz and Tenenbaum, 2008). Implementations of Church include MIT-Church, Cosh, Bher, and JSChurch.
• Dimple is a software tool that performs inference and learning on probabilistic graphical models via belief propagation algorithms or sampling-based algorithms.
• FACTORIE is a Scala library for creating relational factor graphs, estimating parameters and performing inference.
• Figaro is a Scala library for constructing probabilistic models that also provides a number of built-in reasoning algorithms that can be applied automatically to any constructed models.
• HANSEI is a domain-specific language embedded in OCaml, which allows one to express discrete-distribution models with potentially infinite support, perform exact inference as well as importance sampling-based inference, and model inference over inference.
• Hierarchical Bayesian Compiler (HBC) is a language for expressing and compiler for implementing hierarchical Bayesian models, with a focus on large-dimension discrete models and support for a number of non-parametric process priors.
• PRISM is a general programming language intended for symbolic-statistical modeling, and the PRISM programming system is a tool that can be used to learn the parameters of a PRISM program from data, e.g., by expectation-maximization.
• Infer.NET is a software library developed by Microsoft for expressing graphical models and implementing Bayesian inference using a variety of algorithms.
• Probabilistic-C is a C-language probabilistic programming system that, using standard compilation tools, automatically produces a compiled parallel inference executable from C-language generative model code.
• ProbLog is a probabilistic extension of Prolog based on Sato's distribution semantics. While ProbLog1 focuses on calculating the success probability of a query, ProbLog2 can calculate both conditional probabilities and MPE states.
• PyMC is a Python module that implements a suite of MCMC algorithms as Python classes, and is extremely flexible and applicable to a large suite of problems. PyMC includes methods for summarizing output, plotting, goodness-of-fit and convergence diagnostics.
• R2 is a probabilistic programming system that employs powerful techniques from programming language design, program analysis and verification for scalable and efficient inference.
• Stan exposes a language for defining probability density functions for probabilistic models. Stan includes a compiler, which produces C++ code that performs Bayesian inference via a method similar to Hamiltonian Monte Carlo sampling.
• Venture is an interactive, Turing-complete, higher-order probabilistic programming platform that aims to be sufficiently expressive, extensible and efficient for general-purpose use. Its virtual machine supports multiple scalable, reprogrammable inference strategies, plus two front-end languages: VenChurch and VentureScript.
• Various commercial packages, packages for MATLAB and Mathematica, carda

Is there such a thing as "probabilistic programming"?

Traditional programming | Probabilistic programming
Computers can do nothing to begin with. | Computers can learn autonomously.
The developer is all-knowing. | The developer becomes a trainer.
A program is strongly structured internally, or should be: classes, modules, components, functions, tiers, bus, service, rules, workflow, … | A graph relates random variables to one another. System modeling is above all about identifying the relevant random variables.
A program is an unambiguous, deterministic description of the system's behavior for every input. | The computer learns through training. The end result is not specified.
Logic, unambiguity, determinism | Data quality, sample size, etc.
Specification, testing, versions, maintenance, application lifecycle | Faulty behavior is inherent; debugging, lifecycle?
Mature, learnable | Emerging, no best practices, etc.

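Probabilistic Foundations
• Probability space $(A, \sigma(A), P)$
• $A \in \sigma(A)$, $\emptyset \in \sigma(A)$; $\sigma(A)$ is closed under countable union and complement
• $P: \sigma(A) \to \mathbb{R}$, $P(\emptyset) = 0$, $P(A) = 1$
• $P(\alpha \cup \beta) = P(\alpha) + P(\beta)$ if $\alpha \cap \beta = \emptyset$
Example, a die: $A = \{1, 2, 3, 4, 5, 6\}$, $\sigma(A)$ = the set of all subsets of $A$, $P(\alpha) = |\alpha| / 6$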
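The die example can be written down directly in Scala. The following is a minimal sketch using only the standard library; the object and method names (`DiceSpace`, `p`, `additiveOn`) are illustrative assumptions and are not taken from the slides or from any library.

```scala
// Sketch of a finite probability space for a fair die (illustrative names).
object DiceSpace {
  // Sample space A = {1, ..., 6}
  val sampleSpace: Set[Int] = (1 to 6).toSet

  // sigma(A) is the power set of A: every subset is an event.
  val events: Set[Set[Int]] = sampleSpace.subsets().toSet

  // P(alpha) = |alpha| / 6
  def p(event: Set[Int]): Double = event.size.toDouble / sampleSpace.size

  // Additivity check: P(a ∪ b) = P(a) + P(b) whenever a ∩ b = ∅.
  // Returns true trivially if the two events overlap.
  def additiveOn(a: Set[Int], b: Set[Int]): Boolean =
    (a & b).nonEmpty || math.abs(p(a | b) - (p(a) + p(b))) < 1e-12
}
```

For instance, `DiceSpace.p(Set(2, 4, 6))` evaluates the probability of rolling an even number, and `DiceSpace.additiveOn(Set(1, 2), Set(5))` checks additivity for two disjoint events.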

Bayes
• Random variables X, Y, Z, …
• val X: σ(A)
• Probability distribution over all X, Y, Z, …: P(X, Y, Z, …)
• Bayes, conditional probability: $P(X \mid Y) = \frac{P(X \cap Y)}{P(Y)}$, $P(Y) = \int dX \, P(X, Y)$
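Applying the definition of conditional probability in both directions gives Bayes' theorem, a standard result written here with the same symbols:

```latex
P(X \cap Y) = P(X \mid Y)\,P(Y) = P(Y \mid X)\,P(X)
\quad\Longrightarrow\quad
P(X \mid Y) = \frac{P(Y \mid X)\,P(X)}{P(Y)}, \qquad P(Y) > 0 .
```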

Graphical Representation
• Chain rule: P(G,W,A,B) = P(G|W,A,B) * P(W|A,B) * P(A|B) * P(B)
• But with the graph structure: P(G,W,A,B) = P(G|A) * P(W|A) * P(A|B) * P(B)
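The factorized form can be sketched in plain Scala. The slide does not give any numbers or say what G, W, A and B stand for, so all four are treated as Boolean variables here and every probability below is a made-up illustration value; the names `FactorizedJoint`, `bern` and `joint` are likewise assumptions.

```scala
// Sketch: joint distribution built from the factorization
// P(G,W,A,B) = P(G|A) * P(W|A) * P(A|B) * P(B).
// All numbers are made-up illustration values.
object FactorizedJoint {
  val pB: Double = 0.01                                                 // P(B = true)
  val pAgivenB: Map[Boolean, Double] = Map(true -> 0.95, false -> 0.02) // P(A = true | B)
  val pGgivenA: Map[Boolean, Double] = Map(true -> 0.9, false -> 0.05)  // P(G = true | A)
  val pWgivenA: Map[Boolean, Double] = Map(true -> 0.7, false -> 0.1)   // P(W = true | A)

  // Probability of a Boolean value given the probability of `true`.
  private def bern(pTrue: Double, value: Boolean): Double =
    if (value) pTrue else 1.0 - pTrue

  def joint(g: Boolean, w: Boolean, a: Boolean, b: Boolean): Double =
    bern(pGgivenA(a), g) * bern(pWgivenA(a), w) * bern(pAgivenB(b), a) * bern(pB, b)

  private val bools = List(true, false)

  // Sum over all 16 assignments; equals 1 up to rounding.
  def total: Double =
    (for (g <- bools; w <- bools; a <- bools; b <- bools) yield joint(g, w, a, b)).sum
}
```

With four Boolean variables the unrestricted joint needs 15 free parameters, while this factorization needs 7 (one for B and two for each of the three conditional tables), which is the practical gain from the graph structure.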

Figaro
• Avi Pfeffer, Charles River Analytics
• Scala library
• Good documentation
• Manning book in progress (2015)
• Comprehensive, extensible
• Name: the author composes classical music

Queries, Evidence, Learning
• $Q = Q_1, Q_2, \dots$
• $E = E_1, E_2, \dots$
• $U = U_1, U_2, \dots$
• $P(Q, U, E)$: probability distribution
• $P(Q) = \frac{1}{N} \sum_{U} P(Q, U, E = e_1, e_2, \dots)$
• $N = \sum_{Q, U} P(Q, U, E = e_1, e_2, \dots)$
• Most probable explanation
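The two sums can be computed by brute-force enumeration on a small model. The sketch below reads Q as the query variables, E as the evidence and U as the remaining unobserved variables (an interpretation of the slide's symbols), and applies them to a four-variable Boolean network with query B, evidence (G, W) and unobserved A. All probabilities are made-up illustration values, and the names are assumptions, not Figaro API.

```scala
// Sketch: inference by enumeration, P(Q) = (1/N) * sum over U of P(Q, U, E = e).
// Query Q = B, evidence E = (G, W), unobserved U = A. Numbers are made up.
object EnumerationInference {
  private def bern(pTrue: Double, value: Boolean): Double =
    if (value) pTrue else 1.0 - pTrue

  // Joint P(G,W,A,B) = P(G|A) * P(W|A) * P(A|B) * P(B)
  def joint(g: Boolean, w: Boolean, a: Boolean, b: Boolean): Double =
    bern(if (a) 0.9 else 0.05, g) *
      bern(if (a) 0.7 else 0.1, w) *
      bern(if (b) 0.95 else 0.02, a) *
      bern(0.01, b)

  private val bools = List(true, false)

  // Distribution of B given the evidence G = g, W = w.
  def posteriorB(g: Boolean, w: Boolean): Map[Boolean, Double] = {
    // Sum out the unobserved variable A for each value of B.
    val unnormalized = bools.map(b => b -> bools.map(a => joint(g, w, a, b)).sum).toMap
    // N = sum over Q and U of P(Q, U, E = e)
    val n = unnormalized.values.sum
    unnormalized.map { case (b, p) => b -> p / n }
  }

  // Most probable explanation: the assignment (a, b) to all non-evidence
  // variables that maximizes the joint under the given evidence.
  def mostProbableExplanation(g: Boolean, w: Boolean): (Boolean, Boolean) = {
    val candidates = for (a <- bools; b <- bools) yield (a, b)
    candidates.maxBy { case (a, b) => joint(g, w, a, b) }
  }
}
```

Enumeration is exponential in the number of unobserved variables, so it only serves to make the formulas concrete; libraries such as Figaro provide algorithms intended for larger models.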

Information, Decisions, Utility
• Information link: information that feeds into decisions
• Decisions
• Utility functions
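How a utility function turns a belief into a decision can be sketched as choosing the decision with the highest expected utility. The states, decisions and numbers below are hypothetical illustration values, and the belief stands in for whatever distribution the model delivers through an information link.

```scala
// Sketch: pick the decision that maximizes expected utility.
// States, decisions and all numbers are hypothetical.
object ExpectedUtility {
  // Belief over an uncertain state, e.g. a posterior computed by the model.
  val belief: Map[String, Double] = Map("fault" -> 0.2, "ok" -> 0.8)

  // Utility function U(decision, state).
  val utility: Map[(String, String), Double] = Map(
    ("repair", "fault") -> 50.0,
    ("repair", "ok") -> -10.0,
    ("ignore", "fault") -> -100.0,
    ("ignore", "ok") -> 0.0
  )

  val decisions: List[String] = List("repair", "ignore")

  // EU(d) = sum over states s of P(s) * U(d, s)
  def expectedUtility(decision: String): Double =
    belief.toList.map { case (state, p) => p * utility((decision, state)) }.sum

  def bestDecision: String = decisions.maxBy(expectedUtility)
}
```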

rcs
• Consulting, development
• Training
• In-house development: carda (pre-alpha)
- Focus on very large systems, cloud, web
- Parallelizability and distributability
- Publishing of models
- Learning to decide versus learning a model

Literature
• Risk Assessment and Decision Analysis with Bayesian Networks, N. Fenton, M. Neil, CRC Press
• Bayesian Reasoning and Machine Learning, D. Barber, Cambridge
• Probabilistic Graphical Models, D. Koller, N. Friedman, MIT Press
• Modeling and Reasoning with Bayesian Networks, A. Darwiche, Cambridge
