quancol . ........ . . ... . ... ... ... ... ... ... - PowerPoint PPT Presentation

Probabilistic Programming of Biology Jane Hillston Joint work with Anastasis Georgoulas and Guido Sanguinetti School of Informatics, University of Edinburgh December 2015 quan�col . ........ . . ... . ... ... ... ... ... ... Hillston Dagstuhl 15491 1 / 29

Outline Introduction 1 Probabilistic Programming 2 ProPPA 3 Inference 4 Conclusions 5 Hillston Dagstuhl 15491 2 / 29

Modelling There are two approaches to model construction: Machine Learning: extracting a model from the data generated by the system, or refining a model based on system behaviour using statistical techniques. Mechanistic Modelling: starting from a description or hypothesis, construct a model that algorithmically mimics the behaviour of the system, validated against data. Hillston Dagstuhl 15491 4 / 29

Machine Learning prior inference posterior data Hillston Dagstuhl 15491 5 / 29

Machine Learning prior inference posterior data Bayesian statistics Represent belief and uncertainty as probability distributions (prior, posterior). Treat parameters and unobserved variables similarly. Bayes’ Theorem: P ( θ | D ) = P ( θ ) · P ( D | θ ) P ( D ) posterior ∝ prior · likelihood Hillston Dagstuhl 15491 5 / 29

Mechanistic modelling Models are constructed reflecting what is known about the components of the biological system and their behaviour. A variety of formal modelling techniques from theoretical computer science have been proposed to capture the system behaviour. These are then compiled into executable models 1 which can be run to deepen understanding of the model. Executing the model generates data that can be compared with biological data. 1 Jasmin Fisher, Thomas A. Henzinger: Executable cell biology . Nature Biotechnology 2007 Hillston Dagstuhl 15491 6 / 29

Comparing the techniques Data-driven modelling: + rigorous handling of parameter uncertainty - limited or no treatment of stochasticity - in many cases bespoke solutions are required which can limit the size of system which can be handled Hillston Dagstuhl 15491 7 / 29

Comparing the techniques Data-driven modelling: + rigorous handling of parameter uncertainty - limited or no treatment of stochasticity - in many cases bespoke solutions are required which can limit the size of system which can be handled Mechanistic modelling: + general execution ”engine” (deterministic or stochastic) can be reused for many models + models can be used speculatively to investigate roles of parameters, or alternative hypotheses - parameters are assumed to be known and fixed Hillston Dagstuhl 15491 7 / 29

Comparing the techniques Data-driven modelling: + rigorous handling of parameter uncertainty - limited or no treatment of stochasticity - in many cases bespoke solutions are required which can limit the size of system which can be handled Mechanistic modelling: + general execution ”engine” (deterministic or stochastic) can be reused for many models + models can be used speculatively to investigate roles of parameters, or alternative hypotheses - parameters are assumed to be known and fixed Probabilistic Programming seeks to bring elements of both forms of modelling together. Hillston Dagstuhl 15491 7 / 29

Probabilistic programming A way to express probabilistic models in a high level language, like software code. Offers automated inference without the need to write bespoke solutions. Platforms: IBAL, Church, Infer.NET, Fun, ... Key actions: specify a distribution, specify observations, infer posterior distribution. Hillston Dagstuhl 15491 9 / 29

Probabilistic Process Algebra What if we could... include information about uncertainty in the model? automatically use observations to refine this uncertainty? do all this in a formal context? Starting from an existing process algebra (Bio-PEPA), we have developed a new language ProPPA that addresses these issues. 2 2 Anastasis Georgoulas, Jane Hillston, Dimitrios Milios, Guido Sanguinetti: Probabilistic Programming Process Algebra . QEST 2014: 249-264. Hillston Dagstuhl 15491 10 / 29

Stochastic Process Algebra In a stochastic process algebra actions (reactions) not only have a name or type, but also a stochastic duration or rate. Hillston Dagstuhl 15491 12 / 29

Stochastic Process Algebra In a stochastic process algebra actions (reactions) not only have a name or type, but also a stochastic duration or rate. The language may be used to generate a Markov Process (CTMC). SOS rules LABELLED state transition SPA CTMC Q TRANSITION ✲ ✲ MODEL diagram SYSTEM Q is the infinitesimal generator matrix characterising the CTMC. Hillston Dagstuhl 15491 12 / 29

Stochastic Process Algebra In a stochastic process algebra actions (reactions) not only have a name or type, but also a stochastic duration or rate. The language may be used to generate a Markov Process (CTMC). SOS rules LABELLED state transition SPA CTMC Q TRANSITION ✲ ✲ MODEL diagram SYSTEM Q is the infinitesimal generator matrix characterising the CTMC. Models are typically executed by simulation using Gillespie’s Stochastic Simulation Algorithm (SSA) or similar. Hillston Dagstuhl 15491 12 / 29

The Bio-PEPA abstraction Each species i is described by a species component C i Hillston Dagstuhl 15491 13 / 29

The Bio-PEPA abstraction Each species i is described by a species component C i Each reaction j is associated with an action type α j and its dynamics is described by a specific function f α j Hillston Dagstuhl 15491 13 / 29

The Bio-PEPA abstraction Each species i is described by a species component C i Each reaction j is associated with an action type α j and its dynamics is described by a specific function f α j The species components are then composed together to describe the behaviour of the system. Hillston Dagstuhl 15491 13 / 29

The Bio-PEPA abstraction Each species i is described by a species component C i Each reaction j is associated with an action type α j and its dynamics is described by a specific function f α j The species components are then composed together to describe the behaviour of the system. The semantics is defined by two transition relations: First, a capability relation — is a transition possible? Second, a stochastic relation — gives rate of a transition, derived from the parameters of the model. The result is a Continuous Time Markov Chain (CTMC) Hillston Dagstuhl 15491 13 / 29

A Probabilistic Programming Process Algebra: ProPPA ProPPA aims to retain the features of the stochastic process algebra: simple model description in terms of components rigorous semantics giving an executable version of the model... Hillston Dagstuhl 15491 14 / 29

A Probabilistic Programming Process Algebra: ProPPA ProPPA aims to retain the features of the stochastic process algebra: simple model description in terms of components rigorous semantics giving an executable version of the model... ... whilst also incorporating features of a probabilistic programming language: recording uncertainty in the parameters ability to incorporate observations into models accss to inference to update uncertainty based on observations Hillston Dagstuhl 15491 14 / 29

Example S S I I S stop1 R spread stop2 R k_s = 0.5; k_r = 0.1; kineticLawOf spread : k_s * I * S; kineticLawOf stop1 : k_r * S * S; kineticLawOf stop2 : k_r * S * R; I = (spread,1) ↓ ; S = (spread,1) ↑ + (stop1,1) ↓ + (stop2,1) ↓ ; R = (stop1,1) ↑ + (stop2,1) ↑ ; I[10] ⊲ ⊳ S[5] ⊲ ⊳ R[0] ∗ ∗ Hillston Dagstuhl 15491 15 / 29

Additions Declaring uncertain parameters: k s = Uniform(0,1); k t = Gaussian(0,1); Providing observations: observe(’trace’) Specifying inference approach: infer(’ABC’) Hillston Dagstuhl 15491 16 / 29

Additions S S I I S stop1 R spread stop2 R k_s = Uniform(0,1); k_r = Uniform(0,1); kineticLawOf spread : k_s * I * S; kineticLawOf stop1 : k_r * S * S; kineticLawOf stop2 : k_r * S * R; I = (spread,1) ↓ ; S = (spread,1) ↑ + (stop1,1) ↓ + (stop2,1) ↓ ; R = (stop1,1) ↑ + (stop2,1) ↑ ; I[10] ⊲ ⊳ S[5] ⊲ ⊳ R[0] ∗ ∗ observe(’trace’) infer(’ABC’) //Approximate Bayesian Computation Hillston Dagstuhl 15491 17 / 29

k = 2 parameter model CTMC Hillston Dagstuhl 15491 18 / 29

k ∈ [0,5] parameter model set of CTMCs Hillston Dagstuhl 15491 18 / 29

k ∼ p parameter model μ distribution over CTMCs A ProPPA model should be mapped to something like a distribution over CTMCs – a Probabilistic Constraint Markov Chain. Hillston Dagstuhl 15491 18 / 29

quancol . ........ . . ... . ... ... ... ... ... ... - PowerPoint PPT Presentation

Probabilistic Programming of Biology Jane Hillston Joint work with Anastasis Georgoulas and Guido Sanguinetti School of Informatics, University of Edinburgh December 2015 quancol . ........ . . ... . ... ... ... ... ... ...

T3 PRODUCT PRESENTATION August 2016 COMPARISON DIFFERENT BATTERY CHEMISTRY TYPES Cell Quan@ty

Modeling Immigration with a Discrete Dynamical System Eugene Quan April 28, 2006 Eugene Quan

Introduction to self-similar growth-fragmentations Quan Shi CIMAT, 11-15 December, 2017 Quan Shi

Improved Heuristics for Short Linear Programs Thomas Peyrin Quan Quan Tan Nanyang Technological

Texas Military Summit Texas Commanders Council CAPT Jon Townsend Naval Air Station Joint Reserve

Session 12: git Citations and Slides in Markdown P . S. Langeslag 17 January 2019 No Headings

Pr oj ect Logi - col [ i ] Pr oj ect Logi - col [ i ] Adventures in genetic logic 2008 NINT

A WARFIGHTERS PERSPECTIVE A WARFIGHTERS PERSPECTIVE COL AL KELLY COL AL KELLY 1-17 th INFANTRY

Fukushima et al., Supplemental Figure S1 A Col - 0 +MV0.01 M +MV0.05 M +MV0.1 M MS B

CSC Orienteering Lt-Col Lars EW Gerhardsson PCSC Orienteering 03MAY17 Three Disciplines Relay

Group Presentation Resource Person MATHISHA HEWAVITHARANA MBA (Col),BBA Sp.Mktng (Col), PPG DIP.

Transformation Brief COL Mary Garr COL Mary Garr Commander, US Army Garrison, Fort Sam Houston

The grid program: N = input('Enter a number: '); for Row = 1:N for Col = 1:N fprintf('+--');

Col Joe Boyle Col Joe Boyle Associate Director, Propulsion Associate Director, Propulsion Air

Pai ain n th the e City ty Col olorfull orfully: y: Lo Locatio ation n Visualizat

Rapid Improvement Team (RIT) Initiative with Lessons Learned from GTN 21 Lt Col Stewart Laing,

The thermodynamics of cellular computation Sourjik and Wingreen (2012) Cur. Opinions in Cell Bio.

Xiaolan Wang Xin Luna Dong Alexandra Meliou U NIVERSITY OF M ASSACHUSETTS , A MHERST College

Guidelines for Managing Unique Resource Identifiers Prepared by the iDigBio Information Technology

CMOS Switched-Capacitor Circuits: Recent Advances in Bio-Medical and RF Applications David J.

Interactive applications on HPC systems Erich Birngruber (erich.birngruber@gmi.oeaw.ac.at, @ebirn)

GWAS and prior knowledge to uncover gene-gene interac7ons

1 Theoretical foundations: Mass Action Law Theoretical foundations: Mass Conservation Law RATE IS

State of the Industry A view from the West David Thomas Director, Industry Research &

quancol . ........ . . ... . ... ... ... ... ... ... - PowerPoint PPT Presentation

Probabilistic Programming of Biology Jane Hillston Joint work with Anastasis Georgoulas and Guido Sanguinetti School of Informatics, University of Edinburgh December 2015 quancol . ........ . . ... . ... ... ... ... ... ...

T3 PRODUCT PRESENTATION August 2016 COMPARISON DIFFERENT BATTERY CHEMISTRY TYPES Cell Quan@ty

Modeling Immigration with a Discrete Dynamical System Eugene Quan April 28, 2006 Eugene Quan

Introduction to self-similar growth-fragmentations Quan Shi CIMAT, 11-15 December, 2017 Quan Shi

Improved Heuristics for Short Linear Programs Thomas Peyrin Quan Quan Tan Nanyang Technological

Texas Military Summit Texas Commanders Council CAPT Jon Townsend Naval Air Station Joint Reserve

Session 12: git Citations and Slides in Markdown P . S. Langeslag 17 January 2019 No Headings

Pr oj ect Logi - col [ i ] Pr oj ect Logi - col [ i ] Adventures in genetic logic 2008 NINT

A WARFIGHTERS PERSPECTIVE A WARFIGHTERS PERSPECTIVE COL AL KELLY COL AL KELLY 1-17 th INFANTRY

Fukushima et al., Supplemental Figure S1 A Col - 0 +MV0.01 M +MV0.05 M +MV0.1 M MS B

CSC Orienteering Lt-Col Lars EW Gerhardsson PCSC Orienteering 03MAY17 Three Disciplines Relay

Group Presentation Resource Person MATHISHA HEWAVITHARANA MBA (Col),BBA Sp.Mktng (Col), PPG DIP.

Transformation Brief COL Mary Garr COL Mary Garr Commander, US Army Garrison, Fort Sam Houston

The grid program: N = input('Enter a number: '); for Row = 1:N for Col = 1:N fprintf('+--');

Col Joe Boyle Col Joe Boyle Associate Director, Propulsion Associate Director, Propulsion Air

Pai ain n th the e City ty Col olorfull orfully: y: Lo Locatio ation n Visualizat

Rapid Improvement Team (RIT) Initiative with Lessons Learned from GTN 21 Lt Col Stewart Laing,

The thermodynamics of cellular computation Sourjik and Wingreen (2012) Cur. Opinions in Cell Bio.

Xiaolan Wang Xin Luna Dong Alexandra Meliou U NIVERSITY OF M ASSACHUSETTS , A MHERST College

Guidelines for Managing Unique Resource Identifiers Prepared by the iDigBio Information Technology

CMOS Switched-Capacitor Circuits: Recent Advances in Bio-Medical and RF Applications David J.

Interactive applications on HPC systems Erich Birngruber (erich.birngruber@gmi.oeaw.ac.at, @ebirn)

GWAS and prior knowledge to uncover gene-gene interac7ons

1 Theoretical foundations: Mass Action Law Theoretical foundations: Mass Conservation Law RATE IS

State of the Industry A view from the West David Thomas Director, Industry Research &amp;

State of the Industry A view from the West David Thomas Director, Industry Research &