Engineering Genetic Circuits
Chris J. Myers
Lecture 2: Learning Models
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 1 / 98
Engineering Genetic Circuits Chris J. Myers Lecture 2: Learning - - PowerPoint PPT Presentation
Engineering Genetic Circuits Chris J. Myers Lecture 2: Learning Models Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 1 / 98 Johann Von Neumann The sciences do not try to explain, they hardly even try to interpret,
Chris J. Myers
Lecture 2: Learning Models
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 1 / 98
The sciences do not try to explain, they hardly even try to interpret, they mainly make models. By a model is meant a mathematical construct which, with the addition of certain verbal interpretations, describes observed
mathematical construct is solely and precisely that it is expected to work.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 2 / 98
There are many methods for predicting the future. For example, you can read horoscopes, tea leaves, tarot cards, or crystal balls. Collectively, these methods are known as “nutty methods.” Or you can put well-researched facts into sophisticated computer models, more commonly referred to as “a complete waste of time.”
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 3 / 98
Models are to be used, not believed.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 4 / 98
First step of engineering approach is to use results form experiments to construct a mathematical model for the system of interest. DNA microarrays can measure expression levels of thousands of mRNA targets simultaneously. Today 100s of samples may be run in a microarray experiment. Given this data and an abstract class of potential models for various network configurations, how can we decide the most likely network configuration that generated this data? Even largest experiments do not provide enough samples for high statistical significance. Current technology has high noise to signal ratio.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 5 / 98
Experimental methods Experimental data Cluster analysis Learning Bayesian networks Learning causal networks Experimental design
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 6 / 98
One of the most direct ways to see what is happening within a genetic circuit is to add a reporter gene that codes for a fluorescent protein. One example is green fluorescent protein (GFP) which comes from a jellyfish and fluoresces green when exposed to blue light. Since fluorescent proteins are typically not harmful to a cell, it allows the experimenter to observe the workings of a genetic circuit in a living cell. The downside is that there is only a small number of colors available making it impossible to observe more than a few genes at a time.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 7 / 98
(Courtesy of Roger Tsien)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 8 / 98
(Courtesy of Guet et al., Science 2002)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 9 / 98
(Courtesy of Michael Elowitz)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 10 / 98
In 1995, DNA microarrays were developed to allow the expression level of thousands of genes to be monitored simultaneously. They have led to an explosion in the amount of biological data generated. DNA microarrays are chips made of glass, plastic, or silicon. These chips have an array of thousands (or 10s of thousands) of single-stranded complementary DNA (cDNA) probes of about 20 bases. When strands of mRNA extracted from cells during an experiment are hybridized to complementary probes, they flouresce.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 11 / 98
(Courtesy of National Human Genome Research Institute).
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 12 / 98
Used to measure gene expression levels which are correlated with the levels of mRNA present. During an experiment, several microarray measurements can be taken at different times to generate time series data. This provides data on changes in expression patterns over time in response to a stimulus. While microarrays allow substantially more genes to be observed than with fluorescent proteins, cells must be destroyed to extract their mRNA. Therefore, time series data is for population rather than single cell. Another limitation is that expression level does not precisely track protein levels since they are dependent on degradation rates and other factors.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 13 / 98
There are about 25,000 genes in the human genome, but there may be more than 500,000 proteins in the human proteome. Many transcripts must result in many different proteins through alternative splicing and other post-transcriptional modifications. Further complicates ability of DNA microarray results to give accurate estimate of protein levels. The goal of proteomic experimental techniques is to determine the full set
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 14 / 98
(Courtesy of http://www.lecb.ncifcrf.gov/2DgelDataSets/)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 15 / 98
(Courtesy of United States Geological Survey)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 16 / 98
(Courtesy of http://en.wikipedia.org/wiki/Protein_mass_spectrometry)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 17 / 98
Using data from methods just described, one can infer the connectivity of the genetic circuit that generated this data. Experimental techniques can also be employed to determine which proteins interact as well as which proteins serve as transcription factors for particular genes. Two such techniques:
Two-hybrid screening ChIP-on-chip, also known as genome-wide location analysis
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 18 / 98
(Courtesy of http://en.wikipedia.org/wiki/Two-hybrid_screening)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 19 / 98
(Courtesy of Agilent Technologies)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 20 / 98
Model learning begins with experimental data, E. Each data point in E, is a 3-tuple e,τ,ν:
e ∈ N is a natural number representing the experiment number,
τ ∈ R is the time at which the species values were measured, and ν ∈ (R∪{L}∪{H}∪{−})|S| is the state of each species s ∈ S. L and H represent that a species is mutated low or high, respectively. ‘−’ represents an unknown value.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 21 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 22 / 98
Data is discretized into n bins where n is typically 3 or 4. Each bin is a range of values Φj(s) = [θj(s),θj+1(s)) where j = 0 to n−1 and θ(s) = θ0(s),...,θn(s) are levels with θ0(s) = 0 and θn(s) = ∞. A bin assignment, b ∈ {0,..,n − 1,∗}|S|, assigns each s ∈ S to a bin. The notation b(s) indicates the bin assignment for species s in b. A bin assignment of ‘∗’ for s indicates that there is no bin assignment to s. The bin for a ‘∗’ is defined by Φ∗(s) = [0,∞). A bin assignment that includes ∗’s is called a partial bin assignment.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 23 / 98
Time series data values for CIII
50 100 150 200 250 25 50 75 100 Number of times seen Expression level
Experimental data. 45
Experiment 1 Experiment 20
5 10 40 8 10 20 60 8 16 .. .. .. .. .. .. Time CI CII CIII Cro N 100 12 29 35 88
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 25 / 98
Time series data values for CIII
50 100 150 200 250 25 50 75 100 Number of times seen Expression level
Experimental data. 45
Experiment 1 Experiment 20
5 10 40 8 10 20 60 8 16 .. .. .. .. .. .. Time CI CII CIII Cro N 100 12 29 35 88
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 25 / 98
Time series data values for CIII
50 100 150 200 250 25 50 75 100 Number of times seen Expression level
Bin Assignmentsp ..
Experiment 1 Experiment 20
5 1 10 1 .. .. .. .. .. Time CI CII CIII Cro N 100 2 1 1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 25 / 98
Time series data values for CIII
50 100 150 200 250 25 50 75 100 Number of times seen Expression level
Experimental data. 45
Experiment 1 Experiment 20
5 10 40 8 10 20 60 8 16 .. .. .. .. .. .. Time CI CII CIII Cro N 100 12 29 35 88
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 25 / 98
Time series data values for CIII
50 100 150 200 250 25 50 75 100 Number of times seen Expression level
Bin Assignmentsp 2
Experiment 1 Experiment 20
5 10 .. .. .. .. Time CII CIII Cro 100 .. CI 1 2 2 1 1 2 1 .. N
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 25 / 98
Often applied to microarray data taken over a variety of conditions or a series of time points. Assume that genes that are active at the same time are likely involved in the same regulatory process. Also assume that genes are grouped and within a group the genes produce the same expression profile. Due to noise and other uncertainties, groupings are not clear. Goal: determine the original groupings of the genes. Assume that there exists a method to determine the pairwise distance between the expression profiles of any two genes. Many algorithms have been proposed for clustering.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 26 / 98
Partitions N genes into K clusters. Begins with K initial clusters either determined by user or by random. For each cluster, computes its centroid (i.e., the average expression profile of the genes in a cluster). Reassigns each gene to cluster with centroid that is closest to the expression pattern of the gene. Centroids recalculated and process repeats until no change.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 27 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 28 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 29 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 30 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 31 / 98
(Data courtesy of Osterhout et al., BMC Microbiology 2007)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 32 / 98
Begins with N clusters each containing a single gene. Combines two clusters with smallest distance apart where distance is between their average expression profiles. Continues for N − 1 steps at which point all the genes are merged into a hierarchical tree.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 33 / 98
U,V,H,M,L,K,I,J}
(Data courtesy of Osterhout et al., BMC Microbiology 2007)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 34 / 98
Using clustering results, one can potentially determine which genes produce proteins with similar functions. Clustering results do not shed light on how these genes and their protein products interact.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 35 / 98
Given expression data, E, learning techniques allow one to infer the network connectivity that best matches E. Bayesian networks are a promising tool to learn connectivity. A Bayesian network represents a joint probability distribution. Represented with directed acyclic graph, G, whose vertices correspond to random variables, X1, . . . , Xn, for gene expression level. Connections represent dependencies between random variables.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 36 / 98
P(X,Y) is joint distribution over two variables X and Y. X and Y are independent if P(X,Y) = P(X)P(Y) for all values of X and Y (equivalently, P(X|Y) = P(X)). When X and Y are dependent, value of Y gives information about X. Correlation is sufficient but not necessary condition for dependence. When X and Y are dependent, this is represented in the Bayesian network by an arc between them. If the arc is directed from X to Y, X is a parent of Y.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 37 / 98
Associated with each Xi is a conditional distribution, θ, given its parents. Graph G encodes the Markov Assumption, each variable Xi is independent of its non-descendents given its parents in G. This is known as Conditional independence, and graph G implies a set of conditional independence assumptions, Ind(G). Using the Markov assumption, the joint PDF can be decomposed: P(X1,...,Xn)
n
i=1
P(Xi|Pa(Xi)) where Pa(Xi) denotes the parants of Xi.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 38 / 98
A P(A) 0.5 1 0.5 A P(B=1|A) P(B=0|A) 0.1 0.9 1 0.2 0.8 B P(C=1|B) P(C=0|B) 0.75 0.25 1 0.35 0.65 Ind(G) = {A ⊥
P(A,B,C) = P(A)P(B|A)P(C|B) P(A = 1,B = 0,C = 0) = 0.5× 0.8× 0.25 = 0.1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 39 / 98
E P(E) 0.6 1 0.4 A E B=1 B=0 0.1 0.9 1 0.8 0.2 1 0.7 0.3 1 1 0.4 0.6 B C=1 C=0 0.75 0.25 1 0.35 0.65 A P(A) 0.35 1 0.65 A D=1 D=0 0.1 0.9 1 0.8 0.2 A is common cause of B and D. If A not measured, hidden common cause.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 40 / 98
E P(E) 0.6 1 0.4 A E B=1 B=0 0.1 0.9 1 0.8 0.2 1 0.7 0.3 1 1 0.4 0.6 B C=1 C=0 0.75 0.25 1 0.35 0.65 A P(A) 0.35 1 0.65 A D=1 D=0 0.1 0.9 1 0.8 0.2 Ind(G) = {A ⊥
C ⊥
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 40 / 98
E P(E) 0.6 1 0.4 A E B=1 B=0 0.1 0.9 1 0.8 0.2 1 0.7 0.3 1 1 0.4 0.6 B C=1 C=0 0.75 0.25 1 0.35 0.65 A P(A) 0.35 1 0.65 A D=1 D=0 0.1 0.9 1 0.8 0.2 P(A,B,C,D,E) = P(A)P(B|A,E)P(C|B)P(D|A)P(E)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 40 / 98
More than one graph can imply same set of independences. Graphs X → Y and X ← Y both have Ind(G) = /
G and G′ are equivalent if Ind(G) = Ind(G′). Equivalent graphs have same underlying undirected graph but may disagree on direction of some edges. Equivalence class represented by a partially directed graph (PDAG) where edges can be: X → Y, X ← Y, or X—Y.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 41 / 98
A B C
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 42 / 98
A B C D E
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 43 / 98
Given training set of experimental data, E, find a network G,θ that best matches E. Evaluate using Bayesian scoring metric: P(G|E)
P(E|G)P(G) P(E) Score(G : E)
logP(G|E) = logP(E|G)+ logP(G)+ C where C = −logP(E) is constant and P(E|G) = R P(E|G,θ)P(θ|G)dθ is the marginal likelihood. Choice of priors P(G) and P(θ|G) influence the score.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 44 / 98
Given priors and data, learning amounts to finding structure G that maximizes the score. NP-hard so use heuristics like greedy random search. For example, beginning with some initial network, a greedy random search would select an edge to add, delete, or reverse. It would then compute this networks score, and if it is better than the previous network, then it would keep the change. This process is repeated until no improvement is found.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 45 / 98
Number of graphs is super-exponential in number of variables. Sparse candidate algorithm identifies small number of candidate parents for each gene based on local statistics. Pitfall is early choices can overly restrict the search space. Adapting the candidate sets during the search can help.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 46 / 98
By learning Bayesian network, can answer questions like which genes depend on which other genes. Expression level of each gene modeled as a random variable. Need to define local probability model for each variable. Discretize gene expression into 3 categories: significantly lower, similar to, or significantly greater than control. Discretizing can lose information, but more levels can be used if more resolution in experimental data. Control expression level either determined experimentally or the average expression level can be used. Meaning of “significantly” defined by setting threshold to ratio between measured expression and control.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 47 / 98
cIII N cII cro cI Probability 0.05 1 0.18 1 0.06 1 1 0.10 1 0.0 1 1 0.04 1 1 0.02 1 1 1 0.02 1 0.01 1 1 0.05 1 1 0.05 1 1 1 0.02 1 1 0.03 1 1 1 0.0 1 1 1 0.03 1 1 1 1 0.0 cIII N cII cro cI Probability 1 0.0 1 1 0.02 1 1 0.01 1 1 1 0.0 1 1 0.01 1 1 1 0.01 1 1 1 0.01 1 1 1 1 0.0 1 1 0.01 1 1 1 0.01 1 1 1 0.01 1 1 1 0.0 1 1 1 0.20 1 1 1 1 0.02 1 1 1 1 0.02 1 1 1 1 1 0.0
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 48 / 98
N P(N) 0.54 1 0.46 N cIII cII=1 cII=0 0.18 0.82 1 0.48 0.52 1 0.34 0.66 1 1 0.87 0.13
N cII cIII cI cro
N cIII=1 cIII=0 0.12 0.88 1 0.58 0.42 cII cI=1 cI=0 0.66 0.34 1 0.24 0.76 cI cro=1 cro=0 0.43 0.57 1 0.33 0.67 P(cIII,N,cII,cro,N) = P(N)P(cIII|N)P(cII|N,cIII)P(cI|cII)P(cro|cI)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 49 / 98
cIII N cII cro cI Orig BN 0.05 0.08 1 0.18 0.17 1 0.06 0.08 1 1 0.10 0.17 1 0.00 0.04 1 1 0.04 0.01 1 1 0.02 0.04 1 1 1 0.02 0.01 1 0.01 0.03 1 1 0.05 0.06 1 1 0.05 0.03 1 1 1 0.02 0.06 1 1 0.03 0.03 1 1 1 0.00 0.01 1 1 1 0.03 0.03 1 1 1 1 0.00 0.01 cIII N cII cro cI Orig BN 1 0.00 0.01 1 1 0.02 0.01 1 1 0.01 0.01 1 1 1 0.00 0.01 1 1 0.01 0.01 1 1 1 0.01 0.01 1 1 1 0.01 0.01 1 1 1 1 0.00 0.01 1 1 0.01 0.01 1 1 1 0.01 0.01 1 1 1 0.01 0.01 1 1 1 0.00 0.01 1 1 1 0.20 0.10 1 1 1 1 0.02 0.04 1 1 1 1 0.02 0.10 1 1 1 1 1 0.00 0.04
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 50 / 98
N P(N) 0.54 1 0.46 N cIII cII=1 cII=0 0.18 0.82 1 0.48 0.52 1 0.34 0.66 1 1 0.87 0.13
N cII cIII cI cro
N cIII=1 cIII=0 0.12 0.88 1 0.58 0.42 cII cI=1 cI=0 0.66 0.34 1 0.24 0.76 cI cro=1 cro=0 0.43 0.57 1 0.33 0.67 P(cIII,N,cII,cro,N) = P(N)P(cIII|N)P(cII|N,cIII)P(cI|cII)P(cro|cI)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 51 / 98
cIII P(cIII) 0.58 1 0.42 N cIII cII=1 cII=0 0.18 0.82 1 0.48 0.52 1 0.34 0.66 1 1 0.87 0.13
cI cro cII cIII N
cIII N=1 N=0 0.29 0.71 1 0.80 0.20 cII cI=1 cI=0 0.66 0.34 1 0.24 0.76 cI cro=1 cro=0 0.43 0.57 1 0.33 0.67 P(cIII,N,cII,cro,N) = P(cIII)P(N|cIII)P(cII|N,cIII)P(cI|cII)P(cro|cI)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 52 / 98
Difficulty is data is for thousands of genes but often only a few dozen samples, but on positive side, networks typically sparse. A set of plausible networks needs to be considered. May characterize features common in a set of networks. Markov relations: Is Y in the Markov blanket of X? Order relations: Is X an ancestor of Y? (or cause?) Confidence is likelihood that a feature is actually true. confidence(f)
1 m
m
i=1
f(Gi) where m is the number of potential networks considered, Gi is a potential network, and f(Gi) is 1 if f is a feature of Gi and 0 otherwise. Can use bootstrap method to generating potential networks which considers multiple subsets of the experimental data.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 53 / 98
Clustering approaches can only find correlations. Bayesian analysis can potentially discover causal relationships and interactions between genes. Probablistic semantics good for noisy biological systems. Can focus on extracting features rather than find single model. Can assist with experimental design. Bayesian networks though limited to acyclic graphs. Most (if not all) genetic circuits include feedback control.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 54 / 98
A Dynamic Bayesian Networks (DBN) unrolls the cyclic graph T times. Nodes in DBN are random variables X (t)
1 ,...,X (t) n
where t equals 1 to T. The joint PDF can be decomposed as follows: P(X (1)
1 ,...,X (T) n
T
t=1 n
i=1
P(X (t)
i
i
DBNs require time series experimental data.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 55 / 98
P(N(1),cI(1),cII(1),cIII(1),cro(1),N(2),cI(2),cII(2),cIII(2),cro(2)) = P(N(1))P(cI(1))P(cII(1))P(cIII(1))P(cro(1))P(N(2)|cro(1),cI(1)) P(cI(2)|cro(1),cI(1),cII(1))P(cII(2)|N(1),cI(1),cIII(1)) P(cIII(2)|N(1),cI(1),cro(1))P(cro(2)|cI(1),cro(1))
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 56 / 98
A Bayesian network represents correlative relationships, but ultimately we are interested in knowing causal relationships. In a causal network, parents are interpreted as immediate causes. Causal Markov assumption states that given values of variable’s immediate causes, it is independent of earlier causes. Causal networks model not only distribution of observations but also effects of interventions. In causal networks, X → Y and Y → X are not equivalent.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 57 / 98
DBN approaches typically must perform an expensive global search, and they have difficulty learning networks with tight feedback. The method described here uses local analysis to efficiently learn networks with tight feedback. This method determines the likelihood that a gene’s expression increases in the next time point given the current gene expression levels. These likelihoods are then used to determine influences between the genes in the genetic circuit. Result is a directed graph representation of the genetic circuit.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 58 / 98
Number of models = 3|S|2 where |S| is the number of species.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 59 / 98
CI CII CIII Cro N i : r n n u a
CIII u = unknown n = no connection a = activation r = repression Act(i)
Rep(i)
Par(i)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 60 / 98
inc(s)
val(s)
bin(b)
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 61 / 98
1 2 0 1 2 25 50 75 N’s Prob.
CII CIII N’s Prob.
(courtesy of Barker (2007))
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 62 / 98
40
49
70
58
42
38
66
38
26
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 63 / 98
To determine trends, a ratio is formed between two probabilities using two partial bin assignments, b and b′.
The partial bin assignment, b′, is called the base. b′(s) =
if i(s) = ‘n’ if (i(s) = ‘a’∧|Rep(i)| ≤ |Act(i)|)∨
n − 1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 64 / 98
Ratio
40
49 1.23
70 1.75
58 1.45
42 1.05
38 0.95
66 1.65
38 0.95
26 0.65
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 65 / 98
More activating influences (i.e., |Rep(i)| ≤ |Act(i)|).
More repressing influences (i.e., |Rep(i)| > |Act(i)|).
The final score is determined using the following equation: score = vf − va vf + va + vn A score greater than zero indicates support for the vector while a negative score indicates there is not support for the vector.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 66 / 98
Ratio Vote
40
49 1.23 va
70 1.75 va
58 1.45 va
42 1.05 vn
38 0.95 vn
66 1.65 va
38 0.95 vn
26 0.65 vf
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 67 / 98
When scoring an influence vector, i, for species s, the probability of increase can be influenced by the level of s. Similarly, when comparing two influence vectors, i and i′, it is useful to control for the species in i′, when evaluating the score for i and vice versa. In both cases, can partition bins using a control set, G. Now consider all assignments to species in Par(i)∪ G. Base bin assignment agrees with values in b for each member of G. b′(s) =
b(s) if s ∈ G
if i(s) = ‘n’ if (i(s) = ‘a’∧|Rep(i)| ≤ |Act(i)|)
n − 1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 68 / 98
1 2 0 1 2 25 50 75 100 N’s Prob.
CII CIII N’s Prob.
(courtesy of Barker (2007))
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 69 / 98
Ratio Vote
40 base
58 1.45 va
83 2.08 va
67 1.66 va
55 1.37 va
59 1.47 va
100 2.5 va
44 1.09 vn
36 0.90 vn
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 70 / 98
Ratio Vote
55 base
40 0.72 vf
50 0.90 vn
54 0.98 vn
37 0.67 vf
41 0.75 vn
0.00 vf
42 0.76 vn
27 0.49 vf
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 71 / 98
Ratio Vote
27 base
22 0.81 vn
50 1.85 va
30 1.11 vn
28 1.04 vn
28 1.04 vn
100 3.70 va
30 1.11 vn
24 0.88 vn
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 72 / 98
Select initial influence vector set. Combine influence vectors. Compete influence vectors.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 73 / 98
CI CII CIII Cro N u u n u u
CIII
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 74 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
CII
Cro
N
CI
CII
Cro
N
CIII at level 0 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
CII
Cro
N
CI
CII
Cro
N
CIII at level 0 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
CII
Cro
N
CI
CII
Cro
N
CIII at level 0 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
1 CII
Cro
N
CI
CII
Cro
N
CIII at level 0 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
2 CII
Cro
N
CI
CII
Cro
N
CIII at level 0 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
2 CII
Cro
N
CI
CII
Cro
N
CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
2 CII
Cro
N
CI
CII
Cro
N
CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
2 CII
Cro
N
CI
CII
Cro
N
CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
3 CII
Cro
N
CI
CII
Cro
N
CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
4 CII
Cro
N
CI
CII
Cro
N
CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
4 CII
Cro
N
CI
CII
Cro
N
CIII at level 2 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
4 CII
Cro
N
CI
CII
Cro
N
CIII at level 2 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
4 CII
Cro
N
CI
CII
Cro
N
CIII at level 2 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
5 CII
Cro
N
CI
CII
Cro
N
CIII at level 2
5 10 15 20 1 2 CIII’s rising probability (%) CI
CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
6 CII
Cro
N
CI
CII
Cro
N
CIII at level 2 5 10 15 20 1 2 CIII’s rising probability (%) CI CI L0 CI L1 CI L2 L1 / L0 L2 / L0 CIII L0 19.0% 1.7% 1.0% 0.09 0.05 CIII L1 17.1% 2.6% 1.2% 0.15 0.07 CIII L2 11.6% 2.7% 1.1% 0.23 0.09
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 75 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
6 CII
2 3 Cro
N
CI
CII
Cro
N
CIII at level 0 5 10 15 20 1 2 CIII’s rising probability (%) CII CII L0 CII L1 CII L2 L1 / L0 L2 / L0 CIII L0 3.1% 13.7%
4.4% 7.4% 12.6% 1.65 2.83 CIII L2 19.4% 5.5% 5.8% 0.28 0.35
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 76 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
6 CII
2 3 Cro
6 N
CI
CII
Cro
N
CIII at level 2 5 10 15 20 1 2 CIII’s rising probability (%) Cro Cro L0 Cro L1 Cro L2 L1 / L0 L2 / L0 CIII L0 11.55% 1.84% 1.47% 0.16 0.13 CIII L1 14.20% 4.74% 3.10% 0.33 0.22 CIII L2 9.70% 5.02% 4.17% 0.52 0.43
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 77 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
6 CII
2 3 Cro
6 N
5 1 CI
CII
Cro
N
CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) N N L0 N L1 N L2 L1 / L0 L2 / L0 CIII L0 5.4% 2.9% 3.7% 0.53 0.68 CIII L1 9.3% 7.2% 6.7% 0.78 0.71 CIII L2 8.6% 6.4% 6.1% 0.75 0.71
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 78 / 98
CIII’s Influence Vector Scores Influence Vector vf va vn CI
6 CII
2 3 Cro
6 N
5 1 CI
6 CII
3 2 Cro
6 N
5 1 CIII at level 1 5 10 15 20 1 2 CIII’s rising probability (%) N N L0 N L1 N L2 L1 / L0 L2 / L0 CIII L0 5.4% 2.9% 3.7% 0.53 0.68 CIII L1 9.3% 7.2% 6.7% 0.78 0.71 CIII L2 8.6% 6.4% 6.1% 0.75 0.71
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 79 / 98
Keep Discard 1.0 0.75
CIII’s Influence Vector Scores Influence Vector vf va vn Score CI
6 1.0 CII
2 3
Cro
6 1.0 N
5 1 0.833 CI
6
CII
3 2 0.2 Cro
6
N
5 1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 80 / 98
Keep Discard 1.0 0.75
CIII’s Influence Vector Scores Influence Vector vf va vn Score CI
6 1.0 CII
2 3
Cro
6 1.0 N
5 1 0.833 CI
6
CII
3 2 0.2 Cro
6
N
5 1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 80 / 98
Keep Discard 1.0 0.75
CIII’s Influence Vector Scores Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 80 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
1
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s Prob.
CI Cro CIII’s Prob.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
2
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
3
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
4
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
5
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
6
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
7
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
8
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
24 1.0
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 81 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
24 1.0 CI, N
21 3 0.875
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD N CI CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 82 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
24 1.0 CI, N
21 3 0.875
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD N CI CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 82 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
24 1.0 Cro, N
23 1 0.958
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD N Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 83 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
24 1.0 Cro, N
23 1 0.958
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD N Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 83 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score CI
6 1.0 Cro
6 1.0 N
5 1 0.833 CI, Cro
24 1.0
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD N Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 84 / 98
Proteins produced from separate genes can work together to influence genes.
1
Consider Influence Vectors in the set two at a time
2
Merge if their combined score outweighs their individual scores.
3
Remove subsets. Influence Vector vf va vn Score N
5 1 0.833 CI, Cro
24 1.0
1 3 CIII at 0 CIII at 2
1 2 0 1 2 10 20 30 CIII’s RPD N Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 84 / 98
Set levels of species that activate or repress to reduce correlation effects
Select a pair of influence vectors and obtain votes for both vectors in the combined state space. Discard the influence vector with the most neutral score. Influence Vector vf va vn Score N
12 27 15 0.27 CI, Cro
1 2 10 20 30 5 15 25 N 1 27 CIII’s Prob. of Incr.
0,∗,0,0,j 2,∗,2,2,j
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 86 / 98
Set levels of species that activate or repress to reduce correlation effects
Select a pair of influence vectors and obtain votes for both vectors in the combined state space. Discard the influence vector with the most neutral score. Influence Vector vf va vn Score N
12 27 15 0.27 CI, Cro
71 1 0.98
1 9 N, CIII at <0,0> N, CIII at <2,2>
1 2 0 1 2 10 20 30 CIII’s RPD CI Cro CIII’s RPD Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 88 / 98
Set levels of species that activate or repress to reduce correlation effects
Select a pair of influence vectors and obtain votes for both vectors in the combined state space. Discard the influence vector with the most neutral score. Influence Vector vf va vn Score N
12 27 15 0.27 CI, Cro
71 1 0.98
N CII CI CIII Cro
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 90 / 98
Set levels of species that activate or repress to reduce correlation effects
Select a pair of influence vectors and obtain votes for both vectors in the combined state space. Discard the influence vector with the most neutral score. Influence Vector vf va vn Score N
12 27 15 0.27 CI, Cro
71 1 0.98
N CII CI CIII Cro
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 90 / 98
Set levels of species that activate or repress to reduce correlation effects
Select a pair of influence vectors and obtain votes for both vectors in the combined state space. Discard the influence vector with the most neutral score. Influence Vector vf va vn Score CI, Cro
71 1 0.98
N CII CI CIII Cro
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 90 / 98
CI CII CIII Cro N n a n n n
CI r n n n n
CII r n n r n
CIII r n n n n
Cro r n n r n
N
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 91 / 98
Learning method considers and discards many potential models. Alternative models may actually be correct and mistakenly discarded. It may be useful to design an experiment that would provide data to either support the selected model or potentially an alternative model.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 92 / 98
P1
P1
P1
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 93 / 98
Genetic Circuit
Host
Experiments
Experiments
Plasmid
Data
Knowledge
Sequence
Simulation
Equations
Experiments
Data
Analysis Synthesis
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 94 / 98
Genetic Circuit
Host
Simulations
Simulations
Plasmid
Data
Knowledge
Sequence
Simulation
Equations
Simulations
Data
Analysis Synthesis
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 95 / 98
L Note: the 7 Genes N-T are unconnected A C B G J F D M E I H K
J A B C G D H E F I
48 Networks inspired from Guet et al. 10 Random 10 Gene Networks 10 Yu et al. 20 Gene Networks
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 96 / 98
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 GeneNet Yu’s DBN tool
BioSim wins in 57 and ties in 7 of the 68 cases.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 97 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 97 / 98
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 GeneNet Yu’s DBN tool
BioSim wins in 17 and ties in 13 of the 47 cases. Note that Yu’s method found no arcs in 21 cases.
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 97 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 97 / 98
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 97 / 98
Experimental methods:
Berg et al. (2002), Watson et al. (2003), and Alberts et al. (2002).
http://en.wikipedia.org. Cluster analysis:
Idea of cluster analysis - Tryon (1939) K-means clustering - MacQueen (1967) Hierarchical clustering - Johnson (1967) Cluster analysis of gene expression data - Tavozoie et al. (1999), Eisen et
Learning Bayesian networks:
Tutorial - Heckerman (1996) Application to biological systems - Friedman et al. (2000), Hartemink et al. (2001), Pe’er (2005), Sachs et al. (2005), etc. DBNs - Ong et al. (2002), Husmeier et al. (2003), Nachman et al. (2004), Yu et al. (2004), Bernard and Hartemink (2005), Beal et al. (2005), etc.
Learning Causal networks:
Barker et al., (2006, 2007, and 2010).
Chris J. Myers (Lecture 2: Learning Models) Engineering Genetic Circuits 98 / 98