Team:HZAU-China/testmath
From 2014.igem.org
Line 220: | Line 220: | ||
\end{equation} | \end{equation} | ||
- | Hence, $r_c( | + | Hence, $r_c($$2Cre\cdot loxP)$$dot r_c($$2Cre\cdot lox72)$$<r_c(2$$Cre\cdot lox66)$$\cdot r_c($$2Cre\cdot lox71)$, which means the synapsis between lox66 and lox71 is easy to form. |
This explanation is based on the kinetic model proposed in 1998. However, the real system may have additional processes. We introduce another explanation using a principle called kinetic proofreading that is widely employed to achieve high precision in diverse molecular recognition systems. Inspired by the DNA binding process described in available researches(Reardon and Sancar, 2004; Tlusty et al., 2004; Alon, 2007), we infer that the recombinase protein Cre undergoes a modification after binding to the half of lox site, and then it can recruit other proteins Cre to bind to another half of lox site. Such a modification can help the mutant lox site like lox66 and lox71 easily jump to the next state but prevent the incorrect binding of the double mutant lox site like lox72. This modification makes the second binding event more likely to happen and the corresponding dissociation event less likely to happen. So why $k_9>k_8$ and $k_{-9}<k_{-8}$ can also be explained by this modification. | This explanation is based on the kinetic model proposed in 1998. However, the real system may have additional processes. We introduce another explanation using a principle called kinetic proofreading that is widely employed to achieve high precision in diverse molecular recognition systems. Inspired by the DNA binding process described in available researches(Reardon and Sancar, 2004; Tlusty et al., 2004; Alon, 2007), we infer that the recombinase protein Cre undergoes a modification after binding to the half of lox site, and then it can recruit other proteins Cre to bind to another half of lox site. Such a modification can help the mutant lox site like lox66 and lox71 easily jump to the next state but prevent the incorrect binding of the double mutant lox site like lox72. This modification makes the second binding event more likely to happen and the corresponding dissociation event less likely to happen. So why $k_9>k_8$ and $k_{-9}<k_{-8}$ can also be explained by this modification. |
Revision as of 11:45, 11 October 2014
Our project aims to engineer cells that can process information to adapt to the environment as we expected, so we need to yield quantitative predictions of gene behaviors. Mathematic model becomes a powerful tool here. It not only helps us analyze the phenomena, but also contributes to the design of genetic circuits. Our models can be divided into several parts. Firstly, we will introduce the biological processes in our design and translate them into mathematic language. Then we will make some comparisons to explain why we designed circuits this way. Next, we will present the simulation of deterministic model and stochastic model. Parameter sensitivity analysis will also be made here. Finally, we will talk about the design principle of the rewirable circuit using ODE sets with matrices. We hope that any other researches can get insight from the mathematic model and make the rewirable circuit widely used. \section{Biological processes}
In this part, we will list the important biological processes and explain how the related molecules work, and then we can describe them using some equations. Because the timescales of many processes are separated, we use a quasi-steady-state approximation (QSSA) to reduce the number of dimensions in the systems in most cases. However, if we focus on some processes more than the final steady state, we may use other approximations such like prefactor method (Bennett et al., 2007). \subsection{Transcription and translation}
According to the Central Dogma, some DNAs can be transcribed into RNAs and some RNAs can be translated into proteins. These procedures sometimes can be regulated by some molecules like transcription factors or non-coding RNAs. The interactions among them can be understood by chemical reactions. We use reaction (1) to depict the interaction between transcription factor $TF$ (sometimes it forms a polymer, we omit this step) and inducible promoter $Pro$. \begin{equation} K_D=\frac{k_{-1}}{k_1}=\frac{[TF]^n\cdot [Pro]}{[Pro']}. \end{equation}
The equilibrium dissociation constant $K_D$ for the reaction can be calculated. \begin{equation} K_D=\frac{k_{-1}}{k_1}=\frac{[TF]^n\cdot [Pro]}{[Pro']}. \end{equation}
We assume that the DNA copy number is a constant n, \begin{equation} [Pro]+[Pro']=n. \end{equation} Then the $Pro'$ proportion depends on the concentration of $TF$, and this proportion also be regarded as the probability of the binding events. \begin{equation} P(\text{binding})=\frac{[Pro']}{n}=\frac{[TF]^n}{K_D+[TF]^n} \end{equation}
If the transcription factor is a activator, the gene is transcribed at the maximal transcription rate $\beta_1$ when the promoter is bound by the transcription factor. If the transcription factor is a repressor, the gene is transcribed at the maximal transcription rate $\beta_1$ when the promoter is free from the transcription factor. In deterministic model, we use Hill function to describe the rate of production, which is $\beta_1$ times its occurring probability. For transcription activation, the maximal transcription rate will occur if the activator bind to the promoter, so the Hill function is \begin{equation} f(x)=\beta_1\frac{[TF]^n}{K^n+[TF]^n}. \end{equation} For transcription repression, the maximal transcription rate will occur if the repressor doesn't bind to the promoter, so the Hill function is \begin{equation} f(x)=\beta_1\frac{K^n}{K^n+[TF]^n}. \end{equation}
$K$ is activation coefficient or repression coefficient, which is the same as $\sqrt[n]{K_D}$.
In addition, many genes have a nonzero minimal expression level, namely basal expression level. It can be described by adding a term $\beta_0$. And some genes that have a constitutive promoter cannot be described by Hill function. We just set their production rate to be a constant.
The translation and degradation processes in our model are based on following reaction:
\begin{equation}
\begin{split}
mRNA_{Y}&\autorightarrow{$K_{tl}$}{}mRNA_{Y}+Y\\
mRNA_{Y}&\autorightarrow{$K_{R}$}{}\emptyset\\
Y&\autorightarrow{$K_{P}$}{}\emptyset
\end{split}
\end{equation}
For translation process, the protein production rate is proportional to the corresponding mRNA concentration $K_{tl}\cdot mRNA$. $K_{tl}$ is determined by the efficiency and concentration of ribosome, the sequence of RBS and concentration of amino acids in the cell, which can be considered identical under the experiment condition. For degradation process, the degradation rate is proportional to the substrate. The proportion for mRNA is $K_{R}$ and the proportion for protein is $K_{P}$. So we list following equations,
\begin{equation}
\begin{aligned}
\frac{dmRNA_{Y}}{dt}&=\beta_0+\beta_1\frac{[TF]^n}{K^n+[TF]^n}-K_{R}\cdot mRNA_{Y} \text{ (activation)}\\
\frac{dmRNA_{Y}}{dt}&=\beta_0+\beta_1\frac{K^n}{K^n+[TF]^n}-K_{R}\cdot mRNA_{Y} \text{ (repression)}\\
\frac{dY}{dt}&=K_{tl}\cdot mRNA_{Y}-K_{P}\cdot Y.
\end{aligned}
\end{equation}
\subsection{RNA interaction}
Different riboregulators have different regulation mechanisms to control the gene expression. Here what we used is a classical engineered riboregulator designed by Isaacs and his colleagues in 2004. This riboregulator can control the gene expression at post-transcription level. More details can be found in (a link in project input module)%%%%
We describe the RNA interactions by this reaction:
\begin{equation}
taRNA+crRNA\autorightleftharpoons{$k_2$}{$k_{-2}$}RNA\ duplex,
\end{equation}
which means
\begin{equation}
\frac{d[RNA\ duplex]}{dt}=k_2\cdot [taRNA]\cdot [crRNA]-k_{-2}\cdot [RNA\ duplex]
\end{equation}
This reaction is much faster than gene expression, so the equilibrium state can be reached quickly,
\begin{equation}
\begin{split}
\frac{d[RNA\ duplex]}{dt}&=0 \\
[RNA\ duplex]_{st}&=\frac{k_2}{k_{-2}}\cdot [taRNA]\cdot [crRNA]
\end{split}
\end{equation}
The total mRNA of the recombinase has two forms, $crRNA$ and $RNA\ duplex$, resulting in another equation:
\begin{equation}
[crRNA]+[RNA\ duplex]=[mRNA_{Cre}].
\end{equation}
Then we have
\begin{equation}
[RNA\ duplex]_{st}=[mRNA_{Cre}]\cdot\frac{[taRNA]}{K_m+[taRNA]},
\end{equation}
where $K_m=\frac{k_{-2}}{k_2}$. So the protein production rate depends on not only the concentration of its corresponding mRNA but also the concentration of $taRNA$. This translation process with post-transcriptional control can be described by
\begin{equation}
\frac{d[Cre]}{dt}=K_{tl}\cdot [mRNA_{Cre}]\cdot\frac{[taRNA]}{K_m+[taRNA]}-K_{P}\cdot [Cre]
\end{equation}
\subsection{Processes related to AHL}
In this section, we describe the 3OC6HSL synthesis, degradation and regulation. 3OC6HSL, a kind of AHL, is enzymatically synthesised by LuxI proteins from some substrates. Here, we use AHL to refer to 3OC6HSL.
\begin{equation}
LuxI\autorightarrow{$k_3$}{some\ substrates}AHL + LuxI
\end{equation}
For simplicity, we assumed that the amount of substrates is sufficient so that the 3OC6HSL synthesis rate is proportional to the LuxI protein concentration. And the degradation of 3OC6HSL can be divided into two parts. With the help of the AHL-degrading enzyme, AHL-lactonase (AiiA), the AHL can be degraded at a high rate, since the complex of AiiA and AHL is more easy to be degraded. Also, the AHL itself can be degraded at a lower rate.
\begin{equation}
\begin{split}
AHL+AiiA&\autorightarrow{$k_4$}{}AHL\text{-}AiiA\ complex\\
AHL\text{-}AiiA\ complex&\autorightarrow{$k_5$}{}\emptyset\\
AHL&\autorightarrow{$k_6$}{}\emptyset\\
k_5&>k_6
\end{split}
\end{equation}
Besides, AHL can regulate gene expression by binding the protein LuxR.
\begin{equation}
2\ AHL+2\ LuxR\autorightarrow{$k_7$}{}AHL\text{-}LuxR\ complex
\end{equation}
The complex can activate luxpR promoter and repress luxpL promoter.
Although the binding events about AHL can be described by some differential equations, we apply QSSA when we focus on the gene expression process, because the timescale of these binding events is much less than the time scale of gene expression. So change of AHL and AHL-LuxR complex over time can be described by differential equations:
\begin{equation}
\begin{split}
&\frac{d[AHL]}{dt}=k_3\cdot [LuxI]-k_4k_5\cdot [AHL] \cdot [AiiA]-k_6\cdot [AHL]-k_7\cdot [LuxR]^2\cdot [AHL]^2\\
&\frac{d[AHL\text{-}LuxR\ complex]}{dt}=k_7\cdot [LuxR]^2\cdot [AHL]^2-K_{P}\cdot [AHL\text{-}LuxR\ complex]
\end{split}
\end{equation}
\subsection{DNA recombination}
Ringrose and his colleagues have developed mathematic models to describe the kinetics of Cre and Flp recombination. The site-specific recombination process can be mainly divided into four steps as the following figure depicts: DNA binding, synapsis, recombination and dissociation.
\begin{figure}[!htb]
\small
\centering
\includegraphics[width=\textwidth]{ReombMech.jpg}
\caption{recombination mechanism} \label{fig:aa}
\end{figure}
The detailed mathematical representation can be found in previous research (Ringrose et al., 1998), so we didn't list them here. We want to explain Cre-mediated inversion using the mutated Cre/loxP system we used prefers the forward reaction. The difference between the mutated loxP site and wild type loxP site only influences the DNA binding and dissociation processes. So we focus on these two steps. A Cre monomer binds to one half of a loxP site and then an asymmetrical homodimer is formed when a second Cre molecule binds to the other half of loxP.
\begin{equation}
\begin{split}
lox+Cre\autorightleftharpoons{$k_8$}{$k_{-8}$} Cre\cdot lox\\
Cre\cdot lox+Cre\autorightleftharpoons{$k_9$}{$k_{-9}$}2Cre\cdot lox
\end{split}
\end{equation}
Because the concentration of intermediate products $Cre\cdot lox$ is very low, and $k_9>>k_{-8}>k_{-9}$, we can apply steady-state treatment to it,
\begin{equation}
\frac{d[Cre\cdot lox]}{dt}=k_8\cdot [lox]\cdot [Cre]-k_{-8}[Cre\cdot lox]-k_9\cdot [Cre\cdot lox]\cdot [Cre]+k_{-9}\cdot[2Cre\cdot lox]=0,
\end{equation}
The second dissociation process is the slowest reaction within these four reactions. Hence, we ignore the last term $k_{-9}\cdot[2Cre\cdot lox]$. Then we get the steady-state of intermediate products $Cre\cdot lox$
\begin{equation}
[Cre\cdot lox]_{st}=\frac{k_8\cdot [lox]\cdot [Cre]}{k_{-8}+k_9\cdot [Cre]}.
\end{equation}
So the reaction rate for $2Cre\cdot lox$ is
\begin{equation}
r_c(2Cre\cdot lox)=\frac{k_9}{k_{-9}}\cdot[Cre]\cdot\frac{k_8\cdot [lox]\cdot [Cre]}{k_{-8}+k_9\cdot [Cre]}=\frac{k_8k_9\cdot [lox]\cdot [Cre]^2}{k_{-8}k_{-9}+k_9k_{-9}\cdot [Cre]}.
\end{equation}
\section{Comparison between different designs}
In this part, we want to demonstrate some advantages of our design by making quantitative comparisons. These advantages include safety, energy efficiency and stability.
\subsection{The post-transcriptional control ensures lower leakage}
Firstly, we don't expect that our designed processing modules will alter its function by some noises of environment. So we design a coherent feedforward loop to filter noise. Here we use $x$ to represent a general input signal. As we mentioned before, the dynamic of the input module with post-transcriptional control can be described by
\begin{equation}
\begin{aligned}
\frac{d[mRNA_{Cre}]}{dt}&=\beta_0+\beta_1\frac{[x]^n}{K^n+[x]^n}-K_{R}\cdot [mRNA_{Cre}]\\
\frac{d[taRNA]}{dt}&=\beta_0+\beta_1\frac{[x]^n}{K^n+[x]^n}-K_{R}\cdot [taRNA]\\
\frac{d[Cre]}{dt}&=K_{tl}\cdot [mRNA_{Cre}]\cdot\frac{[taRNA]}{K_m+[taRNA]}-K_{P}\cdot [Cre].
\end{aligned}
\end{equation}
If there is no post-transcriptional control, this process can be described by
\begin{equation}
\begin{aligned}
\frac{d[mRNA_{Cre}]}{dt}&=\beta_0+\beta_1\frac{[x]^n}{K^n+[x]^n}-K_{R}\cdot [mRNA_{Cre}]\\
\frac{d[Cre]}{dt}&=K_{tl}\cdot [mRNA_{Cre}]-K_{P}\cdot [Cre].
\end{aligned}
\end{equation}
We compare the expression dynamic of Cre in either case at different level of input signal $x$.
\begin{figure}[!htb]
\small
\centering
\includegraphics[width=\textwidth]{fig-FFL.pdf}
\caption{dynamic of the input module} \label{fig:aa}
\end{figure}
This comparison reveals that the riboregulator ensures lower leakage but doesn't affect the expression at a high input level. We also design some experiment to validate the model, the results are consistent with this model.
(some data will be showed here)
\subsection{The mutated Cre/loxP system determines the inversion direction}
Secondly, once the engineered cells receive a sure signal, they will process information as we designed but not the other way around. So we must ensure the direction of the DNA invertion. In other words, the site-specific recombination can be regarded as a unidirectional one. To this end, we choose Cre recombinase and a pair of mutant lox sites, lox66 and lox71 to rearrange DNA sequence. Here, we explain why the forward reaction rate is higher than the reverse one. The mutant site will have a lower affinity for Cre. The binding event mainly depends on the rate of free diffusion, but the dissociation rate will be high if the binding strength is weak. According to the equation
\begin{equation}
r_c(2Cre\cdot lox)=\frac{k_8k_9\cdot [lox]\cdot [Cre]^2}{k_{-8}k_{-9}+k_9k_{-9}\cdot [Cre]},
\end{equation}
we know that the second dissociation event is more significant than the first one. So mutant lox site like lox66 and lox71 can choose a wise way to benefit the formation of dimer. Here we assume that lox66 and lox71 will have higher $k_{-8}$. However, double mutant loxP site like lox72 will have high dissociation rate at both steps. For simplicity, we assume that the affected dissociation rate is $\epsilon$ times the original one. $\epsilon>1$.
After binding event, the two loxP-bound dimers associate to form a tetramer, and recombination proceeds via a Holiday Junction intermediate. Therefore, we can compare $r_c(2Cre\cdot loxP)\cdot r_c(2Cre\cdot lox72)$ with $r_c(2Cre\cdot lox66)\cdot r_c(2Cre\cdot lox71)$ to see what kind of synapsis is easy to form.
\begin{equation}
\begin{split}
r_c(2Cre\cdot loxP)\cdot r_c(2Cre\cdot lox72)&=\frac{k_8^2k_9^2\cdot[lox]^2\cdot [Cre]^4}{({\epsilon}^2k_{-8}k_{-9}+\epsilon k_9k_{-9}[Cre])(k_{-8}k_{-9}+k_9k_{-9}[Cre])}\\
r_c(2Cre\cdot lox66)\cdot r_c(2Cre\cdot lox71)&=\frac{k_8^2k_9^2\cdot[lox]^2\cdot [Cre]^4}{({\epsilon}k_{-8}k_{-9}+k_9k_{-9}[Cre])^2}
\end{split}
\end{equation}
For $\epsilon>1$,
\begin{equation}
\begin{aligned}
\frac{r_c(2Cre\cdot loxP)\cdot r_c(2Cre\cdot lox72)}{r_c(2Cre\cdot lox66)\cdot r_c(2Cre\cdot lox71)}&=\frac{({\epsilon}k_{-8}k_{-9}+k_9k_{-9}[Cre])^2}{({\epsilon}^2k_{-8}k_{-9}+\epsilon k_9k_{-9}[Cre])(k_{-8}k_{-9}+k_9k_{-9}[Cre])}\\
&=1-\frac{(\epsilon-1)(k_9k_{-9}[Cre]^2)+\epsilon(\epsilon-1)(k_{-8}k_9k_{-9}^2[Cre])}{({\epsilon}^2k_{-8}k_{-9}+\epsilon k_9k_{-9}[Cre])(k_{-8}k_{-9}+k_9k_{-9}[Cre])}\\
&<1
\end{aligned}
\end{equation}
Under the condition that the $Cre\cdot lox$ is much more easy to get a Cre monomer rather than loss a Cre monomer, this proportion can be approximately equal to $\frac{1}{\epsilon}$, because
\begin{equation}
\begin{aligned}
r_c(2Cre\cdot lox)&=\frac{k_8k_9\cdot [lox]\cdot [Cre]^2}{k_{-8}k_{-9}+k_9k_{-9}\cdot [Cre]}
&\approx\frac{k_8\cdot [lox]\cdot [Cre]}{k_{-9}}
\end{aligned}
\end{equation}
Hence, $r_c($$2Cre\cdot loxP)$$dot r_c($$2Cre\cdot lox72)$$
where $I$ is the identity matrix. Then we construct some indexes to quantify our needs. Next, we scan different possible matrices and different parameters and record the indexes. Finally we find the patterns that meet our requirements and choose some specific biobricks to construct our rewirable circuits.
\subsection{Demo}
If we want to send one pulse signal and then send another pulse signal, you can use this topological structure to achieve it.
\begin{figure}[!htb]
\small
\centering
\includegraphics[width=0.1\textwidth]{demo1.png}
\caption{Topological structure 1.} \label{fig:aa}
\end{figure}
The corresponding ODE sets is
\begin{equation}
\begin{split}
\left(
\begin{array}{c}
\frac{dA_{mRNA}}{dt}\\
\frac{dB_{mRNA}}{dt}\\
\frac{dC_{mRNA}}{dt}\\
\frac{dD_{mRNA}}{dt}\\
\end{array}
\right)
&=
\left(
\begin{array}{cccc}
0 & 0 & 0 & 0\\
1 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
\end{array}
\right)
\cdot
\left(
\begin{array}{c}
\frac{\beta_1A^n}{K^n+A^n}\\
\frac{\beta_2B^n}{K^n+B^n}\\
\frac{\beta_3C^n}{K^n+C^n}\\
\frac{\beta_4D^n}{K^n+D^n}\\
\end{array}
\right)\\
&+
\left(
\begin{array}{cccc}
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
0 & 0 & 1 & 0\\
1 & 1 & 0 & 0\\
\end{array}
\right)
\cdot
\left(
\begin{array}{c}
\frac{\beta_5K^n}{K^n+A^n}\\
\frac{\beta_6K^n}{K^n+B^n}\\
\frac{\beta_7K^n}{K^n+C^n}\\
\frac{\beta_8K^n}{K^n+D^n}\\
\end{array}
\right)\\
&-
\left(
\begin{array}{cccc}
1 & 0 & 0 & 0\\
0 & 1 & 0 & 0\\
0 & 0 & 1 & 0\\
0 & 0 & 0 & 1\\
\end{array}
\right)
\cdot
K_{dm}
\cdot
\left(
\begin{array}{c}
A_{mRNA}\\
B_{mRNA}\\
C_{mRNA}\\
D_{mRNA}\\
\end{array}
\right)
\end{split}
\end{equation}
\begin{equation}
\begin{split}
\left(
\begin{array}{c}
\frac{dA}{dt}\\
\frac{dB}{dt}\\
\frac{dC}{dt}\\
\frac{dD}{dt}\\
\end{array}
\right)
=
K_{tl}\cdot
\left(
\begin{array}{c}
A_{mRNA}\\
B_{mRNA}\\
C_{mRNA}\\
D_{mRNA}\\
\end{array}
\right)
-K_{dp}\cdot
\left(
\begin{array}{c}
A\\
B\\
C\\
D\\
\end{array}
\right).
\end{split}
\end{equation}
Then we can get the one simulation result with a set of parameters.
\begin{figure}[!htb]
\small
\centering
\includegraphics[width=\textwidth]{demo3.pdf}
\caption{The simulation result of topological structure 1.} \label{fig:aa}
\end{figure}
When we exchange the promoters of gene C and gene D, the topological structure of regulation is altered to another form.
\begin{figure}[!htb]
\small
\centering
\includegraphics[width=0.1\textwidth]{demo2.png}
\caption{Topological structure 2.} \label{fig:aa}
\end{figure}
Then we just exchange two rows in the regulatory relationship matrices. The ODE for transcription change with them.
\begin{equation}
\begin{split}
\left(
\begin{array}{c}
\frac{dA_{mRNA}}{dt}\\
\frac{dB_{mRNA}}{dt}\\
\frac{dC_{mRNA}}{dt}\\
\frac{dD_{mRNA}}{dt}\\
\end{array}
\right)
&=
\left(
\begin{array}{cccc}
0 & 0 & 0 & 0\\
1 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
\end{array}
\right)
\cdot
\left(
\begin{array}{c}
\frac{\beta_1A^n}{K^n+A^n}\\
\frac{\beta_2B^n}{K^n+B^n}\\
\frac{\beta_3C^n}{K^n+C^n}\\
\frac{\beta_4D^n}{K^n+D^n}\\
\end{array}
\right)\\
&+
\left(
\begin{array}{cccc}
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
1 & 1 & 0 & 0\\
0 & 0 & 1 & 0\\
\end{array}
\right)
\cdot
\left(
\begin{array}{c}
\frac{\beta_5K^n}{K^n+A^n}\\
\frac{\beta_6K^n}{K^n+B^n}\\
\frac{\beta_7K^n}{K^n+C^n}\\
\frac{\beta_8K^n}{K^n+D^n}\\
\end{array}
\right)\\
&-
\left(
\begin{array}{cccc}
1 & 0 & 0 & 0\\
0 & 1 & 0 & 0\\
0 & 0 & 1 & 0\\
0 & 0 & 0 & 1\\
\end{array}
\right)
\cdot
K_{dm}
\cdot
\left(
\begin{array}{c}
A_{mRNA}\\
B_{mRNA}\\
C_{mRNA}\\
D_{mRNA}\\
\end{array}
\right)
\end{split}
\end{equation}
We simulate it again and get another result with the same set of parameters.
\begin{figure}[!htb]
\small
\centering
\includegraphics[width=\textwidth]{demo4.pdf}
\caption{The simulation result of topological structure 2.} \label{fig:aa}
\end{figure}
\end{document}