Team:TU Darmstadt/Results/Modeling/ANS Engineering

From 2014.igem.org

(Difference between revisions)
 
(34 intermediate revisions not shown)
Line 1: Line 1:
{{:Team:TU_Darmstadt/Template}}
{{:Team:TU_Darmstadt/Template}}
 +
<html>
<html>
Line 17: Line 18:
<h1>Overview</h1>
<h1>Overview</h1>
-
 
+
<p>The anthocyanidin synthase from <i>Fragaria x ananassa</i> (ANS, EC 1.14.11.19) catalyzes many reactions in the anthocyanidin pathway. We used its functionality by catalyzing the conversion of the leucoanthocyanidin (2R,3S,4S)-cis-lucopelargonidin to the anthocyanidin pelargonidin. It also catalyzes the conversion of the leucoanthocyanidin to flavonol (kampferol).
 +
Earlier studies hypothesized that ANS may be involved in metabolic channeling in their native organisms. So the ANS became a
 +
target for an ambitious modeling pipeline. The project <b>eANS</b> was born. The modeling pipeline:</p> 
<div class="contentcenter">
<div class="contentcenter">
<a href="https://static.igem.org/mediawiki/parts/8/81/Project_Description.png" onclick="openPic('https://static.igem.org/mediawiki/parts/8/81/Project_Description.png','thePicture','width=600,height=600,status=0,menubar=0'); return false;" target="thePicture">
<a href="https://static.igem.org/mediawiki/parts/8/81/Project_Description.png" onclick="openPic('https://static.igem.org/mediawiki/parts/8/81/Project_Description.png','thePicture','width=600,height=600,status=0,menubar=0'); return false;" target="thePicture">
Line 23: Line 26:
</a>
</a>
</div>
</div>
-
<p>In order to optimize the metabolic channeling of ANS, we chose a rational protein engineering approach.&nbsp; <br />The first step of our multi scale and rational engineering project was the creation of<br />a sophisticated 3D model with <a href="http://www.yasara.org/" title="Link to yasara.org" target="_blank" class="external-link-new-window">YASARA</a> structure. This model was then used for a structural refinement with the <a href="http://www1.jcsg.org/scripts/prod/scwrl/serve.cgi" title="Link to jcsg.org" target="_blank" class="external-link-new-window">SCWRL</a> alghorithm and was energy minimized with YASARA nova force field.<br />Afterwards, we started a true mechanical engineering approach to determine the movements within the protein. Therefore, a Gaussian Network Model (GNM) (Fig. 15) and an Anisotropic Network Model (Fig. 16) were implemented.<br />Those are simple models which simulate the mechanical behavior of the protein. Moreover, Linear Response Theory (LRT) (Fig. 18) was used to simulate the substrate binding inside the pocket and thus trigger an induced fit mechanism.&nbsp;<br />Afterwards we collect our data, defined rational mutations and finally constructed eANS. With this eANS version another MD&nbsp;simulation was started and the sequence of the protein was given to the wetlab for <i>in vitro</i> construction and <i>in vivo</i> characterization.</p>
+
<p>In order to optimize the metabolic channeling of ANS, we chose a rational protein engineering approach. The first step of our multi scale and rational engineering project was the creation of a sophisticated 3D model with <a href="http://www.yasara.org/" title="Link to yasara.org" target="_blank" class="external-link-new-window">YASARA</a> structure. This model was then used for a structural refinement with the <a href="http://www1.jcsg.org/scripts/prod/scwrl/serve.cgi" title="Link to jcsg.org" target="_blank" class="external-link-new-window">SCWRL</a> alghorithm and was energy minimized with YASARA nova force field.<br />Afterwards, we started a true mechanical engineering approach to determine the movements within the protein. Therefore, a Gaussian Network Model (GNM) and an Anisotropic Network Model were implemented.Those are simple models which <b>simulate the mechanical behavior</b> of the protein. Moreover, Linear Response Theory (LRT) was used to simulate the substrate binding inside the pocket and thus simulate an induced fit mechanism.&nbsp;<br />Subsequently, we collected our data, defined rational mutations and finally constructed eANS. With this eANS version another MD&nbsp;simulation was started and the sequence of the protein was given to the <a href="https://2014.igem.org/Team:TU_Darmstadt/Results/Pathway" title="Wet lab results of eANS" target="_blank" class="external-link-new-window">wetlab</a> for <i>in vitro</i> construction and <i>in vivo</i> characterization.</p>
<h1>Coarse Grained Models (ANM &amp; GNM)</h1>
<h1>Coarse Grained Models (ANM &amp; GNM)</h1>
-
<p>With the GNM and ANM we can take a closer look inside the mechanics of the ANS. The result of the GNM computation shows a great peak at the C Terminus. It lead to the assumption that the C terminal region of the ANS is highly flexible. Unfortunately, this region belongs to the active side of the protein. &nbsp;One can imagine that this region may cover the active site and decrease the probability of substrate binding during the process of catalysis. &nbsp;</p>
+
<p>With a computed GNM and ANM we were able to take a closer look inside the mechanics of the ANS. The result of the GNM computation showed a great peak at the C-terminus. It lead to the assumption that the C-terminal region of the ANS is highly flexible. Unfortunately, this region belongs to the active side of the protein. &nbsp;One can imagine that this region may cover the active site and decrease the probability of substrate binding during the process of catalysis. &nbsp;</p>
<div class="contentcenter">
<div class="contentcenter">
Line 35: Line 38:
</a>
</a>
</div>
</div>
 +
<br></br>
-
 
+
<p>Following figure shows the flexibility, represented by the slow modes, of wild-type ANS in a three dimensional model as displayed above. The results are extracted out of an ANM. The spatial directions of the simulated movement displayed below. This directions are represented as arrows with color coded strength (from red ~ strong to blue ~ weak). Only the C-terminus exhibits a large correlated movement. </p>
-
<p>Following figure shows the flexibility, represented by the slow modes, of wild-type ANS in a three dimensional model as displayed above. The slow modes are plotted against the amino acid position. &nbsp;</p>
+
Line 48: Line 51:
<h2>LRT</h2>
<h2>LRT</h2>
-
<p>If we simulate the substrate binding in the pocket of the ANS by applying a force vector to the active site and binding region we can observe a strong deformation of the enzyme. This result reveals that the C Terminal region of the ANS is still highly flexible during the process of induced fit.&nbsp;</p>
+
<p>If we simulate the substrate binding in the pocket of the ANS by applying a force vector to the active site and binding region we can observe a strong deformation of the enzyme. This process is called induced fit. This result reveals that the C-terminal region of the ANS is still highly flexible during the process of induced fit.&nbsp; This is a problem because the
 +
substrate release as well as the binding is perturbed and thus the reaction rate is limited. </p>
<div class="contentcenter">
<div class="contentcenter">
Line 55: Line 59:
</a>
</a>
</div>
</div>
 +
<div class="contentcenter">
<div class="contentcenter">
Line 62: Line 67:
</div>
</div>
 +
<h1>Design Prediction</h1>
-
<h2>Design Prediction</h2>
+
We´ve concluded that <b>we´d have to remove the C-terminal region to increase the substrate binding</b> and destroy the fluctuating C-terminal tail near the active site. A model depicting the protein flexibility is presented below, which encouraged us to pursue our approach.
-
 
+
-
<p>We have to cut of the C-Terminal region to increase the substrate binding and destroy the fluctuating C-Terminal region near the active site. A model corresponding to proteins flexibility is presented below, which ensures our goal.</p>
+
<div class="contentcenter">
<div class="contentcenter">
Line 73: Line 77:
</div>
</div>
-
<h1>Molecular Dynamics</h1>
+
<h1>Molecular Dynamics Simulation (MD)</h1>
<h2>RMSD&nbsp;</h2>
<h2>RMSD&nbsp;</h2>
 +
<p>The Root mean square deviation (short: RMSD) can be computed as followed: </p>
-
<p>As can be seen in Figures RMSD (ANS short ; ANS_long), the wild type has minimal changes in the four calculated distances, which leads to the consumption that the central core stays quite stable during the simulation - equation is shown below.
 
-
</p>
 
<p>\[ RMSD(v,w) = \sqrt{\frac{1}{n} \sum_{i=1}^{n} ||v_i - w_i ||^2} \]</p>
<p>\[ RMSD(v,w) = \sqrt{\frac{1}{n} \sum_{i=1}^{n} ||v_i - w_i ||^2} \]</p>
<p>\[ &nbsp;= \sqrt{\frac{1}{n} \sum_{i=1}^{n} (v_{ix} - w_{ix} )^2+(v_{iy} - w_{iy} )^2+(v_{iz} - w_{iz} )^2} \]  
<p>\[ &nbsp;= \sqrt{\frac{1}{n} \sum_{i=1}^{n} (v_{ix} - w_{ix} )^2+(v_{iy} - w_{iy} )^2+(v_{iz} - w_{iz} )^2} \]  
-
</p>
 
-
<p>If we take a closer look at the RMSD distributions (RMSD Histograms) we can observe that the engineered ANS is more stable than the wilt type. Additionally, the wild typt reaches a higher plateau and overall RMSD.</p>
 
-
<p>Wild type ANS's results are displayed below.</p>
+
here n is the number of atoms (Cα or backbone atoms). V i is defined as the coordinates of protein V atom i. Here, the RMSD is used to quantify a comparison between the structures of two protein (v and w) folds. The RMSD was computed from the atomic coordinates of the C-alpha in R using the bio3d package.
 +
 
 +
 
 +
<h2>RMSD Results&nbsp;</h2>
 +
 
 +
<p>As can be seen in Figures RMSD (ANS short ; ANS_long), the wild type has minimal changes in the four calculated distances, which leads to the consumption that the central core stays quite stable during the simulation - equation is shown below.
 +
If we take a closer look at the RMSD distributions (RMSD Histograms) we can observe that the engineered ANS is more stable than the wilt type. Additionally, the wild typ reaches a higher plateau and overall RMSD.</p>
 +
 
 +
<p>RMSD of wild type ANS vs. the simulation time in ps is shown below.</p>
<div class="contentcenter">
<div class="contentcenter">
Line 91: Line 100:
</div>
</div>
-
<p>RMSD of engineered ANS is shown below.</p>
+
<p>RMSD of engineered ANS vs. the simulation time in ps is shown below.</p>
<div class="contentcenter">
<div class="contentcenter">
Line 106: Line 115:
-
<p>Conclusion: Overall structure of the Engineered ANS is more stable over time.&nbsp;</p>
+
<p><h2>Conclusion:&nbsp;</h2> Overall structure of <b>the engineered ANS is more stable over time</b>. Moreover, the RMSF (ref. RMSF Plots) computations reproduced the results derived from the coarse grained simulations (GNM, ANM and LRT). This reveals that coarse
 +
grained simulations are suitable for rational design approach. These models arent as computational expensive as MD Simulations. For example the ANM as well as GNM models can be computed on a single core Processor N270 (512K Cache, 1.60 GHz, 533 MHz FSB)
 +
in only a few minutes. Contrary to this the MD Simulation of the ANS and eANS calculated a few month on a Phenom II X6 1090T with 6 cores, 2.8 GHz&nbsp. This underlines the complexity and importance of coarse grained simulations for rational protein design. With the RMSF we can clearly bring to proof that the C Terminal region is highly flexible and thus a obstacle to the active site of the ANS.&nbsp</p>
-
<p>The RMSF (ref. RMSF Plots) computations reproduced the results derived from the coarse grained simulations (GNM, ANM and LRT).
+
<h2>RMSF </h2>
 +
<p>The Root mean square fluctuation (short: RMSF) describes the dynamic movement of a amino acid residue in a protein. High RMSF values in a certain area indicate a high flexibility. </p>
 +
<p>RMSF can be computed as followed:</p>
 +
<p>\[ RMSF= \sqrt{ \frac{1}{T} \sum_{t_j = 1}^T (x_i (t_j) - \tilde{x} )^2 &nbsp;} \]<span style="font-size: 12px;"></span>
</p>
</p>
-
<p>\[ RMSF= \sqrt{ \frac{1}{T} \sum_{t_j = 1}^T (x_i (t_j) - \tilde{x} )^2 &nbsp;} \]<span style="font-size: 12px;">&nbsp;</span>
+
<p>Where T is the duration of the simulation (time steps) and x i (t j) the coordinates of atom x i at time t j. Now we are calculating the sum of the squared difference of the mean coordinate x i and x i (t j). Next we divide the sum to T and extract the root of it. Hence we are able to calculate the fluctuation of an atom with its mean in trajectory files. The RMSF was computed from the atomic coordinates of the Cα in R using the bio3d library.</p>
-
</p>
+
 
-
<p>This underlines the complexity and importance of coarse grained simulations for rational protein design. With the RMSF we can clearly bring to proof that the C Terminal region is highly flexible and thus a obstacle to the active site of the ANS.&nbsp;
+
<h2>RMSF Results&nbsp;</h2>
-
</p>
+
<p>Plots of RMSF are shown below. Here, the residue position is plotted against the RMSF in Angström. The first plot shows the native ANS simulation, whereas the second displays the engineered eANS.</p>
-
<p>Plots of RMSF are shown below. On the left side is the native ANS, whereas on the right side engineered ANS's results can be viewed.</p>
+
<div class="contentcenter">
<div class="contentcenter">
<a href="https://static.igem.org/mediawiki/parts/e/e5/Rmsf_long.png" onclick="openPic('https://static.igem.org/mediawiki/parts/e/e5/Rmsf_long.png','thePicture','width=600,height=600,status=0,menubar=0'); return false;" target="thePicture">
<a href="https://static.igem.org/mediawiki/parts/e/e5/Rmsf_long.png" onclick="openPic('https://static.igem.org/mediawiki/parts/e/e5/Rmsf_long.png','thePicture','width=600,height=600,status=0,menubar=0'); return false;" target="thePicture">
-
<img src="https://static.igem.org/mediawiki/parts/e/e5/Rmsf_long.png" width="580" alt="">
+
<img src="https://static.igem.org/mediawiki/2014/8/86/ANS_long_neu_2.png" width="580" alt="">
</a>
</a>
</div>
</div>
Line 125: Line 138:
<a href="https://static.igem.org/mediawiki/parts/5/58/Rmsf_short.png" onclick="openPic('https://static.igem.org/mediawiki/parts/5/58/Rmsf_short.png','thePicture','width=600,height=600,status=0,menubar=0'); return false;" target="thePicture">
<a href="https://static.igem.org/mediawiki/parts/5/58/Rmsf_short.png" onclick="openPic('https://static.igem.org/mediawiki/parts/5/58/Rmsf_short.png','thePicture','width=600,height=600,status=0,menubar=0'); return false;" target="thePicture">
-
<img src="https://static.igem.org/mediawiki/parts/5/58/Rmsf_short.png" width="580" alt="">
+
<img src="https://static.igem.org/mediawiki/2014/a/a6/ANS_short_neu.png" width="580" alt="">
</a>
</a>
</div>
</div>
-
<p>Conclusion: It was necessary to unleash the active site by cutting of the C- Terminal region. Only with this modification we can increase the turnover of the ANS.&nbsp;</p>
+
 
 +
<p><h2>Conclusion:&nbsp;</h2> It was necessary to improve metabolic channeling of the active site by removing the C-terminal region. Moreover, the overall stability of eANS is increased (represented by the RMSD histograms). <b>We could demonstrate in our <a href="https://2014.igem.org/Team:TU_Darmstadt/Results/Pathway" title="Wet lab results of eANS" target="_blank" class="external-link-new-window">wetlab</a> experiments that our design approach helped to increase the pelargonidin yield <i>in vivo</i>.</b></p>
 +
 
 +
<div class="contentcenter">
 +
<img src="https://static.igem.org/mediawiki/parts/d/d7/Pelletf%C3%A4rbungII.png" width="581" height="299" alt="">
 +
<p>
 +
<i>E.coli</i> BL21 (DE3) pellet containing the pelargonidin producing operon after the fermentation. According to Yan <i>et al.</i> (2007) a pelargonidin producing <i>E.coli</i> should be red after a pelargenidin production. The operon with the engineered anthocyanindin synthase produces more pelargonidin.
 +
</p>
 +
</div>
</div><!--TYPO3SEARCH_end-->
</div><!--TYPO3SEARCH_end-->
</div> <!--ende contentWrap-->
</div> <!--ende contentWrap-->
</html>
</html>

Latest revision as of 03:36, 18 October 2014

Home


Overview

The anthocyanidin synthase from Fragaria x ananassa (ANS, EC 1.14.11.19) catalyzes many reactions in the anthocyanidin pathway. We used its functionality by catalyzing the conversion of the leucoanthocyanidin (2R,3S,4S)-cis-lucopelargonidin to the anthocyanidin pelargonidin. It also catalyzes the conversion of the leucoanthocyanidin to flavonol (kampferol). Earlier studies hypothesized that ANS may be involved in metabolic channeling in their native organisms. So the ANS became a target for an ambitious modeling pipeline. The project eANS was born. The modeling pipeline:

In order to optimize the metabolic channeling of ANS, we chose a rational protein engineering approach. The first step of our multi scale and rational engineering project was the creation of a sophisticated 3D model with YASARA structure. This model was then used for a structural refinement with the SCWRL alghorithm and was energy minimized with YASARA nova force field.
Afterwards, we started a true mechanical engineering approach to determine the movements within the protein. Therefore, a Gaussian Network Model (GNM) and an Anisotropic Network Model were implemented.Those are simple models which simulate the mechanical behavior of the protein. Moreover, Linear Response Theory (LRT) was used to simulate the substrate binding inside the pocket and thus simulate an induced fit mechanism. 
Subsequently, we collected our data, defined rational mutations and finally constructed eANS. With this eANS version another MD simulation was started and the sequence of the protein was given to the wetlab for in vitro construction and in vivo characterization.

Coarse Grained Models (ANM & GNM)

With a computed GNM and ANM we were able to take a closer look inside the mechanics of the ANS. The result of the GNM computation showed a great peak at the C-terminus. It lead to the assumption that the C-terminal region of the ANS is highly flexible. Unfortunately, this region belongs to the active side of the protein.  One can imagine that this region may cover the active site and decrease the probability of substrate binding during the process of catalysis.  



Following figure shows the flexibility, represented by the slow modes, of wild-type ANS in a three dimensional model as displayed above. The results are extracted out of an ANM. The spatial directions of the simulated movement displayed below. This directions are represented as arrows with color coded strength (from red ~ strong to blue ~ weak). Only the C-terminus exhibits a large correlated movement.

LRT

If we simulate the substrate binding in the pocket of the ANS by applying a force vector to the active site and binding region we can observe a strong deformation of the enzyme. This process is called induced fit. This result reveals that the C-terminal region of the ANS is still highly flexible during the process of induced fit.  This is a problem because the substrate release as well as the binding is perturbed and thus the reaction rate is limited.

Design Prediction

We´ve concluded that we´d have to remove the C-terminal region to increase the substrate binding and destroy the fluctuating C-terminal tail near the active site. A model depicting the protein flexibility is presented below, which encouraged us to pursue our approach.

Molecular Dynamics Simulation (MD)

RMSD 

The Root mean square deviation (short: RMSD) can be computed as followed:

\[ RMSD(v,w) = \sqrt{\frac{1}{n} \sum_{i=1}^{n} ||v_i - w_i ||^2} \]

\[  = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (v_{ix} - w_{ix} )^2+(v_{iy} - w_{iy} )^2+(v_{iz} - w_{iz} )^2} \] here n is the number of atoms (Cα or backbone atoms). V i is defined as the coordinates of protein V atom i. Here, the RMSD is used to quantify a comparison between the structures of two protein (v and w) folds. The RMSD was computed from the atomic coordinates of the C-alpha in R using the bio3d package.

RMSD Results 

As can be seen in Figures RMSD (ANS short ; ANS_long), the wild type has minimal changes in the four calculated distances, which leads to the consumption that the central core stays quite stable during the simulation - equation is shown below. If we take a closer look at the RMSD distributions (RMSD Histograms) we can observe that the engineered ANS is more stable than the wilt type. Additionally, the wild typ reaches a higher plateau and overall RMSD.

RMSD of wild type ANS vs. the simulation time in ps is shown below.

RMSD of engineered ANS vs. the simulation time in ps is shown below.

Conclusion: 

Overall structure of the engineered ANS is more stable over time. Moreover, the RMSF (ref. RMSF Plots) computations reproduced the results derived from the coarse grained simulations (GNM, ANM and LRT). This reveals that coarse grained simulations are suitable for rational design approach. These models arent as computational expensive as MD Simulations. For example the ANM as well as GNM models can be computed on a single core Processor N270 (512K Cache, 1.60 GHz, 533 MHz FSB) in only a few minutes. Contrary to this the MD Simulation of the ANS and eANS calculated a few month on a Phenom II X6 1090T with 6 cores, 2.8 GHz&nbsp. This underlines the complexity and importance of coarse grained simulations for rational protein design. With the RMSF we can clearly bring to proof that the C Terminal region is highly flexible and thus a obstacle to the active site of the ANS.&nbsp

RMSF

The Root mean square fluctuation (short: RMSF) describes the dynamic movement of a amino acid residue in a protein. High RMSF values in a certain area indicate a high flexibility.

RMSF can be computed as followed:

\[ RMSF= \sqrt{ \frac{1}{T} \sum_{t_j = 1}^T (x_i (t_j) - \tilde{x} )^2  } \]

Where T is the duration of the simulation (time steps) and x i (t j) the coordinates of atom x i at time t j. Now we are calculating the sum of the squared difference of the mean coordinate x i and x i (t j). Next we divide the sum to T and extract the root of it. Hence we are able to calculate the fluctuation of an atom with its mean in trajectory files. The RMSF was computed from the atomic coordinates of the Cα in R using the bio3d library.

RMSF Results 

Plots of RMSF are shown below. Here, the residue position is plotted against the RMSF in Angström. The first plot shows the native ANS simulation, whereas the second displays the engineered eANS.

Conclusion: 

It was necessary to improve metabolic channeling of the active site by removing the C-terminal region. Moreover, the overall stability of eANS is increased (represented by the RMSD histograms). We could demonstrate in our wetlab experiments that our design approach helped to increase the pelargonidin yield in vivo.

E.coli BL21 (DE3) pellet containing the pelargonidin producing operon after the fermentation. According to Yan et al. (2007) a pelargonidin producing E.coli should be red after a pelargenidin production. The operon with the engineered anthocyanindin synthase produces more pelargonidin.