Team:TU Darmstadt/Results/Modeling/ANS Engineering

From 2014.igem.org

(Difference between revisions)
Line 67: Line 67:
</div>
</div>
 +
<h1>Design Prediction</h1>
-
<h2>Design Prediction</h2>
+
<h2>We have to cut of the C-Terminal region to increase the substrate binding and destroy the fluctuating C-Terminal region near the active site. A model corresponding to proteins flexibility is presented below, which ensures our goal.</h2>
-
 
+
-
<p>We have to cut of the C-Terminal region to increase the substrate binding and destroy the fluctuating C-Terminal region near the active site. A model corresponding to proteins flexibility is presented below, which ensures our goal.</p>
+
<div class="contentcenter">
<div class="contentcenter">

Revision as of 02:15, 18 October 2014

Home


Overview

The anthocyanidin synthase from Fragaria x ananassa (ANS, EC 1.14.11.19) catalyzes many reactions in the anthocyanidin pathway. We used its functionality by catalyzing the conversion of the leucoanthocyanidin (2R,3S,4S)-cis-lucopelargonidin to the anthocyanidin pelargonidin. It also catalyzes the conversion of the leucoanthocyanidin to flavonol (kampferol). In order to avoiding this side reaction and enhance metabolic flux, we designed a protein scaffold (BBa_K1497031).

Earlier studies hypothesized that ANS may be involved in metabolic channeling in their native organisms. By modelling the mechanical movements of ANS, we discovered a strong flexibility at the C-terminus. Subsequently we modelled ANS' structure and its movements. Thereby we detected a tail at the C-terminus of the enzyme fluctuating and covering the active site (Fig. 15). In oder to improve the enzyme we decided to remove the tail and to construct our pelargonidin operon with this engineered ANS (eANS). The laboratory results confirmed the previous modeling results. The engineered ANS exhibited better yields than the original one when used in an operon producing pelargonidin (BBa_K1497015).

In order to optimize the metabolic channeling of ANS, we chose a rational protein engineering approach. 
The first step of our multi scale and rational engineering project was the creation of
a sophisticated 3D model with YASARA structure. This model was then used for a structural refinement with the SCWRL alghorithm and was energy minimized with YASARA nova force field.
Afterwards, we started a true mechanical engineering approach to determine the movements within the protein. Therefore, a Gaussian Network Model (GNM) and an Anisotropic Network Model were implemented.
Those are simple models which simulate the mechanical behavior of the protein. Moreover, Linear Response Theory (LRT) was used to simulate the substrate binding inside the pocket and thus trigger an induced fit mechanism. 
Afterwards we collect our data, defined rational mutations and finally constructed eANS. With this eANS version another MD simulation was started and the sequence of the protein was given to the wetlab for in vitro construction and in vivo characterization.

Coarse Grained Models (ANM & GNM)

With a computed GNM and ANM we can take a closer look inside the mechanics of the ANS. The result of the GNM computation shows a great peak at the C Terminus. It lead to the assumption that the C terminal region of the ANS is highly flexible. Unfortunately, this region belongs to the active side of the protein.  One can imagine that this region may cover the active site and decrease the probability of substrate binding during the process of catalysis.  



Following figure shows the flexibility, represented by the slow modes, of wild-type ANS in a three dimensional model as displayed above. The results are extracted out of an ANM. The spatial directions of the simulated movement. This directions are represented as arrows with color coated strength (from red ~ strong to blue ~ weak). As one can see only the C- terminus exhibits a large correlated movement

LRT

If we simulate the substrate binding in the pocket of the ANS by applying a force vector to the active site and binding region we can observe a strong deformation of the enzyme. This process is called induced fit. This result reveals that the C Terminal region of the ANS is still highly flexible during the process of induced fit.  This is a problem because the substrate release as well as the binding is perturbed and thus the reaction rate is limited.

Design Prediction

We have to cut of the C-Terminal region to increase the substrate binding and destroy the fluctuating C-Terminal region near the active site. A model corresponding to proteins flexibility is presented below, which ensures our goal.

Molecular Dynamics Simulation (MD)

RMSD 

The Root mean square deviation (short: RMSD) can be computed as followed:

\[ RMSD(v,w) = \sqrt{\frac{1}{n} \sum_{i=1}^{n} ||v_i - w_i ||^2} \]

\[  = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (v_{ix} - w_{ix} )^2+(v_{iy} - w_{iy} )^2+(v_{iz} - w_{iz} )^2} \] here n is the number of atoms (Cα or backbone atoms). V i is defined as the coordinates of protein V atom i. Here, the RMSD is used to quantify a comparison between the structures of two protein (v and w) folds. The RMSD was computed from the atomic coordinates of the C-alpha in R using the bio3d package.

RMSD Results 

As can be seen in Figures RMSD (ANS short ; ANS_long), the wild type has minimal changes in the four calculated distances, which leads to the consumption that the central core stays quite stable during the simulation - equation is shown below. If we take a closer look at the RMSD distributions (RMSD Histograms) we can observe that the engineered ANS is more stable than the wilt type. Additionally, the wild typ reaches a higher plateau and overall RMSD.

RMSD of wild type ANS vs. the simulation time in ps is shown below.

RMSD of engineered ANS vs. the simulation time in ps is shown below.

Conclusion: 

Overall structure of the Engineered ANS is more stable over time. Moreover, the RMSF (ref. RMSF Plots) computations reproduced the results derived from the coarse grained simulations (GNM, ANM and LRT). This reveals that coarse grained simulations are suitable for rational design approach. These models arent as computational expensive as MD Simulations. For example the ANM as well as GNM models can be computed on a single core Processor N270 (512K Cache, 1.60 GHz, 533 MHz FSB) in only a few minutes. Contrary to this the MD Simulation of the ANS and eANS calculated a few month on a Phenom II X6 1090T with 6 cores, 2.8 GHz&nbsp. This underlines the complexity and importance of coarse grained simulations for rational protein design. With the RMSF we can clearly bring to proof that the C Terminal region is highly flexible and thus a obstacle to the active site of the ANS.&nbsp

RMSF  

The Root mean square fluctuation (short: RMSF) can be computed as followed:&nbsp

\[ RMSF= \sqrt{ \frac{1}{T} \sum_{t_j = 1}^T (x_i (t_j) - \tilde{x} )^2  } \] 

where T is the duration of the simulation (time steps) and x i (t j) the coordinates of atom x i at time t j. Now we are calculating the sum of the squared difference of the mean coordinate x i and x i (t j). Next we divide the sum to T and extract the root of it. Hence we are able to calculate the fluctuation of an atom with its mean in trajectory files. The RMSF was computed from the atomic coordinates of the Cα in R using the bio3d library. &nbsp

RMSF Results 

Plots of RMSF are shown below. Here, the residue position is plotted against the RMSF in Angström. The first plot shows the native ANS simulation, whereas the second displays the engineered eANS.

Conclusion: 

It was necessary to improve metabolic channeling of the active site by cutting of the C- Terminal region. Moreover the overall stability of eANS is increased (represented by the RMSD histograms). This modification helped to increase the turnover of our product. We could demonstrate in our wetlab with in vivo characterizations that our design approach helped to increase the yield.