Team:UC Davis/Signal Processing

From 2014.igem.org

Revision as of 03:20, 18 October 2014 by AaronC (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

UC Davis iGEM 2014

Mathematical Approach

Testing Our Model

Olive Oil

Our signal processing data set can be downloaded here.

Mathematical Approach

Testing Our Model

To test our model we built a combinatorial set of aldehydes. We chose three representative aldehydes from each bin, saturated medium, saturated long, and unsaturated. The three aldehydes were chosen such that the enzyme's response to each would best represent their respective groups. We created a total of 64 different combinations by mixing Pentanal, Decanal, and E-2-Decenal in four different values, 0 µM, 10 µM, 100 µM, & 1000 µM. Three combinatorial well plates were made and mixed with each enzyme separately. The observed velocity is recorded in each well.

The catalytic matrix is inverted and multiplied by the observed velocity in each well and out comes our predicted concentrations!

In high concentrations, we found that aldehydes would crash out of solution. We chose to focus on only the data set below 1000 µM. A more serious issue was brought to light however. The observed velocities from the combinatorial plates suggested competitive inhibition was occurring when E-2-Decenal was present in solution. This obfuscated our model considerably. Our primary assumption was that competitive inhibition would not come into play. We needed to think more abstractly. We asked a simple question: If measured catalytic values would not work in our suggested model, was there still a set of numbers that would work?

Machine Learning

Instead of using the measured K_cat/K_m, we randomized the catalytic matrix and tested the variants for prediction accuracy. One million variants later and we were starting to produce consistently better predictions.

We found that with decreasing aldehyde concentrations, our model accuracy would increase. This is most likely due to the inherent non-linearity of Michaelis Menten kinetics. We redesigned the experiment; this time we would vary aldehyde in the following concentrations: 0 µM, 12.5 µM, 25 µM, & 50 µM. We would also train on an incomplete set and test the best matrices on the full set. This mimics a concept from protein crystallography, R-Free, where several atoms are removed from a model and later used to test the model's accuracy. The matlab code we used to produce these results is provided here. The results were astonishing. With our best mutant matrix, we were able to predict aldehyde concentration with an average error of only 6.25 µM!

Olive Oil

With a working model, it was time for the ultimate test: Olive Oil

Nine samples of Extra Virgin Olive Oil were obtained and prepared for assay. The velocities were recorded with each enzyme for a total of 27 data points. We used the best catalytic matrix from our previous model and again inverted the matrix and multiplied by the observed velocity. The results are plotted below.