Team:UC Davis/Signal Test

From 2014.igem.org

UC Davis iGEM 2014

Mathematical Approach

Mathematical Approach

Testing Our Model

Testing Our Model

Olive Oil

Olive Oil

Combinations

To test our model we built a combinatorial set of aldehydes. We chose three representative aldehydes from each bin, saturated medium, saturated long, and unsaturated. The three aldehydes were chosen such that the enzyme's response to each would best represent their respective groups. We created a total of 64 different combinations by mixing Pentanal, Decanal, and E-2-Decenal in four different values, 0 µM, 10µM, 100µM, & 1000µM. Three combinatorial well plates were made and mixed with each enzyme separately. The observed velocity is recorded in each well.




The catalytic matrix is inverted and multiplied by the observed velocity in each well and out comes our predicted concentrations!



In high concentrations, we found that aldehydes would crash out of solution. We chose to focus on only the data set below 1000 µM. A more serious issue was brought to light however. The observed velocities from the combinatorial plates suggested competitive inhibition was occurring when E-2-Decenal was present in solution. This obfuscated our model considerably. Our primary assumption was that competitive inhibition would not come into play. We needed to think more abstractly. We asked a simple question: If measured catalytic values would not work in our suggested model, was there still a set of numbers that would work?

Machine Learning

Instead of using the measured Kcat/Km, we randomized the catalytic matrix and tested the variants for prediction accuracy. One million variants later and we were starting to produce consistently better predictions.



We found that with decreasing aldehyde concentrations, our model accuracy would increase. This is most likely due to the inherent non-linearity of Michaelis Menten kinetics. We redesigned the experiment; this time we would vary aldehyde in the following concentrations: 0 µM, 12.5 µM, 25 µM, & 50 µM. We would also train on an incomplete set and test the best matrices on the full set. This mimics a concept from protein crystallography, R-Free, where several atoms are removed from a model and later used to test the model's accuracy. The matlab code we used to produce these results is provided here. The results were astonishing. With our best mutant matrix, we were able to predict aldehyde concentration with an average error of only 6.25 µM!


To see how our model preformed in olive oil, click here.