|
|
(14 intermediate revisions not shown) |
Line 4: |
Line 4: |
| <h>Growth curve & pH model</h> | | <h>Growth curve & pH model</h> |
| <h1>Purpose</h1> | | <h1>Purpose</h1> |
- | <h1>Introduction: Growth Curve</h1> | + | <div class='abstract mdl'> |
- | <p>A growth curve for a population of bacteria illustrates some of the dynamics that affect the population size over time.<br>Four distinct phases are recognized:<br><br> | + | <ul> |
- | <b>The lag phase:</b><br> | + | <li>To figure out the suitable model for <i>S. mutans</i> growth curve of our wet lab data, so that we would have some criteria in choosing the model for our next model, <a href='/Team:NYMU-Taipei/modeling/m2'><b>the competition model</b></a>.</li> |
- | The curve remains at a plateau. During this time, bacteria adapt to their new environment, store nutrients and prepare for binary fission.<br><br> | + | <li>To know how the population size of <i>S. mutans</i> affects pH value, which is also derived from our wet lab data, and therefore we can estimate the pH value in our next model, the competition model.</li> |
- | <b>The logarithmic phase:</b><br> | + | </ul> |
- | This phase is also called the exponential growth phase: the population of bacteria enters an active stage of growth, the mass of each cell increases rapidly, and the number of bacteria doubles.<br><br> | + | </div> |
- | <b>The stationary phase:</b><br> | + | <div class='cont-panel mdl-panel'> |
- | At this stage reproductive and death rates equalize, the population enters another plateau.<br><br></p> | + | <div href='#2c1-1'><p>Background</p></div> |
- | <h1>Models and mathematic equations</h1> | + | <div href='#2c1-2'><p>Equations</p></div> |
- | <p>Three well known growth models (Logistic, Gompertz, and Richartz)are used in this work. Characteristic model parameters (such as lag phase (λ), maximal growth rate (µ-max slope), stationary phase (A-max growth value)) are derived from our experimental data. Bootstrap and cross-validation techniques are used for estimating confidence intervals of | + | <div href='#2c1-3'><p>Result</p></div> |
- | all derived parameters.<br><br> | + | <div href='#2c1-4'><p>Reference</p></div> |
- | The aim is to integrate the experimental data into different growth models and to compare the models using statistical methods (AIC and maximum likelihood were used). We believe, and many scientists do, that model selection is the most important part in model-experiment based research-<i>The right data with the right model</i>.</p><br> | + | <div style="display:inline-block; width: 640px; height: 0.1px; border: none; margin: 0px"></div> |
| + | </div> |
| + | <div class='article modeling indent'> |
| + | <h1 id='2c1-1'>Background: Growth Curve</h1> |
| + | <p>A growth curve for a population of bacteria illustrates some of the dynamics that affect the population size over time.</p> |
| + | <h3>Four distinct phases are recognized:</h3> |
| + | <h3>The lag phase:</h3> |
| + | <p class='indent'>The curve remains at a plateau. During this time, bacteria adapt to their new environment, store nutrients and prepare for binary fission.</p> |
| + | <h3>The logarithmic phase:</h3> |
| + | <p class='indent'>This phase is also called the exponential growth phase: the population of bacteria enters an active stage of growth, the mass of each cell increases rapidly, and the number of bacteria doubles.</p> |
| + | <h3>The stationary phase:</h3> |
| + | <p class='indent'>At this stage reproductive and death rates equalize, the population enters another plateau.<br><br></p> |
| + | </div> |
| + | <div class="article modeling"> |
| + | <h1 id='2c1-2'>Models and mathematic equations</h1> |
| + | <p class='indent'><b>Three well known growth models (Logistic, Gompertz, and Richartz)</b>are used in this work. Characteristic model parameters (such as lag phase (λ), maximal growth rate (µ-max slope), stationary phase (A-max growth value)) are derived from our experimental data. Bootstrap and cross-validation techniques are used for estimating confidence intervals of |
| + | all derived parameters.</p> |
| + | <p class='indent'>The aim is to integrate the experimental data into different growth models and to compare the models using statistical methods (AIC and maximum likelihood were used). We believe, and many scientists do, that model selection is the most important part <b>in model-experiment based research-<i>The right data with the right model</i>.</b></p> |
| <p> | | <p> |
| <b>Logistic Model:</b> | | <b>Logistic Model:</b> |
Line 31: |
Line 48: |
| $\nu$ is a shape parameter (in the richartz model only) | | $\nu$ is a shape parameter (in the richartz model only) |
| </p> | | </p> |
- | <h1>pH model</h1> | + | <h3>pH model</h3> |
| <p>Existed model is | | <p>Existed model is |
| $$\begin{align} | | $$\begin{align} |
| \frac{dpH}{dP}&=k(pH-pH_{min}) | | \frac{dpH}{dP}&=k(pH-pH_{min}) |
- | \end{align}$$ | + | \end{align}$$</p> |
- | However, this model is not appropriate for our experimental data; we used linear regression model instead. | + | <p>However, this model is not appropriate for our experimental data; we used linear regression model instead.</p> |
| $$pH=\alpha + \beta OD +\epsilon$$ | | $$pH=\alpha + \beta OD +\epsilon$$ |
- | where $\epsilon$ is a random error or noice (which helps to capture a measurement error and other unknown factors). And it is assumed to be Gaussian (normal distribution function), $\epsilon = N(\theta,\sigma^{2})$, with mean $\theta$ and constant variance $\sigma^{2}$. $\alpha$ and $\beta$ are model parameters. | + | <p>where $\epsilon$ is a random error or noice (which helps to capture a measurement error and other unknown factors). And it is assumed to be Gaussian (normal distribution function), $\epsilon = N(\theta,\sigma^{2})$, with mean $\theta$ and constant variance $\sigma^{2}$. $\alpha$ and $\beta$ are model parameters. </p> |
- | </p>
| + | </div> |
- | <h1>Result and model validation</h1> | + | <div class='article modeling indent'> |
- | <img src="/wiki/images/a/af/NYMU14_model1_simulGrowth.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 1: Simulation results of the growth curve</p><br> | + | <h1 id='2c1-3'>Result and model validation</h1> |
- | <p>Our experimental data are implemented in the three proposed models. AIC is used to measure the performance of the models and the result shows that Logistic and Richartz models are approximately the same, but slightly different from Gompertz. However, using 95% confidence interval all of them are appropriate to fit the given data.<br><br> | + | <img src="/wiki/images/a/af/NYMU14_model1_simulGrowth.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 1: Simulation results of the growth curve</p> |
- | <img src="/wiki/images/5/52/NYMU14_model1_modelValidation.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 2: Shows validation test(t*) for the fitted model</p><br> | + | <p>Our experimental data are implemented in the three proposed models. AIC is used to measure the performance of the models and the result shows that <b>Logistic</b> and Richartz models are approximately the same, but slightly different from Gompertz. However, using 95% confidence interval all of them are appropriate to fit the given data.</p><br> |
- | <p>The validation test (plot of the residuals), as Figure 2, shows that the simulation result obtained from bootstrap samples is suitable to estimate the model parameters.<br><br> | + | <img src="/wiki/images/5/52/NYMU14_model1_modelValidation.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 2: Shows validation test(t*) for the fitted model</p> |
- | Using our experimental data as initials samples, we applied Bootsrap statistical to sample empirical data. The simulated result is presented as Figure (1). And, the estimated model parameters (mu(µ), lamda(λ), A) are summarized as Table 1. | + | <p>The validation test (plot of the residuals), as Figure 2, shows that the simulation result obtained from bootstrap samples is suitable to estimate the model parameters.</p> |
- | $$\text{Table 1: Estimated Model Parameter}$$ | + | <p>Using our experimental data as initials samples, we applied Bootsrap statistical to sample empirical data. The simulated result is presented as Figure (1). And, the estimated model parameters (mu(µ), lamda(λ), A) are summarized as Table 1.</p> |
- | </p> | + | <p class='Jaxcenter'>$$\text{Table 1: Estimated Model Parameter}$$</p> |
| <pre> | | <pre> |
| ------------------------------------------- | | ------------------------------------------- |
Line 63: |
Line 80: |
| From the result on table $1$, the $lower$ and $upper$ values are the estimated confidence intervals of the corresponding parameters. $Std$-standard deviation) and $mean$-average values are obtained from the bootstrap samples. These results are used in the proposed three growth models. After the approprate models are selected (in our case, the three proposed models are equally approprate), the parameters are taken as an intial values for the interaction model; which is used to study the growth of S.mutant in the presence of other species(see <a href='/Team:NYMU-Taipei/modeling/m2'><b>competition model</b></a>). | | From the result on table $1$, the $lower$ and $upper$ values are the estimated confidence intervals of the corresponding parameters. $Std$-standard deviation) and $mean$-average values are obtained from the bootstrap samples. These results are used in the proposed three growth models. After the approprate models are selected (in our case, the three proposed models are equally approprate), the parameters are taken as an intial values for the interaction model; which is used to study the growth of S.mutant in the presence of other species(see <a href='/Team:NYMU-Taipei/modeling/m2'><b>competition model</b></a>). |
| </p> | | </p> |
- | <p>$$\text{Table 2: Analysis of Regression Model}$$</p> | + | <p class='Jaxcenter'>$$\text{Table 2: Analysis of Regression Model}$$</p> |
| <pre> | | <pre> |
| ============================================================== | | ============================================================== |
Line 73: |
Line 90: |
| =============================================================== | | =============================================================== |
| </pre> | | </pre> |
- | <p>Using the table 2 result, the pH model can be estimated as: | + | <p>Using the table 2 result, the pH model can be estimated as: $pH=7.61 - 2.04 OD$</p> |
- | $$pH=7.61 - 2.04 OD$$
| + | <p>There is a strong linear correlation between pH and OD (R=0.9135, F=254.5, P.value $<$0.05). The coefficient value indicates that for every additional unit in OD we can expect pH to decrease by an average of 2.04. For examples: if the OD=1, pH is expected to be (pH=7.61 - 2.04(1))=5.57. The red fitted line graphically shows the same information.</p><br> |
- | There is a strong linear correlation between pH and OD (R=0.9135, F=254.5, P.value $<$0.05). The coefficient value indicates that for every additional unit in OD we can expect pH to decrease by an average of 2.04. For examples: if the OD=1, pH is expected to be (pH=7.61 - 2.04(1))=5.57. The red fitted line graphically shows the same information.</p><br> | + | |
| <img src='/wiki/images/2/25/NYMU14_model1_FittedModels.png' style="display: block;width: 700px;margin: 0 auto;"><p style=" text-align: center; ">Figure 3: Shows model comparison-the shaded regions indicate a 95% confiden interval of the model fits (bold lines)</p><br> | | <img src='/wiki/images/2/25/NYMU14_model1_FittedModels.png' style="display: block;width: 700px;margin: 0 auto;"><p style=" text-align: center; ">Figure 3: Shows model comparison-the shaded regions indicate a 95% confiden interval of the model fits (bold lines)</p><br> |
| <p>If we move left or right along the x-axis by an amount that represents a one unit change in OD, the fitted line falls by 2.04 units. If the fitted line was flat (a slope coefficient of zero), the expected value for pH would not change no matter how far up and down the line you go. So, the low p-value ($\leq$ 0.05) suggests that the slope is not zero, which in turn suggests that changes in the predictor variable(OD) are associated with changes in the response variable (pH). <i>R-squared</i> is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination. It is the percentage of the response variable variation that is explained by a linear model. In this case, R-square =0.9135 shows that 91.35% of the pH variation is explained by OD, and the remain 8.65 can be explained by other factors that are not considered in the regression model. </p> | | <p>If we move left or right along the x-axis by an amount that represents a one unit change in OD, the fitted line falls by 2.04 units. If the fitted line was flat (a slope coefficient of zero), the expected value for pH would not change no matter how far up and down the line you go. So, the low p-value ($\leq$ 0.05) suggests that the slope is not zero, which in turn suggests that changes in the predictor variable(OD) are associated with changes in the response variable (pH). <i>R-squared</i> is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination. It is the percentage of the response variable variation that is explained by a linear model. In this case, R-square =0.9135 shows that 91.35% of the pH variation is explained by OD, and the remain 8.65 can be explained by other factors that are not considered in the regression model. </p> |
Line 81: |
Line 97: |
| <img src="/wiki/images/1/11/NYMU14_model1_regression.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 5: Model Fit</p><br> | | <img src="/wiki/images/1/11/NYMU14_model1_regression.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 5: Model Fit</p><br> |
| <img src="/wiki/images/4/43/NYMU14_model1_residuals.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 6: Validation Test</p><br> | | <img src="/wiki/images/4/43/NYMU14_model1_residuals.png" style="display: block;width: 500px;margin: 0 auto;"><p style=" text-align: center; ">Figure 6: Validation Test</p><br> |
- | <h1>Conclusion</h1>
| + | </div> |
- | <h1>Reference</h1>
| + | |
- | | + | |
| </div> | | </div> |
| </html> | | </html> |
| {{:Team:NYMU-Taipei/NYMU14_Footer}} | | {{:Team:NYMU-Taipei/NYMU14_Footer}} |
Growth curve & pH model
Purpose
- To figure out the suitable model for S. mutans growth curve of our wet lab data, so that we would have some criteria in choosing the model for our next model, the competition model.
- To know how the population size of S. mutans affects pH value, which is also derived from our wet lab data, and therefore we can estimate the pH value in our next model, the competition model.
Background: Growth Curve
A growth curve for a population of bacteria illustrates some of the dynamics that affect the population size over time.
Four distinct phases are recognized:
The lag phase:
The curve remains at a plateau. During this time, bacteria adapt to their new environment, store nutrients and prepare for binary fission.
The logarithmic phase:
This phase is also called the exponential growth phase: the population of bacteria enters an active stage of growth, the mass of each cell increases rapidly, and the number of bacteria doubles.
The stationary phase:
At this stage reproductive and death rates equalize, the population enters another plateau.
Models and mathematic equations
Three well known growth models (Logistic, Gompertz, and Richartz)are used in this work. Characteristic model parameters (such as lag phase (λ), maximal growth rate (µ-max slope), stationary phase (A-max growth value)) are derived from our experimental data. Bootstrap and cross-validation techniques are used for estimating confidence intervals of
all derived parameters.
The aim is to integrate the experimental data into different growth models and to compare the models using statistical methods (AIC and maximum likelihood were used). We believe, and many scientists do, that model selection is the most important part in model-experiment based research-The right data with the right model.
Logistic Model:
$$\begin{align}
y(t)&=\frac{A}{1+\exp\left(\frac{4\mu}{A}(\lambda-t)+2\right)}
\end{align}$$
Gompertz Model:
$$\begin{align}
y(t)&=A.\exp\left[-\exp\left(\frac{\mu.\exp(1)}{A}(\lambda-t)+1\right)\right]
\end{align}$$
Richartz Model:
\begin{align}
y(t)&=A.\left[1+\nu\exp\left(1+\nu+\frac{\mu}{A}(1+\nu)^{1+1/\nu}(\lambda-t)\right)\right]^{-1/\nu}
\end{align}
$\nu$ is a shape parameter (in the richartz model only)
pH model
Existed model is
$$\begin{align}
\frac{dpH}{dP}&=k(pH-pH_{min})
\end{align}$$
However, this model is not appropriate for our experimental data; we used linear regression model instead.
$$pH=\alpha + \beta OD +\epsilon$$
where $\epsilon$ is a random error or noice (which helps to capture a measurement error and other unknown factors). And it is assumed to be Gaussian (normal distribution function), $\epsilon = N(\theta,\sigma^{2})$, with mean $\theta$ and constant variance $\sigma^{2}$. $\alpha$ and $\beta$ are model parameters.
Result and model validation
Figure 1: Simulation results of the growth curve
Our experimental data are implemented in the three proposed models. AIC is used to measure the performance of the models and the result shows that Logistic and Richartz models are approximately the same, but slightly different from Gompertz. However, using 95% confidence interval all of them are appropriate to fit the given data.
Figure 2: Shows validation test(t*) for the fitted model
The validation test (plot of the residuals), as Figure 2, shows that the simulation result obtained from bootstrap samples is suitable to estimate the model parameters.
Using our experimental data as initials samples, we applied Bootsrap statistical to sample empirical data. The simulated result is presented as Figure (1). And, the estimated model parameters (mu(µ), lamda(λ), A) are summarized as Table 1.
$$\text{Table 1: Estimated Model Parameter}$$
-------------------------------------------
mu lamda A
===========================================
Lower 0.294 16.041 1.155
Mean 0.384 16.456 1.187
Upper 0.474 16.871 1.220
Std 0.046 0.212 0.016
-------------------------------------------
Integral value: 16.622
-------------------------------------------
From the result on table $1$, the $lower$ and $upper$ values are the estimated confidence intervals of the corresponding parameters. $Std$-standard deviation) and $mean$-average values are obtained from the bootstrap samples. These results are used in the proposed three growth models. After the approprate models are selected (in our case, the three proposed models are equally approprate), the parameters are taken as an intial values for the interaction model; which is used to study the growth of S.mutant in the presence of other species(see competition model).
$$\text{Table 2: Analysis of Regression Model}$$
==============================================================
Estimated parametrs: | Statistical Tests:
Intercept (alpha)=7.6074 | R-squared= 0.9135
OD (beta)=-2.0365 | p-value=6.228e-14
--------------------------------------------------------------
F-statistic=254.6
===============================================================
Using the table 2 result, the pH model can be estimated as: $pH=7.61 - 2.04 OD$
There is a strong linear correlation between pH and OD (R=0.9135, F=254.5, P.value $<$0.05). The coefficient value indicates that for every additional unit in OD we can expect pH to decrease by an average of 2.04. For examples: if the OD=1, pH is expected to be (pH=7.61 - 2.04(1))=5.57. The red fitted line graphically shows the same information.
Figure 3: Shows model comparison-the shaded regions indicate a 95% confiden interval of the model fits (bold lines)
If we move left or right along the x-axis by an amount that represents a one unit change in OD, the fitted line falls by 2.04 units. If the fitted line was flat (a slope coefficient of zero), the expected value for pH would not change no matter how far up and down the line you go. So, the low p-value ($\leq$ 0.05) suggests that the slope is not zero, which in turn suggests that changes in the predictor variable(OD) are associated with changes in the response variable (pH). R-squared is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination. It is the percentage of the response variable variation that is explained by a linear model. In this case, R-square =0.9135 shows that 91.35% of the pH variation is explained by OD, and the remain 8.65 can be explained by other factors that are not considered in the regression model.
Figure 4: ph vs OD with 95% confident interval
Figure 5: Model Fit
Figure 6: Validation Test