Measurement
Our main motivation to attempt to do this model was because we couldn´t find in literature the promoter strength of Pnrd, which plays an essential role in our modeling. However, data about its fluorescence is available, so we figured out a way in which we can translate this fluorescence into real promoter strength, which will be applicable to all promoters
Following the central dogma of molecular biology, we can write down the following pair of ODE ́s, describing the process of synthesis and degradation of a fluorescent protein:
\[ \frac{d[mRNA]}{dt} = K - \gamma_1 [mRNA] \]
\[ \frac{d[FP]}{dt} = \alpha [mRNA] - \gamma_2 [FP] \]
Where \(K\) is the promoter strength, \(\gamma_{1}\) the mRNA degradation constant, \(\alpha\) the protein synthesizing constant and \(\gamma_{2}\) the protein degradation constant.
Solving the system, we get the equation for fluorescent protein in time:
\[ FP(t) = \frac{-\alpha c_1}{(\gamma_2 - \gamma_1)\gamma_1}e^{- \gamma_1t} + c_2e^{- \gamma_2t} + \frac{\alpha K}{\gamma_1 \gamma_2} \]
From the equation above, we can see that it consists of two decreasing exponential functions and one constant term, therefore, we can take the limit when the time tends to infinity to predict what will be the maximum fluorescence. The two decreasing exponential functions will cancel out as they tend to zero, giving that the maximum fluorescence is completely determined by the constant term:
\[ FP_{max} (t) = \frac{\alpha K}{\gamma_1 \gamma_2} \]
Here we present the process of obtaining and validating the data with our model.
Robustness of the method
For the Pnrd promoter, we also used a plot from igem parts, as seen in the figure below. The link to iGEM parts page is: http://parts.igem.org/Part:BBa_K2070012
As seen in the table below, we extracted 5 points from the experimental data from the plot above.
Time(minutes) | Fluorescence |
---|---|
0 | 25000 |
18 | 26500 |
45 | 27500 |
63 | 28500 |
81 | 31000 |
As we can see from the general equation, we only need 3 points to calculate the 3 unknown coefficients \(c_1\),\(c_2\) and \(K\) of the formula, with special interest in K which is the promoter strength. To check reproducibility and statistical significance, we made all the possible combinations of 3 points in a set of 5, which give us 10 different scenarios.
Combination | Value(protein/s) |
---|---|
1 | 0.0022 |
2 | 0.0022 |
3 | 0.0024 |
4 | 0.0023 |
5 | 0.0025 |
6 | 0.0026 |
7 | 0.0023 |
8 | 0.0025 |
9 | 0.0026 |
10 | 0.0027 |
Mean | 0.002424 |
Standard Deviation | 0.000181 |
Table 6: Results of the value of promoter strength for Pnrd, for all the possible 3 combinations for 5 data points.
As we can see from the table above, a very low standard deviation was obtained, at least one order of magnitude lower than the mean, which means that the promoter strength obtained through all the different combinations was very homogeneous, suggesting that indeed the experimental data fit properly to the model proposed and that it would be okay if we only use 3 points to calculate the promoter strength, without fear of doing a bad generalization.
Relative Strength Comparison
Now we are going to try to prove the model´s applicability by comparing the outputs
given by the method with real lab data.
As a first approach, we built upon the work of iGEM Tacoma team of 2019.
They collected characterization data on the relative RFP expression rates of
Anderson Promoters Bba_J23100, Bba_J23101, Bba_J23102, Bba_J23105, Bba_J23106, and
Bba_J23112 in the RFP expression plasmid Bba_J61002 (samples provided in the 2019
iGEM Distribution Kit).
Their protocols were adapted from the iGEM 2018 Interlab protocol.
As the original characterization of the Anderson promoters and this study
performed by the Tacoma iGEM team were done using the red fluorescent protein(RFP),
we adapted our model with the new parameters corresponding to this protein, as
previously we just took into account the green fluorescent protein for the method.
The fluorescence in these experimental results was measured after 0, 3 and 5 hours
of growth.
In the table below we summarized the results obtained as well as the
relative strength of the promoters compared to J23100.
Promoter | Promoter Strength(protein/s) | Relative Strength |
---|---|---|
\(J23100\) | 0.0263 | 1 |
\(J23102\) | 0.0163 | 0.62 |
\(J23101\) | 0.0166 | 0.63 |
\(J23106\) | 0.0128 | 0.49 |
\(J23105\) | 0.0104 | 0.40 |
\(J23112\) | 0.0073 | 0.28 |
In the table below we compare our results against the original characterization from the Anderson Lab.
Promoter | Anderson Lab | Relative Strength Predicted | %Error |
---|---|---|---|
1 | 1 | 1 | 0 |
2 | 0.86 | 0.62 | 27.9 |
3 | 0.70 | 0.63 | 10 |
4 | 0.47 | 0.49 | 4.3 |
5 | 0.24 | 0.40 | 66.7 |
6 | 0.00 | 0.28 | 100 |
It is important to notice that(as seen in the table above) the comparison against
BBa_J23100 is consistent with the original Anderson characterization, except for
position 2 and 3, where it is supposed that the promoter J23102 is stronger than
J23101. A possible error that could explain this difference is that the OD660 values
were still increasing from 3 hours and 5 hours, which indicates the cultures were
still growing.
Allowing the cultures to grow further would allow a better determination regarding
the difference in promoter strength of BBa_J23101 and BBa_J23102, as Anderson
promoters were measured until they reached saturation.
Also the relative values obtained by our method present a good consensus
with the 3 data points in time the iGEM Tacoma team reported.
Direct Comparison
We tried to make a direct comparison between values of promoter strength predicted
by our model and the corresponding reported literature values, however, we have
failed until now mainly because of lack of biological units on fluorescence data(the
vast majority is reported in arbitrary units) and the great variance that exists
between labs because of lack of standardized methods of measurement and calibration.
We have tried to compare the values obtained by our method of promoter
J23101 and J23106(precisely were the ones which presented the least percentage
relative error) against the values in biological units established in the iGEM
Interlab study, which also take into account these two promoters. We have data in
arbitrary fluorescence units of these promoters, as well as their optical density,
however we have failed or don´t know how to convert that into the proposed units of
Molecules of Equivalent Fluorescein(MEFL)/cell in order to be able to compare them.
Other sources of possible error in our method and things to improve:
- Our equation describes fluorescent protein, and data available and used to calculate promoter strength is fluorescence, so there is a slight mismatch. It is known that the relation between the fluorescent protein and its fluorescence is linear, however this has to be measured and computed by the lab that performed the measurements.
- It is important to take into account and report the RBS strength of the RBS used in the experiments.
- It is very important to reach an agreement about what biological unit to use to report promoter strength. We propose the unit of Molecules of Equivalent Fluorescein(MEFL)/cell, as proposed in the iGEM Interlab study.
- Calibration methods and materials should be reported along with fluorescence data, because it describes the conditions in which the experiment was performed and this data could be useful in the comparison process.
Lastly, we created a software tool to translate fluorescence into promoter strength
in order that other iGEM teams could use it, as seen in the figure below. The app
can receive data in GFP or RFP, but we plan to include other fluorescent proteins.
You can download it on our Github page, it is named “Promoter_Strength.mlapp”.
Please refer to the next PDF document in order to see the detailed
mathematical derivation of the technique as well as more validations that we did.