<!DOCTYPE html>
E. coli
Introduction
The economic value of our co-culture depends heavily on butanol production by E. coli, making it important to model its growth and butanol production. We used Flux Balance Analysis (FBA) to study the growth dynamics and butanol production in E.coli. We also aimed to optimize the strain by identifying gene targets for overexpression and knockouts in order to improve butanol yields using OptKnock and FSEOF.
Model Construction
We aimed to replicate in silico the KJK01 butanol-producing strain that we received from Dr. ‪Syed Shams Yazdani‬ at ICGEB, Delhi.
We used the iML1515 model from the BIGG database1 which is a metabolic flux model of E.coli MG1655 that contains 1516 genes. This model was the base strain that we further modified to replicate the KJK01 strain. All modifications were made using the COBRA Toolbox in MATLAB and all optimization problems were solved using the Gurobi solver2.
We began by deleting the reactions corresponding to the following genes deleted in the KJK01 strain3, - by setting the fluxes through those reactions to zero.
- pta - codes for an enzyme that catalyses the conversion of acetyl-CoA to acetate.
- adhE - codes for an enzyme that catalyses the conversion of acetyl-CoA to ethanol.
- frdA - codes for an enzyme that catalyzes the conversion of fumarate to succinate.
- ldhA - codes for an enzyme that catalyses the conversion of pyruvate to lactate.
These genes catalyze reactions that direct flux away from the butanol producing pathway and produce acetate, ethanol, succinate and lactate respectively, which impact butanol yields. Thus their deletion will reroute flux back into the butanol producing pathway and increase yields.
We then added reactions catalyzed by the heterologous genes expressed in E. coli in order to construct a butanol-producing pathway in the KJK01 strain. We added the reactions using the MetaCyc5 database and the KEGG6,7,8 database.
These were reactions catalyzed by the following genes:
- From Clostridium acetobutylicum:
- hbd - codes for beta-hydroxybutyryl-CoA dehydrogenase which converts acetoacetyl-coA to (S)-3-hydroxybutanoyl-CoA
- crt - codes for 3-hydroxy-butyryl-CoA dehydratase that converts (S)-3-hydroxybutanoyl-CoA to crotonyl-CoA
- adhE2 - codes for butyraldehyde/butanol dehydrogenase, which is a multifunctional enzyme with both alcohol dehydrogenase and acetaldehyde dehydrogenase activities.
- From Treponema denticola
- ter - codes for trans-enoyl-coenzyme A (CoA) reductase that converts acyl-CoA to trans-2,3-dehydroacyl-CoA
- From E. coli
- atoB - coding for acetyl-CoA acetyltransferase that converts acetyl-coA to acetoacetyl-coA.
The reactions corresponding to the genes in the model were taken from the EcoCyc database4.
We also added the reactions catalyzed by the CscA (invertase), CscB (sucrose permease), CscK (fructokinase) genes from E.coli W to allow the strain to be able to consume and metabolize sucrose9.
However, we noticed that some of the heterologous gene reactions corresponding to the hbd, crt, cscK, and atoB genes already existed in the base iML1515 strain, and upon consultation with our advisors, we decided to consider the flux through those reactions as overexpressions of the corresponding genes.
Modelling E. coli for Optimum Butanol production
After adding the relevant genes and reactions to our model, the next step was to enforce additional flux through the reactions corresponding to the genes hbd, crt, atoB and cscK. While these genes were added in KJK01, in a metabolic model, we do not add duplicate reactions but rather increase flux through the existing reaction. The issue here is that fluxes through individual reactions are usually not available in literature.
On suggestion from our Ph.D mentor, Indumathi, we used a computational way to create a model with suitable over expression levels. She suggested proceeding as follows:
- Add all the reactions necessary for butanol synthesis
- Make the appropriate deletions and set realistic bounds for our exchange reactions
- Optimize the model for growth and obtain the base value fluxes for all the reactions that were to be overexpressed
Some important fluxes are mentioned in the table below:
Reaction in the model corresponding to the reaction of the inserted geneBase flux valueHBD12.0665CRT 112.0665CRT 212.0665CSCK20Biomass0.2238Butanol exchange0 - We then developed a nested loop designed as follows:
- We force flux in the range 12 to 15 (with a step size of 0.5) through the hbd reaction
- For each flux value of hbd we had a loop forcing flux in the same range through the reaction CRT1
- For each flux value for CRT1 we had a loop forcing the same range of fluxes through the CRT 2 reaction
- For each flux value of CRT2 reaction, we had a loop forcing flux in the range 20 to 24 (for a step size of 1) through the cscK gene reaction
- We then optimize the model with growth as the objective function for each set of flux values and check for growth and butanol production
We thus generated a table (Supplementary material [ecoli_ovrxps_fluxes_v2]) of enforced flux distributions with their effects on growth and butanol production.
Figure 1: A graph showing the overexpression of the genes and the growth and butanol production resulting from the overexpression. Note that we have plotted hbd as a representative of the hbd, crt_2 and atoB reactions since they all have the same fluxes through them. CRT 1 is not shown as it does not directly participate in butanol synthesis. The selected data point has been pointed out. The flux values are as follows; hbd - 14, crt_1 - 12, crt_2 - 14, atoB - 14, cscK - 20, biomass - 0.136 , butanol flux - 2.
Out of the 12,005 expected data points, only 133 showed viable growth. We observed that the fluxes through the reactions of the genes hbd, crt-2, and atoB were the same whenever growth was observed, which is to be expected given that these fractions follow one after the other in the metabolic pathway. We set a cutoff of 60% for the biomass reaction and selected the datapoint showing the highest butanol production. This lead to the following flux distribution for the reactions to be overexpressed:
At this value of overexpression we observed a growth of 0.136 and butanol production of 2.
The final model with the overexpression fluxes is available here (Supplementary material)
Flux Variability Analysis for Overexpression Targets
We used flux variability analysis (FVA)11 to find the range of possible fluxes through the reactions of the crt, atoB, cscK genes and the butanol exchange reaction.
FVA can be used to find the minimum and maximum fluxes for any reaction for a particular state of the metabolic network. The FVA for these reactions were performed in anaerobic conditions, when growth was set to 60% of its optimal value.
Thus we can see what optimal overexpression fluxes are required to maximize butanol yields, even when growth is not optimal.
Results
Growth Rate v/s Sucrose
Fig 12: Relationship between sucrose uptake and growth rate.
Observations:
- Growth rate increases linearly with sucrose uptake rate
- Growth rate is higher under aerobic conditions for all sucrose uptake rates as compared to anaerobic conditions
Inferences
Growth rate increases linearly with sucrose uptake rate as expected, since the carbon flux from sucrose metabolism will be directed to the biomass reaction, leading to an increase in growth rate. This confirms that the CSCA, CSCB, CSCK reactions added to the model work in the metabolic network and the model E. coli can grow on sucrose as a carbon source.
Butanol Production vs Growth Rate
The objective function was first set to maximize the growth rate of the model. Butanol production was then set as the objective function for different percentages of the optimal growth rate at a sucrose uptake rate of 10 mmol/gDW/h to obtain the theoretical maximum yield of butanol if E.coli were growing at X% of its optimal growth rate.
Figure 3: Relationship between maximum butanol production and growth rate.
Observations:
- The maximal theoretical yield of butanol production decreases as growth rate increases
- Maximal butanol production is much higher for anaerobic conditions as compared to aerobic conditions
- Optimal growth rate (100% on the graph) leads to zero butanol production in both aerobic and anaerobic conditions
Inferences:
If E. coli grows at its optimal growth rate, butanol production is zero as would be expected, since it is a secondary metabolite that takes carbon flux away from the growth and has no role in the physiology of the cell as it is a heterologous compound in the E. coli metabolic network
As the growth rate is decreased from its optimal value, the maximal theoretical yield of butanol increases implying that carbon flux is rerouted to the butanol producing pathway in the network.
This confirms that the butanol production pathway in the model is functional. Butanol production happens under anaerobic or microaerobic conditions10. We wanted to thus test the effect of oxygenation on butanol production in the model, and as expected found that butanol production was significantly higher in anaerobic conditions as compared to aerobic conditions at all growth rates.
Butanol Production vs Sucrose Uptake
Figure 4: Relationship between maximum butanol production and sucrose uptake when growth is 60% of its optimal value
Observations:
- Maximum butanol production increases linearly with increasing sucrose uptake
- Maximum butanol production is higher for anaerobic conditions as compared to aerobic conditions at all sucrose uptake rates
Inferences
Increasing sucrose uptake rates lead to an increase in butanol production when butanol is the objective function. However we expect newly engineered strains that haven’t evolved for very long to optimize growth as an objective instead of butanol production. Thus there is a tradeoff between growth and butanol production, which we imitated by setting the growth rate of E.coli to 60% of the optimal growth rate and setting the objective function as butanol production. This shows that even at low growth rates, the extra flux from sucrose uptake can be used to increase butanol production over other metabolites in the model.
Butanol Production vs Oxygen Uptake
Our co-culture modelling results indicated that butanol production in the co-culture is optimum i microaerobic conditions as compared to anaerobic conditions and we wanted to test whether that would hold true for a monoculture as well, by testing the maximum butanol production at varying oxygen uptake rates, when growth was set to 60% of the optimal value
Figure 5: Relationship between maximum butanol production and oxygen uptake when growth is 60% of its optimal value
Observations:
- As oxygen uptake rates increase, maximum butanol production decreases
Inferences
In a mono-culture, there is a slight decrease in maximal butanol production in microaerobic as compared to anaerobic conditions
Side Products
Abdelaal el. al3 observed pyruvate, ethanol and butyrate to be the major side products in the KJK01 butanol producing strain, which divert flux away from the butanol production pathway and reduce yields. Pyruvate was the major side product which indicated a redox imbalance, while ethanol was produced in equal quantities even after the deletion of the endogenous alcohol dehydrogenase gene (adhE).
We wanted to estimate the maximal side product formation at different growth levels in anaerobic and aerobic conditions and see whether our results matched with those reported by Abdelaal el. al3.
We used flux variability analysis (FVA)11 to find the maximum possible fluxes through the side product reactions under a growth rate constraint. FVA can be used to find the minimum and maximum fluxes for any reaction for a particular state of the metabolic network. For example, it can be used to find the minimum and maximum fluxes for a reaction such that X% of the optimal growth rate is maintained.
We obtained the optimal growth rates for aerobic and anaerobic conditions and for a particular percentage of that growth rate, FVA was performed for the exchange reactions of butanol, ethanol, pyruvate, butyrate and sucrose uptake to find the maximal fluxes through those reactions. The minimum fluxes for these reactions will be zero since all the carbon flux will be directed toward growth.
Fig 13: Maximal side product formation as a function of growth in anaerobic conditions
Fig 14: Maximal side product formation as a function of growth in aerobic conditions
Observations:
- In anaerobic and aerobic conditions, pyruvate production is consistently higher than the other side products
- Butyrate seems to be almost equally produced with butanol (particularly in aerobic conditions) whereas in both anaerobic and aerobic conditions, ethanol production is zero
- Sucrose consumption in both anaerobic and aerobic conditions is quite similar
Inferences
Consistent with the observations of Abdelaal el. al3, pyruvate was a major byproduct which indicates a redox imbalance in the network. Ethanol was zero, which is to be expected since there is only one gene (adhE) that produces ethanol from acetyl-coA, whose flux we set to zero to duplicate the KJK01 strain knockout of adhE. Given that the flux through pyruvate and butyrate production is equal or greater than that of butanol, these present alternate possible targets to increase butanol production
OptKnock
We ran OptKnock on our base model without the knockouts but with the overexpressions to identify the possible knockouts that could help increase butanol production. Our constructed model had 4 knockouts, so for OptKnock, we set the total number of knockouts to 5. Since the metabolic pathway for butanol production is linked to the TCA cycle in E. coli, we decided to look for knockouts in this cycle. Thus, we looked for knockouts in the glycolytic pathway plus the TCA cycle. We were hoping to find the same deletions as in our constructed model, plus some additional ones. We selected the top 5 knockouts. The results are summarized in the table below
Optknock Table
We observed that all the deletions (except ‘ACLD19’) we made in our constructed model showed up in some or the other combination of the OptKnock results. We observed a growth rate of 0.17 and butanol production of 18.6 in our suggested deletions. Furthermore, D-lactate dehydrogenase (LDH_D) and Triose-phosphate isomerase (TPI) were suggested in all the top 5 OptKnock deletions. While the suggested reactions have been knocked out in our constructed model, TPI is a new deletion suggested by OptKnock. TPI catalyzes the conversion of Dihydroxyacetone phosphate (DHAP) to 3-Phosphoglyceraldehyde (PGAL). It isn’t clear how this helps in butanol production yet. Experimental validation might be needed.
FSEOF
We ran FSEOF on our constructed model without the overexpressions to identify reactions whose fluxes increased with increase in the enforced flux through the butanol exchange reactions. The fluxes through a total of 65 reactions showed a strict increase with increase in enforced butanol flux. Some of the reactions showing large increase in flux include proton transfer, CO2 exchange, etc. but also reactions of genes like pyruvate dehydrogenase, pyruvate kinase, Acyl-CoA dehydrogenase, 3-hydroxyacylCoA dehydratase, 3-hydroxyacylCoA dehydrogenase. Apart from the transport and exchange reactions of butanol, we did not observe any increase in flux in the reactions added by us. The full list of reactions is available
References
- King, Z. A., Lu, J., Dräger, A., Miller, P., Federowicz, S., Lerman, J. A., ... & Lewis, N. E. (2016). BiGG Models: A platform for integrating, standardizing and sharing genome-scale models. Nucleic acids research, 44(D1), D515-D522.
- Gurobi Optimization, L.. (2021). Gurobi Optimizer Reference Manual.
- Abdelaal, A. S., Jawed, K., & Yazdani, S. S. (2019). CRISPR/Cas9-mediated engineering of Escherichia coli for n-butanol production from xylose in defined medium. Journal of Industrial Microbiology and Biotechnology, 46(7), 965-975.
- Keseler, I. M., Collado-Vides, J., Santos-Zavaleta, A., Peralta-Gil, M., Gama-Castro, S., Muñiz-Rascado, L., ... & Karp, P. D. (2010). EcoCyc: a comprehensive database of Escherichia coli biology. Nucleic acids research, 39(suppl_1), D583-D590.
- Caspi, R., Altman, T., Billington, R., Dreher, K., Foerster, H., Fulcher, C. A., ... & Karp, P. D. (2014). The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic acids research, 42(D1), D459-D471.
- Kanehisa, M., & Goto, S. (2000). KEGG: kyoto encyclopedia of genes and genomes. Nucleic acids research, 28(1), 27-30.
- Kanehisa, M. (2019). Toward understanding the origin and evolution of cellular organisms. Protein Science, 28(11), 1947-1951.
- Kanehisa, M., Furumichi, M., Sato, Y., Ishiguro-Watanabe, M., & Tanabe, M. (2021). KEGG: integrating viruses and cellular organisms. Nucleic acids research, 49(D1), D545-D551.
- Bruschi, M., Boyes, S. J., Sugiarto, H., Nielsen, L. K., & Vickers, C. E. (2012). A transferable sucrose utilization approach for non-sucrose-utilizing Escherichia coli strains. Biotechnology advances, 30(5), 1001-1010.
- Wu, P., Wang, G., Wang, G., Børresen, B. T., Liu, H., & Zhang, J. (2016). Butanol production under microaerobic conditions with a symbiotic system of Clostridium acetobutylicum and Bacillus cereus. Microbial cell factories, 15(1), 1-11.
- Mahadevan, R., & Schilling, C. H. (2003). The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metabolic engineering, 5(4), 264-276.
Team IISER Pune India