On this page the main findings and interpretations of the results of our project can be found. In our project we aimed to achieve four different goals. These four proof of concepts are summarized in Figure 1.

Main proof of concepts that we evidenced during our experiments in the wet and dry lab.

Ammonia as the only nitrogen source

As we plan on feeding our GMO with ammonia extracted from the MOF, an essential part of our project is that our GMO should be able to grow in the presence of ammonia as its only nitrogen source. Therefore, we grew 6 different Saccharomyces strains under different ammonia concentrations to see if the presence of ammonia would affect their growth. In the end we were able to identify the best performing strains. The protocol can be found in the Experiments page.

Growth under different ammonia concentrations of different wild type yeast (Saccharomyces cerevisiae and Saccharomyces paradoxus) strains. The strains tested are a. sYB76 (NCYC3597, S. cerevisiae) b. sYB77 (NCYC3566, S. cerevisiae) c. sYB78 (NCYC3598, S. cerevisiae) d. sYB79 (NCYC3635, S. paradoxus) e. sYB83 (NCYC3647, S. paradoxus) f. sYB85 (NCYC3687, S. paradoxus) g. Control (no cells)

As can be seen from Figure 2, there are 4 strains that outperformed in the experiment. These strains were chosen for further experimentation. The doubling time of the chosen strains can be seen in Table 1.

Doubling time (TD) for each strain in different ammonia concentrations in the exponential phase.

Strain TD (min)
0.05 g/L 5 g/L 7.5 g/L
ySB76 51,71 70,71 69,3
ySB77 130,75 90 60,78
ySB78 101,91 94,93 88,85
ySB85 103,43 91,18 108,28

We learned that two of the strains that we were planning to use didn’t perform well when ammonia was the only source of nitrogen. This experiment helped us to exclude them for further tests as they weren’t suitable for our purpose (see Description page). Four strains proved to grow in different concentrations of ammonia, with strain ySB76 (S. cerevisiae) being the fastest grower overall. We could not observe a clear tendency that would suggest the optimal ammonia concentration for growing the cells. This result is relevant for our project, since the concentration of this nutrient doesn't necessarily need to be kept constant, which allows for more flexibility in ammonia concentration of the cell culture and thus more flexibility in how much ammonia the MOF should be able to capture. As we wanted to test different chassis in the optimization process, after this experiment we decided to express alpha amylase in yYS76, yYB77, ySB78 and ySB85, which correspond with 3 S. cerevisiae strains and 1 S. paradoxus strain.

Engineering success

For our second proof of concept, we wanted to demonstrate that it is possible to use Golden Gate Assembly to clone heterologous genes for alpha-amylase in Saccharomyces spp. Going from the gene of interest to the cassette plasmid expressed in Saccharomyces spp. is a long process. Throughout it there were some checkpoints that we used to confirm the success of the cloning: GFP screening; DNA concentration measurements with a Nanodrop; basic parts sequencing; and digestion of the final cassette plasmids.

GFP screening

The empty plasmid (pRS426__ConLS'-GFPdropout-ConRE'-URA3-2micron-Kan) used in the Golden Gate cassette plasmid assemblies contains a GFP dropout. This way, when the assembled plasmid is inserted correctly, the GFP insert will dropout and white colonies will be formed. Whenever the plasmid was not inserted correctly, the GFP dropout will still be present and colonies will appear green when examined under UV-light (Figure 3). With this simple technique we make sure that the chosen colonies have uptaken the assembled plasmid without the need of performing sequencing or digestion experiments prior to plasmid isolation. This GFP screening was used both for assembling our part plasmids and cassette plasmids.

Screening in E. coli colonies with the cassette plasmid SP034. The green fluorescent colony can be found underneath the cross, the rest appears white coloured. Only white colonies were selected for plasmid isolation.

GFP screening smoothened the process, considering the amount of samples that we needed to assemble. Generally, all the transformed E.coli with cassette plasmids (64 plates) showed a high efficiency of the Golden Gate assembly; a few countable green colonies were observed in contrast to hundreds of white colonies. One sample (SP007) didn’t show any white colonies, the sample was excluded for future experiments.

DNA concentrations

After purification of cassette plasmids amplified in E. coli, DNA concentrations were measured in order to assure a successful purification prior to Saccharomyces spp. transformations, results can be found in Table 2. These measurements were performed using a nanodrop spectrophotometer, measuring the DNA concentration at a spectrum of wavelengths.

DNA concentration and purity after E. coli transformation of cassette plasmid and plasmid isolation.

Sample number ng/ul A260/280 ratio Sample number ng/ul A260/280 ratio
SP001 22,6 1,823 SP034 80 0,001
SP002 15,6 1,803 SP035 35,5 1,529
SP003 29 1,598 SP036 15 1,523
SP004 19,7 1,662 SP037 29,4 1,699
SP005 54,3 2,027 SP038 48,5 1,960
SP006 53,5 1,508 SP039 14,6 2,021
SP008 69,5 1,853 SP040 18,3 1,649
SP009 16,1 2,414 SP041 25,1 1,668
SP010 12,1 1,898 SP042 45,1 1,488
SP011 13,3 1,430 SP043 15 2,150
SP012 58,9 1,812 SP044 29,9 1,472
SP013 19,5 2,058 SP045 78,4 1,708
SP014 29,1 1,473 SP046 33 1,451
SP015 28,3 1,461 SP047 62,3 1,489
SP016 52,3 1,458 SP048 97,6 0,002
SP017 72,1 1,553 SP049 13,4 1,949
SP018 34,7 1,627 SP050 119 0,002
SP019 79,8 1,788 SP051 21,2 1,612
SP020 15,7 1,627 SP052 56,5 1,471
SP021 17,9 1,729 SP053 6,3 2,117
SP022 71 1,514 SP054 39,2 1,798
SP023 13,2 1,688 SP055 25,7 1,490
SP024 10,6 2,078 SP056 50,7 2,069
SP025 82,7 0,002 SP057 73,3 1,806
SP026 9,5 1,602 SP058 51,7 1,987
SP027 47,2 1,512 SP059 30,6 1,363
SP028 64,2 1,544 SP060 61,6 1,936
SP029 10,5 2,297 SP061 62,4 1,848
SP030 16,6 1,892 SP062 82,4 1,741
SP031 51,3 1,507 SP063 66 1,547
SP032 45 2,067 SP064 58,8 1,718
SP033 48,7 1,617

The A260/280 ratio represents the purity of the DNA, an ideal sample would have a value of approximately 1.8. Due to large variation in DNA concentration and A260/280 ratio, 20 μl of each cassette plasmid was used in the Saccharomyces spp. transformation instead of 5 μl. Overall, the concentration of DNA in the samples after transformation and plasmid isolation were good and ensured the continuation with Saccharomyces spp. transformations.

DNA sequencing

To ensure that our ordered G-blocks for part 3 and part 3B we made were correct, we sequenced the basic parts (4 full alpha amylases and 4 alpha amylases without the native secretion peptide) assembled in the entry vector (more on this on the Engineering page). pYTK001 (entry vector) with Part 3 and 3b was sequenced using two primers flanking the gene, the vector is used to compose the cassettes plasmids in further experiments. All sequencing results are summarized in Table 3.

Sequencing results from all the parts.

Part Alpha amylase Gene Mutations Comments
Part 3 From A. oryzae - Good quality sequence, 100% gene coverage.
From B. amyloliquefaciens c.301C>Tp.T101I The amino acid is not in the substrate binding site of the protein. 100 % gene coverage.
From B. licheniformis c.690G>Tp.D231Y Mutations in the flanking aspartate have shown a decrease in thermostability [1]. No data about our mutation. 100% gene coverage.
From B. subtilis - Good quality sequence, 100% gene coverage.
Part 3b From A. oryzae - Good forward sequencing, 63% gene coverage.
From B. amyloliquefaciens - Good forward sequencing, 52%
From B. licheniformis - Good forward sequencing, 66% gene coverage.
From B. subtilis - Good forward sequencing, 50% gene coverage.

Generally, the results were quite successful and no major mutations were observed. However, two clear missense mutations were identified in alpha-amylases from B. amyloliquefaciens and B. licheniformis that may have an effect on the protein functionality. It would have been nice if we had the time to restart the cloning from the beginning, unfortunately, the cassette plasmids were already assembled and the time was very limited. In Part 3b full coverage of the gene couldn’t be achieved, repeating the sequencing was planned but sadly we didn’t have the time.

All in all, the mutations found will be taken into account for further results of those enzymes.

Constructs digestion

We used the restriction enzyme BsmBI to check for the correct ligation of the different constructs of the library. There are two sites for the enzyme BsmBI in the vector employed (pRS246) for our constructs, the sites are in part 1 and part 5, on both sides of the insert of interest (Figure 4). The use of BsmBI allows us to check for the size (Table 4) of the construct and assess the success of the assembly. The results of the digestion can be found in Figure 5.

Scheme of one of the constructs from the library, SP001, where the sites for the enzyme BsmBI are displayed.

Expected sizes of the different constructs of the library. The elements of each sample can be found in the pages of the Parts plasmids.

Sample Expected sizes A (bp) Expected sizes B (bp) Sample Expected sizes A (bp) Expected sizes B (bp)
SP001 3729 3387 SP034 3729 2983
SP002 3729 2869 SP035 3729 2970
SP005 3729 3387 SP037 3729 3387
SP006 3729 2953 SP038 3729 3387
SP008 3729 3237 SP039 3729 2982
SP009 3729 3237 SP040 3729 2970
SP010 3729 3387 SP041 3729 2802
SP011 3729 2776 SP042 3729 2982
SP012 3729 2755 SP043 3729 2953
SP013 3729 3381 SP044 3729 2886
SP014 3729 2982 SP045 3729 2982
SP015 3729 2869 SP046 3729 2866
SP016 3729 2790 SP047 3729 2755
SP017 3729 2869 SP048 3729 2970
SP018 3729 2880 SP049 3729 2749
SP019 3729 2796 SP050 3729 2970
SP020 3729 2886 SP051 3729 2970
SP021 3729 3387 SP052 3729 2982
SP022 3729 2982 SP053 3729 2755
SP023 3729 2970 SP054 3729 3291
SP024 3729 2886 SP055 3729 2982
SP025 3729 2802 SP056 3729 2796
SP026 3729 2982 SP057 3729 2970
SP027 3729 3237 SP058 3729 2755
SP028 3729 2982 SP059 3729 2886
SP029 3729 2982 SP060 3729 3237
SP030 3729 2866 SP061 3729 2970
SP031 3729 2886 SP062 3729 2976
SP032 3729 2970 SP063 3729 2982
SP033 3729 3387 SP064 3729 2869
pRS426-GFP 3729 1338
Digestion of the library of constructs with the enzyme BsmBI.

After the digestion of all the constructs it can be seen that the expected sizes of the engineered plasmids match the sizes of the bands shown in the gel. The efficiency of the Golden Gate assembly is high. There are two samples in which a band of the size of the GFP-drop out is observed, SP012 and SP058, besides the expected size. This result may be due to contamination while pipetting. Nevertheless, the expected band can be observed, the samples were used in follow up experiments, consideration on alpha amylase activity will be taken into account. Additionally, some samples (SP006, SP011, SP022, SP023, SP044, SP046, SP047, SP052, SP053, SP055) show faint bands, likely evidencing that the concentration of DNA was low in the first place. The faint bands have the expected size, and considering the high efficiency in the rest of the digests we conclude that our Golden Gate assemblies and further screening was highly successful. No samples were excluded after digestion.

Alpha amylase production

For our third proof of concept, we wanted to show that our GMO does indeed show alpha-amylase activity after taking up our construct. We aimed to measure the alpha-amylase production of 64 samples listed at our Engineering page. This was tested with two main experiments. The principal one being an alpha amylase kit assay used to perform high throughput experiments and obtain quantitative data, needed for our model (see Model page). Additionally, a qualitative assay was employed to confirm the production of alpha-amylase in some of the samples. To understand more how this assay works, see the Experiments page.

Alpha-amylase starch breakdown

Functionality of alpha-amylase can qualitatively be proved by observing the breakdown of starch. The ability to break down starch is only available when amylase is produced, which in the case of S. cerevisiae and S. paradoxus doesn’t happen natively (Figure 6e). However, the assembled constructs should be able to break down starch. In order to confirm this, selective media plates without uracil (specific for the assembled constructs) were created with the addition of 1% starch. Several samples were cultured ON and treated before being plated on the starch plates, as described in the notebook (wk 39), after which pictures were taken (Figure 6) of the with/without iodine treated starch plates. Samples tested with this assay can be found in Table 5.

Composition of samples that were tested by the Halo assay.

Sample Yeast Strain Promoter Gene Terminator
SP001 ySB77 pPGK1 αMF + BS tTDH1
SP010 ySB77 pHHF1 αMF + BS tTDH1
SP021 ySB77 pPGK1 αMFΔ + BA tTDH1
SP026 ySB76 pHHF1 αMF + BL tTDH1
SP027 ySB76 pRNR1 BS tTDH1
SP034 ySB76 pTEF1 αMF + BL tTDH1
SP035 ySB76 pPAB1 αMF_no_EAEA + BL tTDH1
SP051 ySB76 pRNR1 αMF_no_EAEA + BL tTDH1
SP063 ySB76 pRNR2 αMF + BL tTDH1
Halo assay for 9 constructs and non engineered Saccharomyces spp. in triplets, each plate contains 3 samples, SP001, SP010 and SP021 in plate 1 (left one), SP026, SP027, SP034 in plate 2 (middle) and SP035, SP061 and SP063 in plate 3 (right). Cells were pretreated differently to check for intra- or extracellular alpha amylase activity. A) Pure cells, B) Supernatant of overnight culture, C) Lysate of cells by bead beating 10s, D) Lysate of cells by bead beating during 10s + 30s.

As can be seen in Figure 6, a larger halo was observed in the cells’ lysate samples. Halos were also visible in most of the supernatant samples. This implies that alpha-amylase is present extra- and intracellular after cell lysis. We were able to confirm that this activity was only produced in the strains that had taken up the created constructs, as the non engineered strains didn’t show any halo at all. Overall, we see a difference in the alpha-amylase activity when samples were treated differently, suggesting that the secretion of the enzyme varies between constructs. More specifically to our project, we planned to isolate the protein from the supernatant, this way the processing of the cells is minimal and lysating is not directly necessary. Therefore, it was interesting to find that lysating some of the samples did produce a bigger halo, since this could change our proposed way of extracting alpha-amylase. Only few samples of the library were tested through this essay, therefore general conclusions can not be drawn, but confirmation of alpha amylase-activity could be assessed.

Alpha-amylase assay kit

We used a colorimetric assay to obtain quantitative data of the activity of the alpha amylase produced by the engineered yeast, allowing for comparison between samples and for quantitative data to use in our model.

Activity of alpha amylase in a random set of samples from the library. The sample number refers to the SP0 (being #3 the same as SP003). Activity is measured as the absolute change in absorbance due to the appearance of p-nitrophenol after 3h.

The assay is based on the capacity of alpha amylase to cleavage ethylidene-pNP-G7, resulting in p-nitrophenol, a product that can be read at a wavelength of 405 nm, more details on the Experiments page. A testing round was first performed (Figure 7) in duplicates per sample to check how the experiment was done and identify possible steps that may need to be optimized. This round also helped to exclude samples SP003, SP004 and SP036 for further analysis as we were tight on reagents and the results weren’t promising. 

During the high throughput experiments, we tested the capacity to produce functional alpha-amylase of all the samples which were not excluded from the subset of the combinatorial library (the excluded samples are listed on the Model page, section Accounting for lost samples). Activity of alpha-amylase in triplicates of each sample can be found in Figure 8. Supernatants from overnight cultures were incubated with ethylidene-pNP-G7 in 96 well plates and the appearance of p-nitrophenol was measured every 5 min during the first hour, then once after 2h and 3h. If alpha-amylase was in the supernatant, an increase of absorbance was expected due to the appearance of p-nitrophenol, an indirect measurement of alpha-amylase activity. Therefore, activity (y-axis of Figure 8) was measured as the increase of absorbance (quantity of p-nitrophenol) that appears per time unit (d[p-nitrophenol]/dt).

High throughput experiment on alpha amylase activity. We tested the activity of alpha amylase in triplicates in 60 samples. The different graphs account for 3 different 96-well plates a) Samples measured together in a 96-well plate on October 2nd, b) Samples measured together in a 96-well plate on October 5th and c) Samples measured together in a 96-well plate on October 5th.

After having excluded the samples in which the variance between replicates deviated too much from the average variance, a table summarizing the mean activity of each sample can be found below. The different combinations from the subset library are also displayed in Table 6.

Alpha-amylase activity of the construct from the subset of combinations. The different components from each sample are also displayed.

Sample Strain Promoter Secretion signal Gene Activity values (d[p-nitrophenol]/dt)
SP001 ySB77 pPGK1 αMF B.subtilis 0,005476099
SP002 ySB85 pRNR1 αMFΔ A.oryzae 3,4293E-05
SP005 ySB78 pHHF1 αMF B.subtilis 0,000260468
SP008 ySB78 pHHF1 Native B.subtilis 0,000192798
SP009 ySB85 pRNR1 Native B.subtilis 0,00052307
SP010 ySB77 pHHF1 αMF B.subtilis 0,001100158
SP012 ySB78 pRNR1 Native A.oryzae 9,26803E-05
SP013 ySB85 pRAD27 αMF B.subtilis 0,000945715
SP014 ySB76 pREV1 αMF B.amyloliquefaciens 0,00001
SP015 ySB76 pPGK1 αMFΔ A.oryzae 1,44142E-05
SP016 ySB77 pRAD27 Native B.licheniformis 1,2205E-05
SP018 ySB76 pRAD27 αMFΔ B.amyloliquefaciens 0,00001
SP019 ySB78 pTEF1 Native B.licheniformis 1,08206E-05
SP022 ySB77 pREV1 αMF B.amyloliquefaciens 1,14565E-05
SP023 ySB77 pRNR2 αMF_no_EAEA B.licheniformis 0,00001
SP024 ySB77 pTEF1 αMFΔ B.licheniformis 0,000341834
SP025 ySB78 pPAB1 Native B.amyloliquefaciens 1,15319E-05
SP026 ySB76 pHHF1 αMF B.licheniformis 0,000200209
SP027 ySB76 pRNR1 Native B.subtilis 0,001931731
SP028 ySB77 pTEF1 αMF B.amyloliquefaciens 1,20322E-05
SP029 ySB85 pRPL18B αMF B.licheniformis 5,32401E-05
SP030 ySB85 pTDH3 αMFΔ B.amyloliquefaciens 0,00001
SP031 ySB77 pPGK1 αMFΔ B.amyloliquefaciens 2,44344E-05
SP032 ySB78 pRPL18B αMF_no_EAEA B.amyloliquefaciens 1,85894E-05
SP033 ySB85 pREV1 αMF B.subtilis 0,000845811
SP034 ySB76 pTEF1 αMF B.licheniformis 0,000211464
SP035 ySB76 pPAB1 αMF_no_EAEA B.licheniformis 0,00021031
SP037 ySB85 pRPL18B αMF B.subtilis 0,00113086
SP038 ySB78 pRNR2 αMF B.subtilis 0,002933777
SP039 ySB85 pTEF1 αMF B.licheniformis 1,41915E-05
SP041 ySB85 pHHF1 Native B.amyloliquefaciens 1,17257E-05
SP044 ySB77 pRNR2 αMFΔ B.licheniformis 4,16163E-05
SP045 ySB78 pPAB1 αMF B.licheniformis 9,44327E-05
SP046 ySB77 pTDH3 αMFΔ B.licheniformis 1,15677E-05
SP047 ySB77 pPGK1 Native A.oryzae 0,00019171
SP049 ySB85 pRAD27 Native A.oryzae 0,00029359
SP050 ySB77 pPGK1 αMF_no_EAEA B.amyloliquefaciens 1,39692E-05
SP051 ySB76 pRNR1 αMF_no_EAEA B.licheniformis 0,000290985
SP052 ySB77 pPAB1 αMF B.amyloliquefaciens 1,01931E-05
SP053 ySB85 pHHF1 Native A.oryzae 1,35179E-05
SP054 ySB85 pRPL18B αMFΔ B.subtilis 0,000154906
SP055 ySB77 pRPL18B αMF B.licheniformis 0,00001
SP056 ySB78 pPAB1 Native B.licheniformis 1,94392E-05
SP058 ySB78 pRNR2 Native A.oryzae 6,79755E-05
SP060 ySB85 pREV1 Native B.subtilis 0,000223757
SP061 ySB85 pREV1 αMF_no_EAEA B.amyloliquefaciens 1,40926E-05
SP062 ySB85 pRAD27 αMF B.amyloliquefaciens 1,38021E-05
SP063 ySB76 pRNR2 αMF B.licheniformis 0,000596797

Experiment were performed in triplicate to account for stochasticity, samples SP006, SP011, SP020, SP021, SP042 and SP059 were excluded, as the variance between replicates deviated from the mean variance, evidencing that the signal was more likely an artefact than actual alpha amylase activity. For the rest of the samples displayed in the graphs it can be concluded that there is a high variability in terms of how active alpha amylase is. Some constructs aren’t successful in expressing active alpha amylase in the supernatant (any sample with an activity value below 0,00001 was considered non active). However, other samples produce bigger amounts of p-nitrophenol and thus have more alpha-amylase activity in the supernatant.

Some general trends can be observed by looking at the graphs and the table, for instance, a relatively high alpha amylase activity is observed in samples that contain the alpha amylase gene from the host B. subtilis. The interpretation of such a big set of samples and identification of the best performers is an arduous task. AI technology could help getting deeper insights on our library and finding a better combination from the combinatorial library, more on this can be found in the model page.

Improving alpha amylase production

We used the Automated Recommendation Tool (ART) to model which samples from the combinatorial library discussed on the engineering page would yield a high alpha-amylase activity in the supernatant (exploitative recommendations). Furthermore, the ART also provides recommendations on which samples should be tested in a future DBTL-cycle in order to improve the performance of the model (exploratory recommendations). On this page, only the final results of the ART are discussed. How these results are generated and how to analyze them is discussed on the modeling page.

Exploitative recommendations

Exploitative recommendation given by the ART.

Rank strain promoter secretion gene Predicted ln(activity) Predicted activity
0 ySB78 pPGK1 αMF B.subtilis -6.241298 0.001947
1 ySB78 pTDH3 αMF B.subtilis -6.323765 0.001793
2 ySB76 pRNR2 αMF B.subtilis -6.363736 0.001723
3 ySB85 pPGK1 αMF B.subtilis -6.388392 0.001681
4 ySB76 pPGK1 αMF B.subtilis -6.435489 0.001604
5 ySB77 pRNR2 αMF B.subtilis -6.521911 0.001471
6 ySB77 pTDH3 αMF B.subtilis -6.566358 0.001407
7 ySB76 pHHF1 αMF B.subtilis -6.573551 0.001397
8 ySB76 pRNR1 αMF B.subtilis -6.666158 0.001273
9 ySB76 pTDH3 αMF B.subtilis -6.676021 0.001261
10 ySB85 pRNR1 αMF B.subtilis -6.689628 0.001244
11 ySB77 pRNR1 αMF B.subtilis -6.707219 0.001222
12 ySB76 pPAB1 αMF B.subtilis -6.75746 0.001162
13 ySB77 pREV1 αMF B.subtilis -6.892306 0.001016
14 ySB78 pRNR1 αMF B.subtilis -6.892674 0.001015
15 ySB85 pPAB1 αMF B.subtilis -6.914319 0.000993
16 ySB76 pTEF1 αMF B.subtilis -6.930234 0.000978
17 ySB78 pPAB1 αMF B.subtilis -6.938977 0.000969
18 ySB76 pREV1 αMF B.subtilis -6.942261 0.000966
19 ySB85 pTEF1 αMF B.subtilis -6.95399 0.000955
20 ySB77 pPAB1 αMF B.subtilis -6.968985 0.000941
21 ySB78 pRPL18B αMF B.subtilis -6.980546 0.00093
22 ySB85 pTDH3 αMF B.subtilis -6.996441 0.000915
23 ySB77 pTEF1 αMF B.subtilis -7.012753 0.0009
24 ySB78 pREV1 αMF B.subtilis -7.057184 0.000861
25 ySB78 pRAD27 αMF B.subtilis -7.06477 0.000855
26 ySB85 pRNR2 αMF B.subtilis -7.082414 0.00084
27 ySB78 pRNR1 αMF_no_EAEA B.subtilis -7.096215 0.000828
28 ySB78 pTEF1 αMF B.subtilis -7.134024 0.000798
29 ySB76 pRAD27 αMF B.subtilis -7.223168 0.000729

Table 7 shows that all exploitative recommendations contain the alpha-amylase encoding genes from B.subtilis. Furthermore, the αMF secretion peptide occurs in 29 out of 30 recommendations. It is also striking that 8 out of 10 recommendations contain either the pRNR2, pTDH3 or pPGK1 promoter, with the pPGK1 promoter occurring in 3 out of the top 5 recommendations. No clear pattern can be noticed in the recommendations regarding which Saccharomyces spp. should be used as a chassis.

Exploratory recommendations

Exploratory recommendation given by the ART

Rank strain promoter secretion gene Predicted ln(activity) Predicted activity
0 ySB78 pTDH3 αMF B.amyloliquefaciens -10.158696 0.000039
1 ySB78 pTDH3 Native B.amyloliquefaciens -10.167739 0.000038
2 ySB78 pRPL18B αMF_no_EAEA A.oryzae -9.694006 0.000062
3 ySB78 pPAB1 αMF_no_EAEA B.licheniformis -9.807968 0.000055
4 ySB78 pTDH3 αMF B.subtilis -6.323765 0.001793
5 ySB78 pPGK1 αMF B.amyloliquefaciens -10.157919 0.000039
6 ySB78 pTDH3 Native B.subtilis -7.246373 0.000713
7 ySB78 pRNR1 αMF_no_EAEA B.amyloliquefaciens -10.167606 0.000038
8 ySB78 pRNR1 αMF_no_EAEA A.oryzae -9.742028 0.000059
9 ySB78 pHHF1 αMF_no_EAEA B.amyloliquefaciens -10.261079 0.000035
10 ySB78 pREV1 αMFΔ A.oryzae -9.40958 0.000082
11 ySB78 pPGK1 αMF A.oryzae -9.517837 0.000074
12 ySB77 pPGK1 αMF A.oryzae -9.564904 0.00007
13 ySB77 pPGK1 αMF B.licheniformis -9.353338 0.000087
14 ySB78 pRAD27 αMF_no_EAEA A.oryzae -9.827786 0.000054
15 ySB78 pRNR1 αMF_no_EAEA B.licheniformis -9.966545 0.000047
16 ySB76 pTDH3 αMFΔ B.licheniformis -9.201508 0.000101
17 ySB78 pREV1 αMF_no_EAEA A.oryzae -9.898024 0.00005
18 ySB78 pHHF1 αMF_no_EAEA B.subtilis -7.240897 0.000717
19 ySB77 pREV1 αMF A.oryzae -9.653788 0.000064
20 ySB77 pTDH3 αMF B.subtilis -6.566358 0.001407
21 ySB85 pRAD27 αMF_no_EAEA A.oryzae -9.909267 0.00005
22 ySB77 pTDH3 Native B.subtilis -7.454713 0.000579
23 ySB85 pRAD27 αMF A.oryzae -9.6954 0.000062
24 ySB78 pRNR2 αMF A.oryzae -9.705825 0.000061
25 ySB78 pHHF1 αMF_no_EAEA B.licheniformis -10.126075 0.00004
26 ySB77 pHHF1 αMF A.oryzae -9.733338 0.000059
27 ySB78 pPGK1 αMF_no_EAEA B.licheniformis -10.143406 0.000039
28 ySB78 pTDH3 αMF_no_EAEA B.amyloliquefaciens -10.471749 0.000028
29 ySB78 pPGK1 αMF B.licheniformis -9.498398 0.000075

The exploratory results of the ART are shown in Table 8. While these patterns might seem mostly random at first sight, there are some striking patterns that reflect the frequency and variance distributions shown in figures 7 and 8 discussed in the “Step 1: Creating a well-balanced subset of the combinatorial library” section on the modeling page. For instance the pTDH3 promoter which was shown to be underrepresented in the training data, is part of 8 out the 30 exploratorium recommendations. If all promoters were equally distributed across the recommendations, it should have occurred around 3 times. Also the αMF_no_EAEA secretion signal occurs more often (13 out of 30) than expected (around 7.5 out of 30). However, the ySB76 occurs a lot less (1 out of 30) than expected (around 7.5 out of 30) even though figure 8 on the modeling page shows it contributes less to the overall variance in the dataset than the other strains.

Based on the results from the ART, it is expected that a device containing the alpha-amylase encoding gene from B.subtilis and the αMF secretion signal will yield a high alpha-amylase activity in the supernatant. Furthermore, the exploratory recommendations show some correlations with the frequency and variance distributions of the part-variants shown on the modeling page.

Final discussion, finding the most active alpha amylase and future perspectives

Overall, we were able to successfully clone and functionally express alpha-amylase in Saccharomyces spp. using Golden Gate Assembly. However, the level of alpha-amylase activity did differ based on the way of extraction (lysising or not) and the combination of construct and chassis. Some results by themselves may not appear to be very conclusive, but bringing all of them together helped us to get a better insight on how to find the most active alpha-amylase.  

When checking the activity of alpha-amylase the samples that holded the native sequence (Parts 3 and Parts 3b) from B. amyloliquefaciens and licheniformis, a general trend was observed, the samples didn’t perform very well (in terms of alpha-amylase activity). Combining these results together with the sequencing results, it turns out that those two samples had clear missense mutations that may affect the performance of them. Two possible scenarios could explain the lack of activity, 1. the deficient production of the enzyme or 2. the lack of functionality. Considering the experiments performed it is hard to tell what holds true in this case. Performing direct measurements like SDS gel electrophoresis to identify the presence of the protein would have given us more insights into where the problem was. Further experiments would be needed to check for the consequences of the mutations. Ideally, we would have avoided unknown mutations by an early detection of them, unfortunately, at the time we found them all the constructs of the library were already assembled and we didn’t have time to restart again. 

Besides the engineering strategy, a very important part when expressing heterologous proteins is measuring the successful expression and activity of the protein. Using different assays as we did is a safe and insightful manner of testing the performance of your device. Far from being redundant, we draw different but complementary conclusions from both experiments. The halo test helped us understand the importance of the secretion of the enzyme. On the other hand, the high throughput experiment enabled for more precise comparison between samples and proved to be an efficient way of getting quantitative data from your devices.

In our attempt to screen as many combinations as possible to find the one that would achieve the highest yield of functional alpha-amylase, we combined in vitro high throughput experiments with a machine learning tool (ART). By doing so we expanded the search space, by not being just restricted to the actual combinations that were tested in the lab. When merging these two elements (experiments and machine learning), we encountered some aspects that needed to be taken into consideration. In our case, the subset of samples from the combinatorial library that are going to be evaluated experimentally needed to be thoroughly selected, trying to find an even and representative distribution of all the elements inside each category (as explained in the Model page). Another important aspect is the selection of a performance measure that is representative of what would constitute a “good” device. We selected the activity of alpha-amylase in the supernatant of the machine to be this performance measure as it represents what we aimed to achieve (an efficient production of alpha-amylase) and initial results showed that it can be accurately measured.

There are some improvements that need to be made in the experiments pipeline (see section Improvements in the Model page). In the next DBTL cycle those changes would need to be implemented in order to get more accurate results. 

In general, we learnt that when expressing a new gene in a chassis, it is rather unpredictable how the different elements in the device would interact between each other. From our experience, we believe that by using this kind of technology more knowledge about the performance of your device can be acquired and the optimal results can presumably be obtained in fewer DBTL cycles.


  1. Declerck N, Machius M, Wiegand G, Huber R, Gaillardin C. Probing structural determinants specifying high thermostability in Bacillus licheniformis alpha-amylase. J Mol Biol. 2000 Aug 25;301(4):1041-57. doi: 10.1006/jmbi.2000.4025. PMID: 10966804.