Laboratory evolution of synthetic electron transport system variants reveals a larger metabolic respiratory system and its plasticity

Anand, Amitesh; Patel, Arjun; Chen, Ke; Olson, Connor A.; Phaneuf, Patrick V.; Lamoureux, Cameron; Hefner, Ying; Szubin, Richard; Feist, Adam M.; Palsson, Bernhard O.

doi:10.1038/s41467-022-30877-5

Download PDF

Article
Open access
Published: 27 June 2022

Laboratory evolution of synthetic electron transport system variants reveals a larger metabolic respiratory system and its plasticity

Nature Communications volume 13, Article number: 3682 (2022) Cite this article

5818 Accesses
4 Citations
18 Altmetric
Metrics details

Subjects

Abstract

The bacterial respiratory electron transport system (ETS) is branched to allow condition-specific modulation of energy metabolism. There is a detailed understanding of the structural and biochemical features of respiratory enzymes; however, a holistic examination of the system and its plasticity is lacking. Here we generate four strains of Escherichia coli harboring unbranched ETS that pump 1, 2, 3, or 4 proton(s) per electron and characterized them using a combination of synergistic methods (adaptive laboratory evolution, multi-omic analyses, and computation of proteome allocation). We report that: (a) all four ETS variants evolve to a similar optimized growth rate, and (b) the laboratory evolutions generate specific rewiring of major energy-generating pathways, coupled to the ETS, to optimize ATP production capability. We thus define an Aero-Type System (ATS), which is a generalization of the aerobic bioenergetics and is a metabolic systems biology description of respiration and its inherent plasticity.

Pareto optimality between growth-rate and lag-time couples metabolic noise to phenotypic heterogeneity in Escherichia coli

Article Open access 28 May 2021

A universal trade-off between growth and lag in fluctuating environments

Article 15 July 2020

Awakening a latent carbon fixation cycle in Escherichia coli

Article Open access 16 November 2020

Introduction

Respiration requires organisms to have an electron transport system (ETS) for the generation of proton-motive force across the membrane that drives ATP synthase. Although the molecular details of the ETS are well-studied and constitute textbook material, few studies have appeared to elucidate its systems biology. The most thermodynamically efficient ETS consists of two enzymes, an NADH: quinone oxidoreductase (NqRED) and a dioxygen reductase (O₂RED), which facilitate the shuttling of electrons from NADH to oxygen. However, evolution has produced variations within the ETS which modulate the overall energy efficiency of the system even within the same organism^1,2,3. The systems level impact of these variations and their individual physiological optimality remain poorly determined. To mimic varying ETS efficiency we generated four Escherichia coli deletion strains (named ETS-1H, 2H, 3H, and 4H), each with one of the four unbranched ETS variants that pump 1, 2, 3, or 4 proton(s) per electron, respectively. We then performed systems level characterization of these ETS variants. We observe that: (a) adaptive laboratory evolution (ALE) enables all four ETS variants to evolve to a similar growth rate; (b) the evolution of ETS variants is supported by specific rewiring of major energy-generating pathways that couple to the ETS to optimize their ATP production capability; (c) proteome allocation per ATP generated is the same for all the variants, (d) the aero-type, that designates the overall ATP generation strategy⁴ of a variant, remain conserved during its laboratory evolution, with the exception of the ETS-4H variant; and (e) integrated computational analysis of the data supports a proton-to-ATP ratio of 10 protons per 3 ATP for ATP synthase for all four ETS variants.

Results and discussion

E. coli has a highly flexible ETS consisting of 15 dehydrogenases and 10 reductases to allow growth in both oxic and anoxic environments⁵. The expression of these enzymes is regulated by a variety of electron acceptors with a known hierarchy, such that oxygen represses all anoxic respiratory pathways and nitrate represses other anoxic pathways^3,5. Despite this thermodynamic hierarchy, co-expression of different respiratory chains was reported in another γ-proteobacterium to expand the flexibility of its electron transfer network³. We probed the condition-dependent expression of all these dehydrogenases and reductases using a large RNA-seq compendium for E. coli⁶. We observed a spectrum of expression values of these genes across the experimental conditions showing the contribution of these enzymes in generating plasticity in energy metabolism (Supplementary Fig. 1).

To examine the contributions of individual oxic respiratory pathways to bioenergetics, we sought to design unbranched pathways through the ETS. The oxic component is contributed by both proton pumping and non-pumping NqREDs (hereafter referred to as NDH-I and NDH-II, respectively) along with three types of O₂REDs (Fig. 1a). Cytochrome bd O₂REDs (CBDs) are less electrogenic compared to Cytochrome bo₃ O₂REDs (CYO). There are two CBDs, bd-I and bd-II, and both functions similarly to generate proton-motive force (PMF) by a vectorial movement of protons involving transmembrane charge separation. The similar PMF generation strategies make bd-I and bd-II O₂REDs equivalent when choosing gene knockout strategies^7,8.

**Fig. 1: Generation and evolution of unbranched ETS variants.**

Based on these characteristics, we designed four ETS variants with unbranched electron flows representing all alternate oxic respiratory routes translocating 1, 2, 3, or 4 proton(s) per electron (designated as ETS-nH, with n = 1, 2, 3, 4). The designs of the four ETS variants are illustrated in Fig. 1b.

Next, we analyzed their growth phenotype (Fig. 1c). Interestingly, the unevolved variants (called uETS) showed different growth rates that had no clear association with their H⁺/e⁻ value. While the loss of activity of NDH-I showed a lesser growth rate retardation, the deletion of NDH-II significantly compromised the growth rate of the deletion strains.

To allow the ETS variants to overcome the growth defects resulting from gene deletions, we performed ALE with four independent replicates of each variant in an oxic environment (Supplementary Table 1) (evolved variants are named eETS-nHm with the replicate evolutionary endpoints indexed as m = A, B, C, D)⁹. We evolved all variants until their growth rate plateaued. ETS-1H, 2H & 3H required evolution for approximately 400 generations, while ETS-4H required approximately 700 generations. In spite of the different number of protons pumped per electron, all four ETS variants evolved to a similar optimized growth rate in replicate evolutions (~0.85 h⁻¹) (Fig. 1c).

Next, we sought to determine the acquired mutations that enabled adaptation to a higher growth rate for all ETS variants. We performed whole-genome sequencing of each strain and used a comprehensive database of mutations from ALE experiments (aledb.org¹⁰) to interpret the potential impact of the identified mutations. The mutation calling revealed only a few genetic changes in the evolved strains except for eETS-4HC which acquired 15 genetic changes (Supplementary Data 1). The higher number of mutations in eETS-4HC could be due to the mutated DNA mismatch repair enzyme mutS in this strain¹¹. Every ETS variant acquired mutations responsible for enabling faster growth on M9 minimal medium (Supplementary Table 2, Supplementary Data 1)^12,13,14,15. An intergenic mutation between pyrE and rph has been reported to alleviate pyrimidine pseudo-auxotrophy resulting in a faster growth rate. RNA polymerase subunit mutations are proposed to favor a higher growth rate by accelerating the transcriptional processes. Another common mutation reported to support a faster growth rate is in the intergenic region between hns and tdk. This mutation is expected to downregulate several stress response pathways and shift resources to support growth. uETS-1H carried the pyrE-rph intergenic mutation, which explains the relatively faster initial growth rate of this strain.

Besides mutations responsible for acclimatization to media, uETS-3H and uETS-4H acquired a common gene-related mutation in all four independently evolved lineages. This mutational convergence simplified the otherwise difficult task of establishing the genotype-phenotype relationship^16,17,18.

All four evolved replicates of uETS-3H acquired point mutations in sdhA, the catalytic subunit of succinate dehydrogenase (Supplementary Table 2). eETS-3HB acquired a point mutation that brings in a premature termination codon in the sdhA open reading frame, suggesting a loss of functional enzyme (Supplementary Data 1). We explored the potential impact of other mutations by investigating whether the SNPs could affect the protein’s function based on amino acid properties and sequence homology (SIFT)¹⁹ or structural stability (ΔΔG)²⁰. Almost all mutations in sdhA were either in or near interface surfaces and seem to be working to disrupt its functionality by either disrupting a substrate-binding site or causing a structural-functional perturbation (Fig. 1d, e). Notably, the deletion of another subunit of this enzyme, sdhC, has been reported to increase the biomass yield in an oxic environment²¹. uETS-3H appeared to adopt a similar metabolic route to increase its growth rate.

All four replicates of uETS-4H acquired mutations in an inadequately characterized gene, yjjX (Supplementary Table 2, Supplementary Data 1). The structural and biochemical evidence suggests that YjjX, an inosine/xanthosine triphosphatase, may be involved in the mitigation of the deleterious impact of oxidative stress by preventing the accumulation of altered nucleotides²². Also, the physical association of YjjX with the elongation factor suggests a negative impact on the translational rate. The STRING-based protein-protein interaction predicted the association of YjjX with glycolytic and ATP biosynthetic processes²³. Interestingly, eETS-4HA replaced the start codon, ATG, with ATA (Supplementary Data 1). Apart from replacing methionine with isoleucine, this substitution potentially diminishes the expression of this protein²⁴. A similar disruptive impact is expected from other yjjX mutations (Fig. 1f, g). The cysteine to tyrosine substitution at amino acid residue 30 was predicted to destabilize the structure as it lies just beside a subunit interface residue, and a charge reversion due to the glutamate to lysine substitution at amino acid residue 38 targets the metal-binding site. Thus, it appears that eETS-4H is attempting to prevent translational halting to achieve a higher growth rate.

Since the restoration of the evolved variants to the same growth rate cannot be deciphered from genetic changes alone, we took a broader systems view to understand the underlying metabolic perturbations. We examined how the evolved variants rewired the fluxes through the major metabolic pathways that couple to the ETS. We generated RNA sequencing and metabolite profiling data for all the strains and performed targeted and systems level analyses. We observed a high transcriptional correlation among the evolved replicates (Spearman’s rank correlation coefficient >0.75) of each variant, but the correlation between pre-and post-evolved variants was lower (Fig. 2a). Notably, consistent genetic and transcriptomic changes supported a common evolutionary trajectory for the replicates of each variant.

**Fig. 2: Metabolic rewiring supporting growth rate optimization in the ETS variants.**

Bacterial physiology displays a remarkable compensatory potential facilitated by altered metabolic flux states resulting from genetic and transcriptomic changes²¹. Therefore, we examined if the surrogate NqRED or O₂RED compensated for the loss of function resulting from deleted ETS enzymes (Fig. 2b). There was no clear compensatory trend in the strains with unbranched ETS except for ETS-4H. ETS-4H increased the expression of NDH-I while increasing or maintaining the expression of CYO after evolution. Surprisingly, the compensatory upregulation of ndh in uETS-2H was lost after evolution to a higher growth rate.

Since RNA expression levels may not correlate with metabolic fluxes due to differential translation efficiency and different enzyme catalytic turnover rates, we performed a metabolic flux distribution analysis. To obtain the metabolic flux map, we measured the medium exchange rates of the major metabolites related to respiratory metabolism (Supplementary Table 3). We used both the metabolite exchange rates and transcriptomic data as constraints to simulate the flux through the pathways of the central carbon metabolism using a genome-scale model of metabolism and protein expression (ME-model)²⁵. We observed a high correlation in the metabolic flux distributions of the four evolved replicates of each strain, further supporting a similar evolutionary pathway followed by replicates of each variant (Fig. 2a).

To more deeply understand the different metabolic states exhibited by the evolved variants, we examined the variations in their computed proteome allocation using the solutions from the phenotypic and transcriptomic constrained ME-models. We observed a clear distinction between strains with alternate NqRED for the preferred glycolytic pathway (Fig. 2c). NDH-I has approximately 10-times higher molecular mass as compared to NDH-II^26,27. Therefore, despite its PMF generation potential, NDH-I is a less preferred dehydrogenase during oxic respiration to achieve faster growth⁵. The non-proton pumping high turnover dehydrogenase, NDH-II, is better suited to relieve the growth bottleneck that may arise due to excess built-up of PMF while allowing the operation of oxic ETS^5,28.

The finite resource carrying capacity of a cell creates metabolic tradeoffs on how to partition the proteome to support metabolic pathways best suited for a given growth condition. With an approximately 3.5-fold higher protein cost, the Embden–Meyerhoff–Parnass (EMP) pathway consumes a larger proportion of proteome as compared to the Entner–Doudoroff (ED) pathway²⁹. However, the higher ATP yield of the EMP pathway alludes to a potential tradeoff between the two glycolytic pathways for optimizing ATP production while maintaining a growth-supporting proteome³⁰. The ETS-3H and ETS-4H strains forced to respire using larger NqRED (NDH-I) increased the flux through the proteome conservative ED pathway. Thus, we observed a compensatory selection of the preferred pathway to achieve a balanced proteome.

Interestingly, while strains with nuoB deletion (ETS-1H and ETS-2H) increased metabolic flux through complex II of ETS, ETS-3H appeared to minimize the flux through complex II (Supplementary Data 2). Notably, eETS-3H lacks ndh and acquired a mutation in the gene sdhA which codes for a complex II subunit. However, ETS-4H, which also lacks the ndh gene, increased the flux through complex II, albeit at a lower level compared to ETS-1H and ETS-2H.

Thus, metabolic plasticity (reflected in metabolic rewiring and associated proteome allocation) allows for redundancy in the eETS variants while supporting the same growth rate. Knowledge of this metabolic plasticity motivated the examination of the overall bioenergetics-state of the evolved ETS variants to fully understand the basis for the evolution to the same growth rate. We have earlier defined an approach to classify the E. coli phenotypes into aero-types, which is a quantitative fitness descriptor based on cellular respiratory behavior and proteome allocation⁴. The stratification of aero-types is based on the multimodal distribution of the fraction of total ATP produced through ATP synthase which is modulated through the discrete usage of ETS enzymes. We have reported a non-uniform distribution of phenotypic growth data in the rate-yield plane that can be approximately segregated in different aero-types based on sampling simulations. Here we used aero-types to examine the fitness distribution of ETS variants.

We observed that ETS-1H, ETS-2H, and ETS-3H did not show a major shift in their biomass yield during evolution and thus preserved their respective aero-types (Fig. 3a). The evolutionary optimization of growth rate appears to be largely driven by rewiring central carbon metabolism while oxidative energy metabolism is conserved. ETS-4H jumped from a lower to a higher aero-type after evolution, suggesting an increase in oxic metabolism. The ETS-4H variant has the highest PMF generation capacity. Its aero-type shift to higher classes occurred only after adaptive evolution.

**Fig. 3: Systems-level examination of ETS variants.**

The clustering of each evolved ETS variant along the same growth rate isocline (Fig. 3a) indicated global remodeling of the energy metabolic network to produce similar growth-supporting bioenergetics. We thus defined a larger respiratory system, called the Aero-Type System (ATS), consisting of oxidative phosphorylation, glycolysis, pyruvate metabolism, the TCA cycle, and the Pentose Phosphate pathway, that together define the overall state of oxic energy metabolism (Supplementary Fig. 3). The total proteome allocated to the ATS was very similar in each eETS variant, and the total ATP output of each proteome expressed was almost constant (Fig. 3b). Thus, the composition of the ATS was malleable and able to provide the same supply of ATP, allowing similar growth rates for all eETS variants. We also observed a trend in the metabolic location of ATP production across the variants, where the relative contribution of oxidative phosphorylation was highest for eETS-4H and lowest for eETS-1H (Fig. 3c). Accordingly, an inverse trend was observed for glycolytic and fermentative ATP production.

We next examined the transcriptome to identify the tradeoffs in gene expression that enabled the different metabolic states. We applied a blind source signal separation algorithm, called independent component analysis (ICA)³¹, to examine differential partitioning of the transcriptome of the 209 ATS genes. ICA decomposed the ATS transcriptome into independently modulated sets of genes (called iModulons) (Supplementary Data 3). The activities of several iModulons showed a clear association with the aero-type of the ETS variants (Supplementary Fig. 4). iModulons consisting of genes associated with oxic respiration showed a positive correlation with aero-type status (iModulons 8, 13, and b2287), and those constituted by anoxic and/or metabolic genes showed a negative correlation (iModulons 7, 9, 10, 16, and b3366) (Supplementary Fig. 2). Thus, an oxic-anoxic transcriptomic tradeoff enabled the four ETS variants to maintain similar ATP production capacity (Fig. 3d).

The direct measurement of the number of protons translocated through ATP synthase to produce one molecule of ATP (H⁺/ATP) is technically challenging and, therefore, it is still an area of active research³². The rotational catalysis-based calculation suggests the H⁺/ATP value to be 3.3, due to the symmetry mismatch between the F_o and F₁ complexes of ATP synthase: threefold symmetry of α3β3 in F1 and tenfold symmetry of the c-ring in F_o^33,34. The proton-to-ATP ratio may vary depending upon any change in the number of c-subunits and this modulation allows tailoring to meet the bioenergetic demand of various organisms³⁵. The H⁺/ATP value derived using a synthetically reconstituted membrane system was found to be 4³⁶. With our comprehensive definition of the state of the ATS amongst the variants, we could address the issue of ATP synthase proton-to-ATP ratio. We used data generated on the variants to computationally estimate the most likely proton-to-ATP ratio for E. coli ATP synthase³². We constrained the ME-model using the observed metabolic exchange rates and gene expression data and optimized for the H⁺/ATP value of ATP synthase that produces the experimentally estimated growth rates of the variants. The ME-model calculates the median value of the H⁺/ATP to be 3.25, a value close to 3.3 supporting the rotational catalysis hypothesis (Fig. 3e). Notably, while 10 is the preferred number of c subunits in the E. coli F_o motor of ATP synthase, the number of subunits can vary, which will change the H⁺/ATP value^37,38,39,40.

Taken together, our results lead to an expanded definition of oxic respiration beyond the conventional ETS, which involves an electron transport chain to create PMF, that then drives the ATP synthase. Here, we define the Aero-Type System that encompasses the ETS and coupled metabolic pathways (Supplementary Fig. 3). The ATS is composed of 209 genes (Supplementary Data 3). The ATS represents about 38% proteome allocation in all evolved variants. A decrease in the ETS energetic efficiency (often measured in terms of the P/O ratio) can be balanced by increased flux through the coupled metabolic pathways. This balance is governed by the cost of protein synthesis.

Remarkably, the overall proteome allocation to the ATS is similar in the evolved variants and generates the same amount of ATP, enabling them to achieve the same growth rate. The different ways in which the ATS is balanced underlies its plasticity and represents a demonstration of the key systems biology concept of alternate optimal states. These alternate states have a different combination of proton pumping efficiency, complementary metabolic rewiring achieved through tradeoffs in the composition of the transcriptome, and concomitant efficiency of proteome allocation, but enable the same overall cellular function. The cytoplasmic-periplasmic adaptive nexus that the ATS represents thus illustrates the deep plasticity inherent in achieving balanced energetic systems to match metabolic needs in different environmental niches.

Methods

Examining PRECISE 2.0 for expression levels of respiratory enzymes

PRECISE 2.0 is a compendium of high-quality RNA-seq for E. coli K-12⁶. It contains 815 RNA-seq datasets of samples with different genetic changes or varied growth conditions. We examined the expression of respiratory dehydrogenases and reductases in the entire dataset. For intelligible purposes, we plotted the expression levels in samples that are directly or indirectly associated with energy metabolism. The expression levels shown are the median value across replicates for a sample.

Strain generation and adaptive laboratory evolution

E. coli K-12 MG1655 (ATCC 700926) was used as the wild-type strain. P1 phage transduction method was used to generate the knockout strains⁴¹, and strains from the Keio collection were used as a donor for the gene knockout cassettes⁴². uETS-1H and uETS-3H were generated and used for validation purposes in an earlier study⁴. uETS-2H and uETS-4H were generated here and all four ETS variants were evolved for this study.

ALE was performed using 4 independent replicates of each ETS variant. Cultures were serially propagated on M9 minimal medium with 4 g/L glucose at 37 °C and well-mixed for proper aeration using an automated system that passed the cultures to fresh flasks once they had reached an A₆₀₀ of 0.3 (Tecan Sunrise plate reader, equivalent to an A₆₀₀ of ~1 on a traditional spectrophotometer with a 1 cm path length). Cultures were always maintained in excess nutrient conditions assessed by non-tapering exponential growth. The evolution was performed for a sufficient time interval to allow the cells to reach their fitness plateau.

Prediction of the effect of amino acid substitutions

The ALE mutation datasets supporting the conclusions of this article is available in the following open-access archive repository: https://doi.org/10.5281/zenodo.5431595. These datasets are also available in the ALEdb database¹⁰.

Mutated DNA sequence data processing was performed using Python 3. The mutations from ALEdb are described according to their experiment, evolution replicate, sample, and technical replicate. Some evolutions include midpoint samples that could inflate the frequency a mutation is observed. Unique ALE mutations were therefore only considered once per ALE. Starting strain mutations and hypermutator samples were filtered out of the ALE experiment mutation datasets according to their publications. Mutation needle plots were generated using the trackViewer R software package⁴³. The visualizations for the 3D protein structures were generated using the NGL software package⁴⁴. The software implementation of these actions is available in the following open-access archive repository: https://doi.org/10.5281/zenodo.5431595.

Mutation effects were predicted according to multiple methods. Truncations were predicted according to the potential effect of mutations on the function of start codons and their potential to introduce a premature stop codon. The predicted deleterious effects of SNPs were assumed according to significant SIFT (sorting intolerant from tolerant) scores (SIFT score < 0.05)¹⁹. The predicted structural destabilization effects of SNPs were assumed according to predicted significant ΔΔG scores (ΔΔG > 2)²⁰. SIFT and ΔΔG scores were acquired from Mutfunc⁴⁵. Functional annotations were acquired from UniProt⁴⁶ and Mutfunc.

DNA sequencing and RNA sequencing

A clone from the endpoints of evolved strains was picked for DNA sequencing and RNA sequencing. The strains were grown in an M9 minimal medium supplemented with 4 g/l glucose. Total DNA was sampled from an overnight grown culture and total RNA was sampled from a culture at an A₆₀₀ ~0.6. Nucleic acid isolation, library preparation, and subsequent analysis were performed as previously described⁴⁷. Briefly, genomic DNA was isolated using a Nucleospin Tissue kit including treatment with RNase A. Resequencing libraries were prepared following the manufacturer’s protocol using Nextera XT kit. RNA was isolated using the Qiagen RNeasy Mini Kit following suggested protocol. Ribosomal RNA was removed using Illumina Ribo-zero kit and a KAPA Stranded RNA-Seq Kit (Kapa Biosystems KK8401) was used to prepare sequencing libraries. Sequencing was performed on an Illumina HiSeq and/or NextSeq.

Phenotype characterization

Phenotype characterization was performed using two independent biological replicates. Samples for the substrate uptake and secretion rate were collected at regular intervals and filtered using a 0.22 μm filter (PVDF, Millipore). The measurements were performed using refractive index detection by HPLC (Agilent 12600 Infinity) with a Bio-Rad Aminex HPX87-H ion exclusion column. The HPLC method was the following: injection volume of 10 μL and 5 mM H₂SO₄ mobile phase set to a flow rate and temperature of 0.5 mL/min and 45 °C, respectively. The phenotype dataset was used for the aero-type classification of the strains as described previously⁴.

Metabolic flux mapping and estimation of H⁺/ATP value for ATP synthase

Flux mapping was done as previously described using a genome-scale model of metabolism and protein expression⁴⁸. The same FoldME model was used for estimating the H⁺/ATP value for ATP synthase within each ETS variant and replicates. The model was constrained with phenotypic data (glucose uptake rate, acetate production rate) and expression data was layered on using the same methods used for the flux mapping⁴⁸. In addition to these constraints, the necessary ETS genes for each variant were knocked out. Proton pumping ratios from 2.5 to 4.5 were sampled by changing the stoichiometry of the ATPS4rpp reaction in the ME-model, and then the proton pumping ratio was optimized so that the model produced a biomass dilution rate that matched the experimentally determined growth rate.

ATS proteome allocation calculation

The same FoldME model was used for the proteome allocation calculation as the flux mapping and ATP synthase estimation calculations. The model was constrained with phenotypic data (glucose uptake rate, acetate production rate, growth rate) and expression data was layered on using the same methods used for the flux mapping. Solutions from the fully constrained ME-models were then used for calculating proteome allocation. Total proteome allocation for each strain was calculated as follows:

$${Total}\,{Proteome}\,{Allocation}\,=\,\mathop{\sum}\limits_{i}{{mw}}_{i}\,*\, {V}_{i}^{{translation}}$$

Where ${{mw}}_{i}$ and ${V}_{i}^{{translation}}$ represents the molecular weight and translation flux of the ith protein in the model. Total proteome allocated to the ATS was calculated as follows:

$${Proteome}\,{Allocated}\,{to}\,{ATS}\,=\,\mathop{\sum}\limits_{i}{{mw}}_{i}\,*\, {V}_{i}^{{translation}}$$

where ${{mw}}_{i}$ and ${V}_{i}^{{translation}}$ represents the molecular weight and translation flux of the ith protein in the ATS (209 genes total). The list of 209 ATS genes was generated based on Clusters of Orthologous Groups (COG) and Gene Ontology (GO) categories to include as many relevant genes as possible to represent pathways involved in ATP production, then filtered to remove genes that are never expressed in the multiple model simulations. Mass fraction of proteome allocation to the ATS was calculated as a ratio of the two values for each strain.

Calculation of the total ATP produced by the ATS used the same fully constrained ME-model. A list of all metabolic reactions associated with ATS genes was curated. Reactions that consumed or produced ATP were noted and the stoichiometric coefficient associated with ATP was used as a modifier for calculating the total ATP production as follows (Table 1):

$${Total}\,{ATP}\,{Production}\,=\,\mathop{\sum}\limits_{i}{c}_{i}\,*\, {V}_{i}^{{metabolic}}$$

where ${c}_{i}$ and ${V}_{i}^{{metabolic}}$represents the ATP stoichiometric coefficient and the metabolic flux of the ith ATS associated reaction in the table below.

Table 1 Reactions that consume or produce ATP and corresponding stoichiometric coefficient.

Full size table

Total ATP Production/Total Proteome Allocated was calculated as a ratio of the total ATP production to the mass fraction of proteome allocated to the ATS for each strain.

ATS transcriptome ICA decomposition

Independent component analysis was performed on an RNA-seq dataset with steps described in⁶. The only genes included in the dataset were those contained in the list of 209 ATS genes. The dataset consisted of all unevolved strains, uETS-1H through 4H, and all evolved replicates eETS-1HA through eETS-4HD. Additionally, the unevolved and evolved wild-type strains were included with the former being used as a reference to center the data. The final and resulting dataset that was used for ICA contained 209 genes by 22 conditions.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Resequencing and expression profiling data that support the findings of this study can be accessed from NCBI Sequence Read Archive accession number PRJNA835443 and Gene Expression Omnibus accession number GSE202144 respectively. The PRECISE compendium and all associated data files can be found at https://github.com/SBRG/precise2. Source data for the figures can be found in Supplementary Tables 2, 3 as well as in the uploaded RNA-seq data.

Code availability

The software scripts supporting the prediction of mutation effects to the encoding of genes described in this article are available in the following open-access archive repository: https://doi.org/10.5281/zenodo.5431595. All the simulations performed in this manuscript can be reproduced using the FoldME model, which is constructed using the COBRApy toolbox version 0.5.11 for constraint-based modeling and its extension for ME-models, COBRAme version 0.0.9, ECOLIme version 0.0.9, and solveME, all publicly available on Github (https://github.com/SBRG/ME-script, https://github.com/SBRG/ecolime, https://github.com/SBRG/solvemepy). Custom code for constraining and solving ME-models can be found at https://github.com/SBRG/ME-script. GraphPad Prism version 9.2.0 was used for generating the plots.

References

Alberts, B. et al. The evolution of electron-transport chains. in Molecular Biology of the Cell. 4th edn (Garland Science, 2002).
Brochier-Armanet, C., Talla, E. & Gribaldo, S. The multiple evolutionary histories of dioxygen reductases: Implications for the origin and evolution of aerobic respiration. Mol. Biol. Evol. 26, 285–297 (2009).
Article CAS PubMed Google Scholar
Sturm, G. et al. A dynamic periplasmic electron transfer network enables respiratory flexibility beyond a thermodynamic regulatory regime. ISME J. 9, 1802–1811 (2015).
Article PubMed PubMed Central Google Scholar
Chen, K. et al. Bacterial fitness landscapes stratify based on proteome allocation associated with discrete aero-types. PLoS Comput. Biol. 17, e1008596 (2021).
Article CAS PubMed PubMed Central Google Scholar
Unden, G. & Bongaerts, J. Alternative respiratory pathways of Escherichia coli: energetics and transcriptional regulation in response to electron acceptors. Biochim. Biophys. Acta 1320, 217–234 (1997).
Article CAS PubMed Google Scholar
Lamoureux, C. R. et al. PRECISE 2.0—an expanded high-quality RNA-seq compendium for Escherichia coli K-12 reveals high-resolution transcriptional regulatory structure. https://doi.org/10.1101/2021.04.08.439047 (2021).
Borisov, V. B. et al. Aerobic respiratory chain of Escherichia coli is not allowed to work in fully uncoupled mode. Proc. Natl Acad. Sci. USA 108, 17320–17324 (2011).
Article CAS PubMed PubMed Central ADS Google Scholar
Ingledew, W. J. & Poole, R. K. The respiratory chains of Escherichia coli. Microbiol. Rev. 48, 222–271 (1984).
Article CAS PubMed PubMed Central Google Scholar
Sandberg, T. E., Salazar, M. J., Weng, L. L., Palsson, B. O. & Feist, A. M. The emergence of adaptive laboratory evolution as an efficient tool for biological discovery and industrial biotechnology. Metab. Eng. 56, 1–16 (2019).
Article CAS PubMed PubMed Central Google Scholar
Phaneuf, P. V., Gosting, D., Palsson, B. O. & Feist, A. M. ALEdb 1.0: a database of mutations from adaptive laboratory evolution experimentation. Nucleic Acids Res. 47, D1164–D1171 (2019).
Article PubMed Google Scholar
Acharya, S., Foster, P. L., Brooks, P. & Fishel, R. The coordinated functions of the E. coli MutS and MutL proteins in mismatch repair. Mol. Cell 12, 233–246 (2003).
Article CAS PubMed Google Scholar
LaCroix, R. A. et al. Use of adaptive laboratory evolution to discover key mutations enabling rapid growth of Escherichia coli K-12 MG1655 on glucose minimal medium. Appl. Environ. Microbiol. 81, 17–30 (2015).
Article PubMed ADS Google Scholar
González-González, A., Hug, S. M., Rodríguez-Verdugo, A., Patel, J. S. & Gaut, B. S. Adaptive mutations in RNA polymerase and the transcriptional terminator Rho have similar effects on Escherichia coli gene expression. Mol. Biol. Evol. 34, 2839–2855 (2017).
Article PubMed PubMed Central Google Scholar
Conrad, T. M. et al. RNA polymerase mutants found through adaptive evolution reprogram Escherichia coli for optimal growth in minimal media. Proc. Natl Acad. Sci. USA 107, 20500–20505 (2010).
Article CAS PubMed PubMed Central ADS Google Scholar
Long, A., Liti, G., Luptak, A. & Tenaillon, O. Elucidating the molecular architecture of adaptation via evolve and resequence experiments. Nat. Rev. Genet. 16, 567–582 (2015).
Article CAS PubMed PubMed Central Google Scholar
Utrilla, J. et al. Global rebalancing of cellular resources by pleiotropic point mutations illustrates a multi-scale mechanism of adaptive evolution. Cell Syst. 2, 260–271 (2016).
Article CAS PubMed PubMed Central Google Scholar
Thomas, A. K. et al. Mutational convergence acts as a major player in adaptive parallel evolution of Shigella spp. Sci. Rep. 9, 3252 (2019).
Article PubMed PubMed Central ADS Google Scholar
Horinouchi, T. et al. Phenotypic convergence in bacterial adaptive evolution to ethanol stress. BMC Evol. Biol. 15, 180 (2015).
Article PubMed PubMed Central Google Scholar
Vaser, R., Adusumalli, S., Leng, S. N., Sikic, M. & Ng, P. C. SIFT missense predictions for genomes. Nat. Protoc. 11, 1–9 (2016).
Article CAS PubMed Google Scholar
Mosca, R., Céol, A. & Aloy, P. Interactome3D: adding structural details to protein networks. Nat. Methods 10, 47–53 (2013).
Article CAS PubMed Google Scholar
Steinsiek, S., Frixel, S., Stagge, S., SUMO & Bettenbrock, K. Characterization of E. coli MG1655 and frdA and sdhC mutants at various aerobiosis levels. J. Biotechnol. 154, 35–45 (2011).
Article CAS PubMed Google Scholar
Zheng, J., Singh, V. K. & Jia, Z. Identification of an ITPase/XTPase in Escherichia coli by structural and biochemical analysis. Structure 13, 1511–1520 (2005).
Article CAS PubMed Google Scholar
Szklarczyk, D. et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2019).
Article CAS PubMed Google Scholar
Hecht, A. et al. Measurements of translation initiation from all 64 codons in E. coli. Nucleic Acids Res. 45, 3615–3626 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chen, K. et al. Thermosensitivity of growth is determined by chaperone-mediated proteome reallocation. Proc. Natl Acad. Sci. 114, 11548–11553 (2017).
Article CAS PubMed PubMed Central Google Scholar
Price, C. E. & Driessen, A. J. M. Biogenesis of membrane bound respiratory complexes in Escherichia coli. Biochim. Biophys. Acta 1803, 748–766 (2010).
Article CAS PubMed Google Scholar
Young, I. G., Jaworowski, A. & Poulis, M. I. Amplification of the respiratory NADH dehydrogenase of Escherichia coli by gene cloning. Gene 4, 25–36 (1978).
Article CAS PubMed Google Scholar
Vamshi Krishna, K. & Venkata Mohan, S. Purification and characterization of NDH-2 protein and elucidating its role in extracellular electron transport and bioelectrogenic activity. Front. Microbiol. 10, 880 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stettner, A. I. & Segrè, D. The cost of efficiency in energy metabolism. Proc. Natl Acad. Sci. USA 110, 9629–9630 (2013).
Article CAS PubMed PubMed Central ADS Google Scholar
Flamholz, A., Noor, E., Bar-Even, A., Liebermeister, W. & Milo, R. Glycolytic strategy as a tradeoff between energy yield and protein cost. Proc. Natl Acad. Sci. USA 110, 10039–10044 (2013).
Article CAS PubMed PubMed Central ADS Google Scholar
Sastry, A. V. et al. The Escherichia coli transcriptome mostly consists of independently regulated modules. Nat. Commun. 10, 5536 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Ferguson, S. J. ATP synthase: from sequence to ring size to the P/O ratio. Proc. Natl Acad. Sci. USA 107, 16755–16756 (2010).
Article CAS PubMed PubMed Central ADS Google Scholar
Kaila, V. R. I. & Wikström, M. Architecture of bacterial respiratory chains. Nat. Rev. Microbiol. 19, 319–330 (2021).
Article CAS PubMed Google Scholar
Sobti, M. et al. Cryo-EM structures provide insight into how E. coli FF ATP synthase accommodates symmetry mismatch. Nat. Commun. 11, 2615 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Petersen, J., Förster, K., Turina, P. & Gräber, P. Comparison of the H /ATP ratios of the H -ATP synthases from yeast and from chloroplast. Proc. Natl Acad. Sci. 109, 11150–11155 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Steigmiller, S., Turina, P. & Gräber, P. The thermodynamic H+/ATP ratios of the H+-ATPsynthases from chloroplasts and Escherichia coli. Proc. Natl Acad. Sci. USA 105, 3745–3750 (2008).
Article CAS PubMed PubMed Central ADS Google Scholar
Jiang, W., Hermolin, J. & Fillingame, R. H. The preferred stoichiometry of c subunits in the rotary motor sector of Escherichia coli ATP synthase is 10. Proc. Natl Acad. Sci. USA 98, 4966–4971 (2001).
Article CAS PubMed PubMed Central ADS Google Scholar
Preiss, L. et al. The c-ring stoichiometry of ATP synthase is adapted to cell physiological requirements of alkaliphilic Bacillus pseudofirmus OF4. Proc. Natl Acad. Sci. USA 110, 7874–7879 (2013).
Article CAS PubMed PubMed Central ADS Google Scholar
Schemidt, R. A., Qu, J., Williams, J. R. & Brusilow, W. S. Effects of carbon source on expression of F0 genes and on the stoichiometry of the c subunit in the F1F0 ATPase of Escherichia coli. J. Bacteriol. 180, 3205–3208 (1998).
Article CAS PubMed PubMed Central Google Scholar
Tomashek, J. J. & Brusilow, W. S. Stoichiometry of energy coupling by proton-translocating ATPases: a history of variability. J. Bioenerg. Biomembr. 32, 493–500 (2000).
Article CAS PubMed Google Scholar
Thomason, L. C., Costantino, N. & Court, D. L. E. coliGenome manipulation by P1 transduction. Curr. Protoc. Mol. Biol. 1.17.1–1.17.8 https://doi.org/10.1002/0471142727.mb0117s79 (2007).
Baba, T. et al. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2, 1–11 (2006).
Article Google Scholar
Ou, J. & Zhu, L. J. trackViewer: a Bioconductor package for interactive and integrative visualization of multi-omics data. Nat. Methods 16, 453–454 (2019).
Article CAS PubMed Google Scholar
Rose, A. S. et al. NGL viewer: web-based molecular graphics for large complexes. Bioinformatics 34, 3755–3758 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wagih, O. et al. A resource of variant effect predictions of single nucleotide variants in model organisms. Mol. Syst. Biol. 14, e8430 (2018).
Article PubMed PubMed Central Google Scholar
UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
Article Google Scholar
Anand, A. et al. Pseudogene repair driven by selection pressure applied in experimental evolution. Nat. Microbiol 4, 386–389 (2019).
Article CAS PubMed Google Scholar
Anand, A. et al. Restoration of fitness lost due to dysregulation of the pyruvate dehydrogenase complex is triggered by ribosomal binding site modifications. Cell Rep. 35, 108961 (2021).
Article CAS PubMed PubMed Central Google Scholar
Sandberg, T. E., Lloyd, C. J., Palsson, B. O. & Feist, A. M. Laboratory evolution to alternating substrate environments yields distinct phenotypic and genetic adaptive strategies. Appl. Environ. Microbiol. 83, 1–15 (2017).
Lennen, R. M. et al. Adaptive laboratory evolution reveals general and specific chemical tolerance mechanisms and enhances biochemical production. bioRxiv 634105 https://doi.org/10.1101/634105 (2019).

Download references

Acknowledgements

This work was funded by the Novo Nordisk Foundation Grant Numbers NNF10CC1016517 and NNF20CC0035580 and National Institutes of Health Grant R01GM057089. We would like to thank Marc Abrams (Systems Biology Research Group, University of California San Diego) for assistance with paper editing.

Author information

Authors and Affiliations

Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
Amitesh Anand, Arjun Patel, Ke Chen, Connor A. Olson, Patrick V. Phaneuf, Cameron Lamoureux, Ying Hefner, Richard Szubin, Adam M. Feist & Bernhard O. Palsson
Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, Maharashtra, India
Amitesh Anand
Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Kongens, Lyngby, Denmark
Adam M. Feist & Bernhard O. Palsson

Authors

Amitesh Anand
View author publications
You can also search for this author in PubMed Google Scholar
Arjun Patel
View author publications
You can also search for this author in PubMed Google Scholar
Ke Chen
View author publications
You can also search for this author in PubMed Google Scholar
Connor A. Olson
View author publications
You can also search for this author in PubMed Google Scholar
Patrick V. Phaneuf
View author publications
You can also search for this author in PubMed Google Scholar
Cameron Lamoureux
View author publications
You can also search for this author in PubMed Google Scholar
Ying Hefner
View author publications
You can also search for this author in PubMed Google Scholar
Richard Szubin
View author publications
You can also search for this author in PubMed Google Scholar
Adam M. Feist
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard O. Palsson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.A., A.F., and B.O.P. designed the study. A.A., C.O., and R.S. performed the experiments. A.A., A.P., K.C., P.P., C.L., and B.O.P. analyzed the data. A.A. and B.O.P. wrote the paper, with input from all co-authors.

Corresponding authors

Correspondence to Amitesh Anand or Bernhard O. Palsson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jeremy Wideman and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Peer Review File

Description of Additional Supplementary Files

Dataset 1

Dataset 2

Dataset 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Anand, A., Patel, A., Chen, K. et al. Laboratory evolution of synthetic electron transport system variants reveals a larger metabolic respiratory system and its plasticity. Nat Commun 13, 3682 (2022). https://doi.org/10.1038/s41467-022-30877-5

Download citation

Received: 16 November 2021
Accepted: 24 May 2022
Published: 27 June 2022
DOI: https://doi.org/10.1038/s41467-022-30877-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.