Structure of the transcription open complex of distinct σI factors

Li, Jie; Zhang, Haonan; Li, Dongyu; Liu, Ya-Jun; Bayer, Edward A.; Cui, Qiu; Feng, Yingang; Zhu, Ping

doi:10.1038/s41467-023-41796-4

Download PDF

Article
Open access
Published: 13 October 2023

Structure of the transcription open complex of distinct σ^I factors

Jie Li^1,2,3,4,5,6^na1,
Haonan Zhang^6,7^na1,
Dongyu Li^6,7,
Ya-Jun Liu^1,2,3,4,5,6,
Edward A. Bayer ORCID: orcid.org/0000-0001-7749-5150^8,9,
Qiu Cui^1,2,3,4,5,6,
Yingang Feng ORCID: orcid.org/0000-0002-0879-1316^1,2,3,4,5,6^na2 &
…
Ping Zhu ORCID: orcid.org/0000-0003-2648-3170^6,7^na2

Nature Communications volume 14, Article number: 6455 (2023) Cite this article

1849 Accesses
2 Citations
67 Altmetric
Metrics details

Subjects

Abstract

Bacterial σ^I factors of the σ⁷⁰-family are widespread in Bacilli and Clostridia and are involved in the heat shock response, iron metabolism, virulence, and carbohydrate sensing. A multiplicity of σ^I paralogues in some cellulolytic bacteria have been shown to be responsible for the regulation of the cellulosome, a multienzyme complex that mediates efficient cellulose degradation. Here, we report two structures at 3.0 Å and 3.3 Å of two transcription open complexes formed by two σ^I factors, SigI1 and SigI6, respectively, from the thermophilic, cellulolytic bacterium, Clostridium thermocellum. These structures reveal a unique, hitherto-unknown recognition mode of bacterial transcriptional promoters, both with respect to domain organization and binding to promoter DNA. The key characteristics that determine the specificities of the σ^I paralogues were further revealed by comparison of the two structures. Consequently, the σ^I factors represent a distinct set of the σ⁷⁰-family σ factors, thus highlighting the diversity of bacterial transcription.

σE of Streptomyces coelicolor can function both as a direct activator or repressor of transcription

Article Open access 06 January 2024

Phylogeny and structural modeling of the transcription factor CsqR (YihW) from Escherichia coli

Article Open access 03 April 2024

The bacterial multidrug resistance regulator BmrR distorts promoter DNA to activate transcription

Article Open access 08 December 2020

Introduction

Bacterial σ factors are critical components of RNA polymerase (RNAP) holoenzymes for initiation of transcription by specifically recognizing DNA promoter regions^1,2. A single bacterium often contains multiple σ⁷⁰ factors, which have been further classified into four groups, according to sequence conservation and domain architecture³. Group I includes housekeeping factors containing four conserved regions (σR1 to σR4) which are further divided into subregions⁴, largely corresponding to different domains (σ₁ to σ₄) by structural studies⁵. Groups II to IV include alternative σ factors regulating genes for specific functions. Group IV harbors only σ₂- and σ₄-domains, also termed ExtraCytoplasmic Function (ECF) σ-factors^6,7, which are functionally diverse for bacterial signaling in response to various external stimuli³. σ₂- and σ₄-domains recognize promoter −10 and −35 elements, respectively, and ECF σ-factor domains exhibit high specificity for promoter recognition, which is valuable in synthetic biology^7,8. Structures of RNAP in complex with different groups of σ factors, including housekeeping σ^A (group I), σ^S (group II), σ²⁸ (group III), ECF σ-factors σ^H, σ^L, and σ^E (group IV), and σ^N (family σ⁵⁴), have illustrated how bacterial σ factors specifically recognize promoters and initiate transcription^{9,10,11,12,13,14,15,16,17,18,19}.

All of the known σ⁷⁰ family σ factors in these structures specifically recognize the promoter −35 and −10 elements by σ₄ and σ₂ domains, respectively. In addition to the σ₂ and σ₄ domains, the group I σ factor contains σ_1.1 and σ₃ domains, which function in the auto-regulation of σ⁷⁰ and the binding of the extended −10 region, respectively^9,10. The group II σ factors lack σ_1.1 but exhibit high sequence identity with the group I σ factors in the regions from σ_1.2 to σ₄, and, therefore, present essentially the same structures in the σ₂, σ₃, and σ₄ domains^11,12. The group III σ factors lack both σ_1.1 and σ_1.2 and show weaker interactions between σ₄ and the −35 element than those of the group I and II σ factors^13,14. The group IV σ factors contain only σ₂- and σ₄-domains, which bind to RNAP and promoter DNA in a similar strategy to those of the other groups, but the detailed interactions between the group IV σ factor and the promoter DNA are quite different from the interactions of the other groups^15,16,17. These interactions are of great importance for the recognition of a consensus sequence of the −35/−10 elements by the group IV σ factors.

The σ^I (SigI) factor is a unique σ⁷⁰ that is widespread in Bacilli and Clostridia^{20,21,22,23,24}. It contains a σ₂-domain for recognition of the −10 element but lacks the σ₄-domain that recognizes the −35 element²⁵. σ^I was initially classified into σ⁷⁰-family group III²⁶ but later considered an ECF-like σ-factor, since its C-terminal domain (SigIC) was suspected of playing a recognition role for the −35 element despite its lack of sequence homology with σ₄^27,28. σ^I factors are involved in the heat shock response, iron metabolism, virulence, and carbohydrate sensing^21,24. Multiple paralogues of σ^I and cognate anti-σ^I factors (RsgIs) have been found, and these σ^I-anti-σ^I operons were shown to regulate component expression of cellulosomes, the multienzyme complexes that mediate efficient cellulose degradation^20,24,29. These RsgIs contain an exocellular carbohydrate-binding module, positioned to sense the extracellular polysaccharide substrate³⁰, a periplasmic domain that accommodates an autoproteolytic event for signal transduction^31,32,33, a transmembrane helix, and a cytoplasmic inhibitory domain that binds to SigI²³. Promoter sequences recognized by the σ^Is contain an A-tract motif and a CGWA motif in the −35 and −10 elements, respectively^27,28. σ^I paralogues exhibited distinct promoter-specificity, considered to be related to an upstream region of the A-tract motif^27,28. Although the N- and C-terminal σ^I-domains presumably recognize promoter −10 and −35 elements, respectively, it is unknown how they specifically recognize promoter DNA^23,25,27. The structure of σ^I in an active state (in complex with RNAP) is thus needed to elucidate the mechanism of specific promoter recognition by multiple σ^Is.

Here, we determined high-resolution cryo-EM structures of RNAP-σ-promoter complexes (transcription-ready open complexes, RPo complexes) for two C. thermocellum σ^Is. Structural analysis and functional validation revealed the unique promoter recognition mode and molecular mechanism of specificity for σ^I paralogues, which differ from all other known groups of σ⁷⁰ factors.

Results

Overall structure of RPo-σ^I

RPo complexes RPo-SigI1 and RPo-SigI6 were reconstituted using purified C. thermocellum RNAP core enzyme, purified recombinant Escherichia coli SigI1/SigI6, and synthesized P1/P6 DNA scaffolds (Fig. 1A and Fig. S1). The RPo-SigI1 and RPo-SigI6 structures were determined using single-particle cryo-electron microscopy (cryo-EM) (Fig. S2). Final structures were refined to 3.0-Å resolution for RPo-SigI1 and 3.3 Å for RPo-SigI6 (Table S1). Cryo-EM structures of the latter two complexes served to resolve the RNAP core enzyme (α, α, β, β′, and ω subunits), the σ^I, and promoter DNA with well-defined densities and structural stacking (Fig. 1B; Figs. S3 and S4). According to the previous report²⁷, the location of the transcriptional start sites (TSSs) of P1 and P6 differs, and the obtained RPo-SigI1 and RPo-SigI6 structures showed opened bubbles at different positions, which are not aligned with the experimental TSSs. For simplicity, we uniformly use base numbers from P1’s TSS position for both promoters (Fig. 1A). We observed the densities of DNA of the transcription bubble (−12 to +2) of the non-template strand (NT-strand) and the upstream (−43/−42 to −13 in RPo-SigI1/RPo-SigI6, respectively) and downstream DNA duplex (+3 to +15/+16 in RPo-SigI1/RPo-SigI6, respectively). The obtained RPo-σ^I structures adopt a closed conformation, by comparison with known bacterial RNAP holoenzyme and RPo structures (Fig. 1C).

**Fig. 1: Cryo-EM structures of RPo-SigI1 and RPo-SigI6 from *C. thermocellum*.**

The N-terminal SigI6 domain (SigI6N, residues 13–110) is located in the cleft between the RNAP-β lobe and RNAP-β‘ coiled-coil (β‘CC) with extensive hydrophobic and hydrogen-bond interactions, while the C-terminal SigI6 domain (SigI6C, residues 134–245) forms hydrophobic interactions with the flap-tip helix (βFTH) of the RNAP β subunit (Fig. 1D and Fig. S5). The relative position of SigIN with RNAP-β‘CC is similar to that of other σ⁷⁰-family σ₂-domains and β‘CC (Fig. S5A), but the detailed interactions are different, resulting in different helix orientations relative to β‘CC (Fig. S5B). These differences are caused by non-conserved interacting residues in the different types of σ factors, although those of RNAP-β‘CC are highly conserved (Fig. S5D, E). The interacting hydrophobic residues of SigIC with βFTH are completely different from those of the σ₄-domains of other σ⁷⁰ factors (Fig. S5C, F), because SigIC has no sequence homology with σ₄ and adopts different structural elements in binding βFTH.

Promoter DNA binds to both σ^I and the RNAP core enzyme (Fig. 1D). The upstream region of the promoter forms a duplex and the −35 element interacts with SigIC helices α8-α12. The downstream region forms the transcription bubble through extensive interaction with SigIN. SigIN binds to the −10 element, forming the opening of the bubble, and stabilizing the NT-strand DNA. Finally, the NT and template strands form a duplex and exit RNAP from the channel between the clamp formed by RNAP β and β‘ subunits.

Although the overall structures of RPo-SigI1 and RPo-SigI6 are similar, some differences are observed when the structures are aligned by their RNAP core enzymes (Fig. 1E). The SigIC domains show a rotation and shift, and the SigI1C-bound −35 element bends more towards RNAP than the promoter-bound SigI6C. The first α-helix of the N-terminal SigI1 and SigI6 domains also showed different orientations.

The overall architecture of the C. thermocellum RPo complexes is similar to other known RPo complexes from various bacteria^16,34,35,36. However, structural analysis of the σ^I-promoter interactions (Fig. S6) indicated that the mechanism of recognition is different from other known σ⁷⁰ family members, as shown below.

Interactions between σ^I and promoter −35 element

SigIC binds to the −35 element through both its HTH structure formed by helices α11 and α12 in the DNA major groove and the N-terminal part of helix α9 in the minor groove (Fig. 2A, B and Fig. S7). Although the local resolutions of the SigIC-binding region (about 4.5 Å) are lower than the resolution in the RNAP core regions, and the densities of the SigIC side chains are not always clearly observed, the SigIC model structures predicted by Alphafold³⁷ fit well into the densities, and some of the large side chain residues, such as Phe and Tyr, can be observed with clear side chain densities (Fig. S4), resulting in the construction of reliable models for the SigIC-promoter binding regions. Minor-groove binding in the −35 element has not been observed in other σ⁷⁰-family members^11,13,16,34. Several residues of helix α9 are involved in the interaction with the minor groove. The side chain of H171/H173 in SigI6/SigI1 is inserted into the minor groove, forming hydrogen bonding and stacking interactions with the ribose rings. Adjacent conserved residues, including R172/R174, S174/S176, and K170/K172, interact with backbone phosphates of the double-stranded DNA (dsDNA). These minor-groove-binding residues are conserved in σ^I (Fig. S7C), and the SigIC-binding minor groove is formed by the characteristic, essential A-tract region in σ^I-dependent promoters^27,28. To confirm the importance of the SigIC minor-groove-binding residues, we analyzed the activity of SigI6 and its mutants using both an in vivo heterologous Bacillus system^27,28,38 and in vitro transcriptional activity assays³⁹ (Fig. 2D, E). The in vivo heterologous Bacillus system revealed that mutation of H171 to Tyr, Phe, Asn, Ser, or Ala resulted in complete loss of activity, while mutation to Lys or Arg resulted in significantly decreased but detectable activity, since they are minor-groove-binding residues observed in A-tract binding proteins^40,41,42. Mutation of K170 and R172 also significantly decreased activity, confirming their functional importance. The in vitro transcriptional activity assays also exhibited similar results (Fig. 2E), indicating the functional importance of the minor-groove binding by SigI.

**Fig. 2: Interactions between the promoter DNA −35 element and SigI6 in the RPo-SigI6 structure.**

Unlike the minor-groove binding by conserved residues, the major groove of the −35 element was mainly bound by non-conserved SigIC residues. The interacting residues include conserved R215/R217 and non-conserved R219/N221, K200/N202, T203/R205, L204/N206, K221/R223, R214/G216, S213/H215, and E218/G220 of SigI6/SigI1, which form different interactions in the two RPo structures, indicating their selective importance in promoter specificity (Fig. 2B; Figs. S6 and S7A, C). Consistently, the major-groove DNA sequence corresponded to the region of specificity (ROS) proposed previously²⁷. Mutation of interacting RPo-SigI6 residues resulted in the loss of activity (Fig. 2D, E), indicating their roles in promoter binding. Comparison of SigI6C and SigI1C revealed slight differences in helix orientations (Fig. S7B), but showed significant shift and rotation relative to the RNAP core enzyme (Fig. 1E).

The DNA-binding mode of SigIC differs from that of the σ⁷⁰-family σ₄-domain, which binds to the major groove only^{11,13,16,17,34,43}. The additional minor-groove binding results in significantly larger interface area (952 Å²) between SigIC and promoter versus that between σ₄-domain and promoter (e.g., 769 Å² of σ^H from M. tuberculosis and 530 Å² of σ^A from B. subtilis). Furthermore, the binding modes of SigIC and σ₄ with the major groove are completely different. Previous studies indicated that SigIC would show steric hindrance if it would adopt a dsDNA binding conformation similar to that of the σ₄-domain of ECF σ-factors^23,44. The RPo-σ^I structures indeed revealed that although SigIC interacted with the major groove via its HTH structure (α11 and α12), its position exhibits a ~ 180° rotation compared with that of the σ₄-domain (Fig. 2A). This rotation not only resolves the potential steric clash but also allows the N-terminal part of helix α9 to fit into the minor groove forming an additional DNA-binding interface, representing a unique binding mode among the known σ⁷⁰ factors. A Dali search⁴⁵ revealed that the DNA-binding mode of SigIC is similar to the winged HTH domain of transcriptional factors, among which an ROK-family repressor Lmo0178⁴⁶ shows high similarity (Fig. 2C). HTH motifs of SigIC and Lmo0178 similarly bind to the major groove and a positively charged residue (His171/Lys9 in SigI6/Lmo0178, respectively) on an N-terminal helix that penetrates the downstream minor groove (Fig. 2C). However, as opposed to SigIC, Lmo0178 is dimeric and binds to a palindrome sequence, and a β-loop wing binds to the upstream minor groove.

In summary, SigIC has a unique −35 element recognition mode formed by two features: the conserved minor-groove A-tract binding lacking in σ₄-promoter recognition, and non-conserved major-groove ROS-binding by the HTH motif, which presents a ~ 180° rotation compared to the σ₄-HTH motif. Therefore, σ^I-promoter recognition of the −35 element differs completely from that of the σ₄-domain of other σ⁷⁰ factors.

Interactions between σ^I and the −10 element

The SigIN domain adopts an oval structure formed by three helices α2-α4, similar to the σ₂-domain of other σ⁷⁰ factors, and helix α1 is attached to one head of the oval, somewhat similar to the second helix of the σ_1.2 region (σR1.2) of groups I and II (Fig. 3A). Similar to other σ₂-domains, SigIN opens the duplex of the −10 element to form the transcriptional bubble, mainly through helix α4. SigIN also binds the NT-strand through α1, α2, and Loop3 (connecting α3 and α4), thus stabilizing the unwound transcription bubble. The bubble size (number of unpaired nucleotides) is 14 bp, similar to that (13–15 bp) opened by groups I-III σ⁷⁰ factors^{9,11,47,48,49} but different from that (12 bp) of ECF σ-factors^15,16,17. Although the overall structure is similar to the σ₂-domain of other σ⁷⁰ factors, the detailed comparison showed unique interactions between SigIN and promoter DNA for specific promoter recognition (Fig. 3A), as described below.

**Fig. 3: Interactions between promoter DNA −10 element and SigIN in the RPo-σ^I structures.**

The conserved C_-14G_-13W_-12A_-11 motif (CGAA in both P1 and P6) at the −10 element is recognized by helix-α4 residues. Paired C:G(−14) and G:C(−13) interact with R104/R108, D101/D105, R97/R101, and R98/R102 of SigI6N/SigI1N, and A_-12 initiates bubble formation. R97/R101 in SigI6/SigI1 serves as a wedge to disrupt stacking between positions −13 and −12. A_-11 is inserted into the protein pocket, formed by N-terminal A78/K82, K83/K87, D80/D84, and H84/G88 of Loop3 and N-terminal conserved F90/F94 of helix α3 (Fig. 3B and Fig. S8A). Mutation in conserved residues that bind C:G(−14), G:C(−13), and A_-11 resulted in near-complete activity loss, except for the D80A mutation (Fig. 3C). Non-conserved H84 does not play a key role in specific recognition. A_-12 showed extensive interaction with identical residues R97/R101, E74/E78, F90/F94, and Q93/Q97 in SigI6/SigI1, but R97/R101, F90/F94 and Q93/Q97 are only partially conserved in the other σ^Is (Fig. S8B). Therefore, we suspect that non-conserved W_-12 may partially contribute to specific promoter recognition for different σ^Is. A previous study showed that SigI3 binds the CGTA motif, and CGTA-to-CGAA mutation of the SigI3-dependent promoter Prgl11A resulted in a complete loss of SigI3 recognition²⁸. The CGTA mutant (A_-12t) of PsigI6 cannot be recognized by SigI6 in the heterologous Bacillus system (Fig. 3C), and the in vitro transcription activity of the CGTA mutant also decreased significantly (Fig. 2E), indicating the importance of W_-12 in σ^I specificity.

All unpaired bases of the NT-strand DNA in the bubble (from −12 to +2) in the two RPo-σ^I structures turn outward with abundant π-stackings between successive bases, and bases from −12 to −3 form extensive interactions with SigIN (Fig. 3A, B and Fig. S8A). This is a unique structural feature in known RNAP-σ complexes, since only part of the NT-strand bases in the bubble flip out in other group σ⁷⁰-RNAP complexes (Fig. 3A)^11,16,43. According to sequence alignment (Fig. S8B), residues binding to −10 element downstream bases are largely non-conserved in σ^I. The −10 to −7 downstream promoter region together with A_-11 showed extensive interactions with Loop3—the “specificity loop” in ECF-σ^16,50. The latter loop specifically recognizes the −11 base in the X_-14G_-13T_-12Y_-11 (X = C,G; Y = A,T,C) motif^3,25, which spatially corresponds to T_-10 of PsigI6. Since this position is not conserved in σ^I-dependent promoters, we investigated whether Loop 3 plays a specificity role in the different σ^Is. Mutation of T_-10 of PsigI6 into different nucleotides resulted in different effects: T_-10c showed much higher activity than wild-type PsigI6, while T_-10g and T_-10a showed complete and partial loss of activity, respectively. Similarly, mutations H84 and S85 of SigI6, according to the mutation pattern in SigI1 (H84G/S85Y), SigI2 (H84N/S85M) and SigI3 (H84N/S85G), resulted in diverse effects (Fig. 3C). The latter inconsistent results indicated that the downstream region of the CGWA motif is likely a modulator of promoter activity but does not serve as a specificity determinant for the different σ^Is.

Structural comparison of active σ^I in the RPo complex and RsgI-bound inhibited σ^I

Our previous study showed that RsgI specifically binds to the C-terminal domain of cognate σ^I to inhibit σ^I activity and that the interface contains both conserved and non-conserved residues²³. Nevertheless, how this interaction inhibits σ^I activity is unclear. The structure of the RPo complex revealed only slight conformational changes between the active and inhibited states of SigIC (Fig. 4A), and the same surface binds to RNAP and RsgI (Fig. 4B, C), thus indicating that RsgI inhibits σ^I activity by competitive binding. SigIC binds βFTH through conserved hydrophobic surface residues (Fig. 4B and Fig. S7C), which partly overlap with the RsgI-binding residues (Fig. 4C and Fig. S7C). However, the interface area of the RsgI1-SigI1 interaction (1056 Å²) is much larger than that of SigI1C-βFTH (800 Å²), which might explain why the conserved σ^I-RNAP interaction is inhibited by the non-conserved interaction with RsgI.

**Fig. 4: Comparison of SigI1C in the active (i.e., in RPo complex) and the inactive (i.e., RsgI-bound) states.**

Discussion

Despite more than 20 years of study of the σ^Is since their discovery²², their classification remains confusing. σ^Is were initially classified as σ⁷⁰-family group III²⁶ and later reclassified as “ECF-like”^27,28 rather than ECF σ factors^3,25,51. Our structures of the RPo-σ^I complexes indicated that σ^I is indeed a unique type of σ factor that cannot be classified into canonical groups of the σ⁷⁰ family. Several features distinguish σ^I from the other groups. In this context, σ^I has a σ₂-domain that contains part of σR1.2 which only exists in members of groups I and II. In addition, the RPo-σ^I complex contains a bubble size similar to those of groups I-III. However, σ^I lacks a σ₃ domain which exists in the latter groups (Fig. 5A). Moreover, the −10 element binds to SigIN with more flipped-out bases than those of other σ⁷⁰-promoter complexes (Fig. 5B). Finally, although SigIC is responsible for recognition of the −35 element and is functionally similar to the σ₄-domain of the other σ⁷⁰ factors, it is completely different, both in terms of structure and DNA-binding mode (Fig. 5B). Therefore, σ^I factors represent a distinct member of the σ⁷⁰-family σ factors, thus highlighting the diversity of bacterial transcription.

**Fig. 5: Schematic diagrams of the promoter recognition by σ^I and by the four groups of σ⁷⁰ family σ factors.**

Intriguingly, the C-terminal σ^I domain binds to −35 DNA with a large binding surface that penetrates both major and minor grooves of the promoter DNA. The minor- and major-groove regions correspond to the previously identified A-tract and region of specificity²⁷, respectively, and the present study provides a structural basis for the function of the two regions. The manner of minor-groove binding by a single positively charged residue has been widely observed in DNA-binding proteins for specific A-tract or AT-rich DNA recognition^52,53. Major-groove-binding by SigIC is similar to winged helix-turn-helix (HTH) domains of transcription factors^46,54. However, evolutionary relationships were lacking between σ^I and the transcriptional factors upon comparing their homologous sequences in various bacteria. The similarity is likely caused by the convergent evolution of two different proteins for DNA binding.

The two structures reported here provide insight into the specificity of different σ^I paralogues in one bacterium. Non-conserved residues in the HTH motif of the SigIC domain specifically bind to the ROS in the promoter −35 element. In addition, the −12 nucleotide in the promoter −10 element plays a role in the specificity. Its downstream nucleotides show extensive interactions with σ^I and probably modulate the activities for each specific gene. The numbers of interacting residues in σ^I and interacting nucleotides in the promoter are much higher than those of the ECF σ factors, which may explain why one bacterium can maintain so many (up to sixteen) σ^Is for regulation with specificity²⁷. Since the σ^Is in C. thermocellum are responsible for regulating the expression of cellulosome components—thus comprising a potential “treasure-trove for biotechnology”^55,56, the promoter recognition mechanism revealed in this study provides the basis for future engineering of cellulosome production in cellulosome-producing bacteria⁵⁷. Furthermore, the unique binding mode and specificity mechanism of the σ^Is provide new possibilities to design regulators in synthetic biology for the design of orthogonal genetic switches and regulators^8,58.

Methods

Purification of RNAP core enzyme from C. thermocellum

The strains used in this study are listed in Table S2. The plasmids and primers used in this study are listed in Supplementary Data 1 and Supplementary Data 2, respectively. The C. thermocellum strain for the purification of RNAP was constructed using the previously developed homologous recombination method⁵⁹. Specifically, a strong constitutive promoter P₂₆₃₈⁶⁰ and an N-terminal His×10-tag were inserted before the RNAP β‘ gene (clo1313_0314) [https://www.ncbi.nlm.nih.gov/gene/12420012] in C. thermocellum strain ΔpyrF⁵⁹. The homology arms Bp-UP and Bp-DN and the promoter P₂₆₃₈ were amplified by PCR from C. thermocellum DSM1313 genomic DNA. The DNA fragments were ligated by either overlapping PCR or restriction enzyme digestion and T4 ligation, and finally the homologous recombination plasmid pHKm2-homo-5’Betap was obtained (Fig. S1A). The plasmid was transformed into C. thermocellum strain ΔpyrF by electroporation⁵⁹, generating the mutant DSM1313::P₂₆₃₈-His₁₀-β‘ after two rounds of screening. Transformants containing the plasmid pHKm2-homo-5’Betap were first screened on semi-solid GS-2 medium containing 3 μg/mL thiamphenicol (Tm). Then, the obtained transformant was screened with 10 μg/mL 5-fluoro-2-deoxyuridine (FUDR)-supplemented uracil auxotrophic MJ medium to generate the target mutant after homologous recombination. The mutant was verified by colony PCR and sequencing. C. thermocellum strains were routinely cultured anaerobically at 55 °C in GS-2 medium, supplemented with 5.0 g/L cellobiose as carbon source.

The RNAP core enzyme was directly purified from DSM1313::P₂₆₃₈-His₁₀-β‘. The cells were grown anaerobically at 55 °C in 50 L GS-2 medium supplemented with 5 g/L glucose as a carbon source. When the optical density at 600 nm (OD_600nm) reached 1.2 ~ 1.8, cells were collected by centrifugation at 10,200 g for 30 min. The cell pellet was suspended in 1.5 L buffer A (20 mM Tris-HCl pH 8.0, 500 mM NaCl, 30 mM imidazole, 5% (v/v) glycerol, and protease inhibitor cocktail (Roche)) and lysed by ultrasonication. The lysate was centrifuged at 15,000 g for 50 min at 4 °C, the supernatant was then loaded onto a 40-mL His-Trap FF affinity column (GE Healthcare Life Sciences) pre-equilibrated with buffer A, and RNAP was eluted by buffer B (20 mM Tris-HCl pH 8.0, 500 mM NaCl, 500 mM imidazole, 5% (v/v) glycerol). The complex was further purified using a 5-mL Hi-Trap Heparin column (GE Healthcare Life Sciences) pre-equilibrated with buffer C (20 mM Tris-HCl pH 8.0, 100 mM NaCl, 2 mM DTT, 0.2 mM EDTA, 5% (v/v) glycerol, and protease inhibitor cocktail (Roche)), and the RNAP was eluted with buffer D (20 mM Tris-HCl pH 8.0, 1 M NaCl, 2 mM DTT, 0.2 mM EDTA, 5% (v/v) glycerol) by a linear gradient. The fractions containing RNAP were collected and loaded on a Source Q column (GE Healthcare Life Sciences). After elution by a linear gradient of NaCl to the final concentration of 1 M, the fractions containing RNAP core enzyme were collected, concentrated to 3 mg/mL, and stored at −80 °C. The subunits of the RNAP core enzyme in the purified proteins were identified by SDS-PAGE.

Purification of recombinant SigI1 and SigI6

The expression and purification of SigI1, SigI6, and mutants of SigI6 (C167S, R215A, R214A, H171R, H171A, R172A, K170A, R104A, and K16A) in Escherichia coli followed the procedures for SigI1 reported in a previous study²³. Briefly, the gene fragments encoding full-length SigI1 and SigI6 were cloned into the pET28a-SMT3 plasmid, generating the plasmids pET28a-SMT3-SigI1 and pET28a-SMT3-SigI6. Each mutant of SigI6 was constructed by site-directed mutagenesis using the QuikChange method. All the plasmids were transformed into E. coli BL21 (DE3) for protein expression. The wild-type SigI6 showed poor stability during the purification, and the mutant SigI6-C167S showed much better stability. Therefore, SigI6-C167S was purified and used in the structural study. The recombinant proteins were first purified by a nickel-affinity column and then purified further by size-exclusion chromatography with a HiLoad 16/600 Superdex 75 column with buffer E (20 mM Tris-HCl pH 8.0, 150 mM NaCl, 2 mM DTT, 0.2 mM EDTA, 5% (v/v) glycerol, 10 mM MgCl₂). The purity of recombinant proteins was detected using SDS-PAGE (Fig. S10).

Nucleic acid scaffolds

Double-stranded nucleic acid scaffolds for the cryo-EM study of RPo-SigI1 and RPo-SigI6 were prepared from synthetic oligos (Table S3) by annealing the DNA (heating at 95 °C for 5 min and then allowing the DNA to cool slowly to room temperature). The annealing buffer contains 20 mM Tris-HCl pH 8.0, 200 mM NaCl, and 10 mM MgCl₂.

Double-stranded nucleic acid scaffolds for the fluorescence-detected in vitro transcription assay were prepared by PCR using pUC19-PsigI6-Mango-tR2 as a template. DNA sequences and primers used for the in vitro transcription assay are listed in Tables S4 and Supplementary Data 2, respectively.

Reconstitution of the RPo-σ^I complex

To reconstitute the RPo-SigI1 and RPo-SigI6 complexes for cryo-EM, the purified C. thermocellum RNAP core enzyme, purified recombinant SMT3-SigI1 or SMT3-SigI6-C167S, and annealed nucleic-acid scaffold were mixed at 1:3:1.3 molar ratio and incubated at 4 °C overnight. The reconstituted RPo-σ^I complex was further treated with ULP1 protease to remove the SMT3 tag of SMT3-SigI1/6. The RPo-σ^I complexes were concentrated to 500 μL and then purified using a Superdex 200 Increase 10/300 GL column in buffer E. The fractions of the RPo-σ^I complex were collected and concentrated for cryo-EM sample preparation. The subunits and DNA scaffolds of RPo complexes were identified by SDS-PAGE and Native-PAGE (Fig. S1E, F).

Cryo-EM grid preparation

The purified samples (12–24 mg/mL protein) were mixed with 8 mM CHAPSO (final concentration) and 0.1 mM DTT. Quantifoil R1.2/1.3 holey carbon grids were glow-discharged for 90 s before the application of 3 μL of the sample. After blotting for 6–8 s with a blot force of 2 N, the grids were plunge-frozen in liquid ethane using an FEI Vitrobot Mark IV (FEI, Hillsboro) with 95% chamber humidity at 10 °C.

Cryo-EM data acquisition and processing

The grids were imaged using a 300-keV Titan Krios equipped with a K2 Summit direct electron detector (Gatan) and a GIF quantum energy filter (slit width 20 eV). Data were collected at a nominal magnification of ×22,500 (1.04 Å/pixel) with a dose rate of 8 electrons/pixel/s on the sample (~7.8 electrons/pixel/s on the detector). All images were recorded using Serial EM⁶¹ with super-resolution counting mode for 7.6 s exposures in 32 subframes to give a total dose of 60 electrons/Å² with defocus range of −1.5 to −2.5 μm.

Motion correction and CTF estimation of cryo-EM movies were performed using Warp⁶², and particles were picked using an instance of Warp’s neural network retrained on 100 selected micrographs RPo-SigI1 data sets and RPo-SigI6 data sets. Particles were extracted in Warp and subsequently classified in cryoSPARC⁶³.

For the RPo-SigI6 dataset, the initial model was generated by Mycobacterium tuberculosis wild-type RNAP holoenzyme/RbpA/CarD/Sor/AP3—RP2 class (EMD-22575) as a template to 3D classify the particles using cryoSPARC heterogeneous refinement. The best class was selected as the reference to classify the particles for 3D classification with alignment in RELION⁶⁴. A collection of 120591 particles was selected to perform autorefinement. Focused classification (without alignment) of the SigI6C terminal was performed to improve the local density of the SigI6 and binding DNA. To further clean the dataset, CryoDRGN⁶⁵ was used to classify particles, and three similar classes were selected to perform the non-uniform refinement. The map was estimated to be at a resolution of 3.58 Å in RELION, and further processing by density modification with the ResolveCryoEM program⁶⁶ improved the map quality and resolution to 3.36 Å.

For the RPo-SigI1 dataset, the extracted particles were first 3D- and 2D-classified in cryoSPARC to discard poor particles. The remaining particles were then subjected to 3D classification in Relion and refinement in cryoSPARC to obtain a reconstructed map. To improve the local density of SigI1 and binding DNA, focused classification (without alignment) of the SigI1C terminus was performed in Relion. All particles in the best class during the focused classification were then subjected to non-uniform refinement in cryoSPARC, resulting in a map with an overall resolution of 3.03 Å. Post-processing of the density map generated during refinement was performed using DeepEMhancer⁶⁷. Local resolution estimations were calculated within RELION. The procedures for Cryo-EM structure determination of RPo-SigI1 and RPo-SigI6 are shown in Fig. S2.

Model building and refinement

The final cryoEM map for RPo-SigI1 and RPo-SigI6 complexes was used for initial model building. The crystal structure of Mycobacterium tuberculosis RPtic-σ^H complex structure (PDB ID 5ZX2) [https://www.rcsb.org/structure/5ZX2] was placed in the cryoEM maps of the RPo-SigI1 and RPo-SigI6 complexes, by rigid-body fitting with UCSF Chimera⁶⁸. The RNAP subunits in RPo-σ^I complexes were manually rebuilt into the cryoEM map referring to the fit RPtic-σ^H structure. The individual models of SigI1 and SigI6 were built referring to the structure predicted by Alphafold³⁷. The model was completed and manually adjusted residue-by-residue with real-space refinement in Coot⁶⁹, and then followed by real-space refinement in PHENIX⁷⁰. The models were visualized with UCSF Chimera, UCSF ChimeraX⁷¹, and PyMOL (http://www.pymol.org/).

Bacillus subtilis strain construction

A heterologous B. subtilis host system was constructed to study the σ^I-dependent promoter activities, referring to the published system which has been successfully used to study the activities of σ^Is from C. thermocellum and Pseudobacteroides cellulosolvens^27,28. Plasmids and primers in the present work are listed in Supplementary Data 1 and Supplementary Data 2, respectively. Two plasmids pULacZ and pAX05 were constructed to integrate the lacZ reporter gene and the C. thermocellum sigI6 gene into the amyE and sigI-rsgI loci, respectively, of B. subtilis. The plasmid pAX05 (Fig. S9A) was constructed from plasmid pAX01 carrying an erythromycin (Erm) resistance cassette and the xylose-inducible promoter PxylA^27,71. The upstream (1011 bp) and downstream (1011 bp) regions of B. subtilis sigI-rsgI operon were used as the homologous recombination arms and amplified using primer pairs sigI-F1/sigI-R1 and rsgI-F1/rsgI-R1, respectively, from the genomic DNA of B. subtilis strain 168. The C. thermocellum sigI6 gene was amplified using the primer pair Bs-sigI6-F1/ Bs-sigI6-R1. Then the DNA fragments of the homologous recombination arms, the promoter PxylA, the sigI6 gene, and the linearized pAX01 vector generated by PCR were ligated simultaneously with the One Step Cloning Kit (Vazyme), thereby obtaining the pAX05 plasmid. The plasmid pULacZ was constructed from the pUC19 vector (Fig. S9B). A spectinomycin (Spc)-resistance gene as a selectable marker⁷² was amplified from plasmid pLH-16 (provided by Mr. Hui Li, Qingdao Institute of Bioenergy and Bioprocess Technology). The upstream (1074 bp) and downstream (825 bp) regions of B. subtilis amyE were used as the homologous recombination arms and amplified using primer pairs amyE-F1/amyE-R1 and amyE-F2/amyE-R2. The reporter lacZ gene was amplified from the E. coli genome using primer pairs lacZ-F1/lacZ-R1. The promoter PsigI6 was amplified from C. thermocellum genomic DNA. Then the DNA fragments and the linearized pUC19 vector generated by PCR were ligated, generating the pULacZ plasmid. The plasmids containing the mutation of SigI or PsigI6 were obtained by site-directed mutagenesis using the primer pairs listed in Supplementary Data 2. All the used strains are listed in Table S2.

B. subtilis strains were grown on LB, SM1, or SM2 media⁷³ at 37 °C. The competent cells of B. subtilis 168 were prepared following the reported protocol⁷³. B. subtilis 168 was transformed with pAX05 and pULacZ plasmids successively. The transformants were selected with 3 µg/mL Erm and 100 µg/mL Spc. Chromosomal integration of plasmids by a double-crossover event was confirmed by colony PCR using the primers listed in Supplementary Data 2.

Promoter activity analysis by the B. subtilis reporter system

To measure the β-galactosidase activity of LacZ in the B. subtilis reporter system, strain samples were inoculated into MCSE media with Erm and Spc, and the culture was shaken at 250 rpm until OD_600nm = 0.4–0.5. Then xylose was added to the final concentration of 1% to induce the expression of SigI for 2 h^28,74. The β-galactosidase activity was analyzed using ortho-nitrophenyl-β-galactoside (ONPG) as the substrate according to the previously described procedures²⁸. Briefly, 4 mL of the cell cultures was centrifuged at 5000 g for 10 min, and the cell pellet was washed twice with Z-buffer (60 mM Na₂HPO₄, 40 mM NaH₂PO₄, 10 mM KCl, 1 mM MgSO₄, pH 7.0) and resuspend in 700 μL working buffer (60 mM Na₂HPO₄, 40 mM NaH₂PO₄, 10 mM KCl, 1 mM MgSO₄, and 2.7 mM β-mercaptoethanol, pH 7.0). The cells were lysed by ultrasonication and the lysate was centrifuged at 13,000 g for 10 min. The 100 μL enzymatic reaction system contained different volumes of cell lysate, 10 μL ONPG stocking solution (13.1 mg/mL in double distilled water), and the working buffer to make up a volume of 100 μL. The reaction system was incubated at 37 °C for a certain period and then 40 μL of the reaction solution was added to 200 μL 1 M Na₂CO₃ to terminate the reaction. The released 2-nitrophenol (ONP) was measured by determining the absorbance at 420 nm (A_420nm). One unit of enzyme activity was defined as the amount of β-galactosidase that releases 1 nmol of ONP per minute. The enzymatic activity was normalized with cell density (OD_600nm).

Fluorescence-detected in vitro transcription assay

The measurement of transcription activity was conducted by utilizing the significantly enhanced fluorescence of TO1–3PEG-Biotin when the Mango riboswitch is engaged⁷⁵, which has been successfully used to study transcriptional activities of various RNAPs^39,76. Briefly, to measure the transcriptional activity of SigI6 mutants or PsigI6 mutants, reaction mixtures (20 μL), containing the C. thermocellum RNAP core enzyme (final concentration 50 nM), promoter DNA or its mutants (final concentration 50 nM), and SigI6 or its mutants (100 nM) in reaction buffer (50 mM Tris-HCl pH 7.9, 100 mM KCl, 10 mM MgCl₂, 1 mM DTT, 5% glycerol, and 0.01% Tween-20), were incubated at room temperature for 10 min. The reactions were initiated by the addition of 2 μL NTP mixture (UTP, ATP, GTP, and CTP; final concentration 0.1 mM of each) and 2 μL TOl-3PEG-Biotin (final concentration 0.5 μM), and the reaction mixture was incubated at 55 °C for 30 min. The fluorescence signals were measured using a plate reader (SpectraMax M2, Molecular Devices) at an excitation wavelength of 510 nm and an emission wavelength of 550 nm.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The models and cryo-EM maps have been deposited into the Protein Data Bank and the EMDB under accession numbers 8I23 and EMD-35130 for RPo-SigI1 and 8I24 and EMD-35131 for RPo-SigI6, respectively. Other structure data used in this study for analysis (7MKP, 6CA0, 7CKQ, 6MPJ, 5ZX2, 6IVU) are available in the Protein Data Bank. Protein sequences used in this study are available from Uniprot under accession codes A3DBH0 (SigI1) [https://www.uniprot.org/uniprotkb/A3DBH0/entry], A3DH98 (SigI6) [https://www.uniprot.org/uniprotkb/A3DH98/entry]. Source data are provided in this paper.

References

Murakami, K. S., Masuda, S., Campbell, E. A., Muzzin, O. & Darst, S. A. Structural basis of transcription initiation: an RNA polymerase holoenzyme-DNA complex. Science 296, 1285–1290 (2002).
Article ADS CAS PubMed Google Scholar
Marr, M. T. & Roberts, J. W. Promoter recognition as measured by binding of polymerase to nontemplate strand oligonucleotide. Science 276, 1258–1260 (1997).
Article CAS PubMed Google Scholar
Staroń, A. et al. The third pillar of bacterial signal transduction: classification of the extracytoplasmic function (ECF) σ factor protein family. Mol. Microbiol. 74, 557–581 (2009).
Article PubMed Google Scholar
Lonetto, M., Gribskov, M. & Gross, C. A. The σ⁷⁰ family: sequence conservation and evolutionary relationships. J. Bacteriol. 174, 3843–3849 (1992).
Article CAS PubMed PubMed Central Google Scholar
Paget, M. S. & Helmann, J. D. The σ⁷⁰ family of sigma factors. Genome Biol. 4, 203 (2003).
Article PubMed PubMed Central Google Scholar
Pinto, D., Liu, Q. & Mascher, T. ECF σ factors with regulatory extensions: the one-component systems of the σ universe. Mol. Microbiol. 112, 399–409 (2019).
Article CAS PubMed Google Scholar
Brooks, B. E. & Buchanan, S. K. Signaling mechanisms for activation of extracytoplasmic function (ECF) sigma factors. Biochim. Biophys. Acta 1778, 1930–1945 (2008).
Article CAS PubMed Google Scholar
Pinto, D. et al. Engineering orthogonal synthetic timer circuits based on extracytoplasmic function σ factors. Nucleic Acids Res. 46, 7450–7464 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zuo, Y. & Steitz, T. A. Crystal structures of the E. coli transcription initiation complexes with a complete bubble. Mol. Cell 58, 534–540 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bae B., Feklistov A., Lass-Napiorkowska A., Landick R. & Darst S. A. Structure of a bacterial RNA polymerase holoenzyme open promoter complex. eLife 4, e08504 (2015).
Article PubMed PubMed Central Google Scholar
Cartagena, A. J. et al. Structural basis for transcription activation by Crl through tethering of σ^S and RNA polymerase. Proc. Natl. Acad. Sci. USA 116, 18923–18927 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, B., Zuo, Y. & Steitz, T. A. Structures of E. coli σ^S-transcription initiation complexes provide new insights into polymerase mechanism. Proc. Natl. Acad. Sci. USA 113, 4051–4056 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Shi, W. et al. Structural basis of bacterial σ²⁸ -mediated transcription reveals roles of the RNA polymerase zinc-binding domain. EMBO J. 39, e104389 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lu, Q. et al. Structural insight into the mechanism of σ³²-mediated transcription initiation of bacterial RNA polymerase. Biomolecules 13, 738 (2023).
Article CAS PubMed PubMed Central Google Scholar
Fang, C. et al. Structures and mechanism of transcription initiation by bacterial ECF factors. Nucleic Acids Res. 47, 7094–7104 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, L., Fang, C., Zhuang, N., Wang, T. & Zhang, Y. Structural basis for transcription initiation by bacterial ECF σ factors. Nat. Commun. 10, 1153 (2019).
Article ADS PubMed PubMed Central Google Scholar
Lin, W. et al. Structural basis of ECF-σ-factor-dependent transcription initiation. Nat. Commun. 10, 710 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Glyde, R. et al. Structures of RNA polymerase closed and intermediate complexes reveal mechanisms of DNA opening and transcription initiation. Mol. Cell 67, 106–116 (2017).
Article CAS PubMed PubMed Central Google Scholar
Campbell, E. A., Kamath, S., Rajashankar, K. R., Wu, M. & Darst, S. A. Crystal structure of Aquifex aeolicus σ^N bound to promoter DNA and the structure of σ^N-holoenzyme. Proc. Natl Acad. Sci. USA 114, E1805–E1814 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Nataf, Y. et al. Clostridium thermocellum cellulosomal genes are regulated by extracytoplasmic polysaccharides via alternative sigma factors. Proc. Natl Acad. Sci. USA 107, 18646–18651 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Ramaniuk, O. et al. σ^I from Bacillus subtilis: impact on gene expression and characterization of σ^I-dependent transcription that requires new types of promoters with extended −35 and −10 elements. J. Bacteriol. 200, e00251-18 (2018).
Article PubMed PubMed Central Google Scholar
Zuber, U., Drzewiecki, K. & Hecker, M. Putative sigma factor SigI (YkoZ) of Bacillus subtilis is induced by heat shock. J. Bacteriol. 183, 1472–1475 (2001).
Article CAS PubMed PubMed Central Google Scholar
Wei, Z. et al. Alternative σ^I/anti-σ^I factors represent a unique form of bacterial σ/anti-σ complex. Nucleic Acids Res. 47, 5988–5997 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kahel-Raifer, H. et al. The unique set of putative membrane-associated anti-σ factors in Clostridium thermocellum suggests a novel extracellular carbohydrate-sensing mechanism involved in gene regulation. FEMS Microbiol. Lett. 308, 84–93 (2010).
Article CAS PubMed Google Scholar
Casas-Pastor, D. et al. Expansion and re-classification of the extracytoplasmic function (ECF) σ factor family. Nucleic Acids Res. 49, 986–1005 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gruber, T. M. & Gross, C. A. Multiple sigma subunits and the partitioning of bacterial transcription space. Annu. Rev. Microbiol. 57, 441–466 (2003).
Article CAS PubMed Google Scholar
Ortiz de Ora, L. et al. Regulation of biomass degradation by alternative σ factors in cellulolytic clostridia. Sci. Rep. 8, 11036 (2018).
Article ADS PubMed PubMed Central Google Scholar
Muñoz-Gutiérrez, I. et al. Decoding biomass-sensing regulons of Clostridium thermocellum alternative sigma-I factors in a heterologous Bacillus subtilis host system. PLoS ONE 11, e0146316 (2016).
Article PubMed PubMed Central Google Scholar
Izquierdo, J. A. et al. Complete genome sequence of Clostridium clariflavum DSM 19732. Stand. Genom. Sci. 6, 104–115 (2012).
Article MathSciNet CAS Google Scholar
Yaniv, O. et al. Fine-structural variance of family 3 carbohydrate-binding modules as extracellular biomass-sensing components of Clostridium thermocellum anti-σ^I factors. Acta Crystallogr. Sect. D Struct. Biol. 70, 522–534 (2014).
Article ADS CAS Google Scholar
Ding, X. K., Chen, C., Cui, Q., Li, W. L. & Feng, Y. G. Resonance assignments of the periplasmic domain of a cellulose-sensing trans-membrane anti-sigma factor from Clostridium thermocellum. Biomol. NMR Assign. 9, 321–324 (2015).
Article CAS PubMed Google Scholar
Chen, C. et al. Essential autoproteolysis of bacterial anti-σ factor RsgI for transmembrane signal transduction. Sci. Adv. 9, eadg4846 (2023).
Article CAS PubMed PubMed Central Google Scholar
Brunet, Y. R., Habib, C., Brogan, A. P., Artzi, L. & Rudner, D. Z. Intrinsically disordered protein regions are required for cell wall homeostasis in Bacillus subtilis. Genes Dev. 36, 970–984 (2022).
CAS PubMed PubMed Central Google Scholar
Narayanan, A. et al. Cryo-EM structure of Escherichia coli σ⁷⁰ RNA polymerase and promoter DNA complex revealed a role of σ non-conserved region during the open complex formation. J. Biol. Chem. 293, 7367–7375 (2018).
Article CAS PubMed PubMed Central Google Scholar
Shi, J. et al. Transcription activation by a sliding clamp. Nat. Commun. 12, 1131 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Boyaci, H., Chen, J., Jansen, R., Darst, S. A. & Campbell, E. A. Structures of an RNA polymerase promoter melting intermediate elucidate DNA unwinding. Nature 565, 382–385 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, X. et al. Molecular basis of TcdR-dependent promoter activity for toxin production by Clostridioides difficile studied by a heterologous reporter system. Toxins 15, 306 (2023).
Article CAS PubMed PubMed Central Google Scholar
Fang, C. et al. CueR activates transcription through a DNA distortion mechanism. Nat. Chem. Biol. 17, 57–64 (2021).
Article CAS PubMed Google Scholar
Burkhoff, A. M. & Tullius, T. D. The unusual conformation adopted by the adenine tracts in kinetoplast DNA. Cell 48, 935–943 (1987).
Article CAS PubMed Google Scholar
Neidle S. In Principles of Nucleic Acid Structure (ed. Neidle S.). (Academic Press, 2008).
Barbic, A., Zimmer, D. P. & Crothers, D. M. Structural origins of adenine-tract bending. Proc. Natl. Acad. Sci. USA 100, 2369–2373 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Fang, C. et al. The bacterial multidrug resistance regulator BmrR distorts promoter DNA to activate transcription. Nat. Commun. 11, 6284 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Lane, W. J. & Darst, S. A. The structural basis for promoter −35 element recognition by the group IV sigma factors. PLoS Biol. 4, e269 (2006).
Article PubMed PubMed Central Google Scholar
Holm, L. & Rosenström, P. Dali server: conservation mapping in 3D. Nucleic Acids Res. 38, 275–288 (2010).
Article Google Scholar
Light, S. H., Cahoon, L. A., Halavaty, A. S., Freitag, N. E. & Anderson, W. F. Structure to function of an α-glucan metabolic pathway that promotes Listeria monocytogenes pathogenesis. Nat. Microbiol. 2, 16202 (2016).
Article CAS PubMed Google Scholar
Chen, J. et al. Stepwise promoter melting by bacterial RNA polymerase. Mol. Cell 78, 275–288 (2020).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Saecker, R. M. et al. Structural origins of Escherichia coli RNA polymerase open promoter complex stability. Proc. Natl. Acad. Sci. USA 118, e2112877118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, F. et al. Structural basis for transcription inhibition by E. coli SspA. Nucleic Acids Res. 48, 9931–9942 (2020).
Article CAS PubMed PubMed Central Google Scholar
Campagne, S., Marsh, M. E., Capitani, G., Vorholt, J. A. & Allain, F. H. T. Structural basis for −10 promoter element melting by environmentally induced sigma factors. Nat. Struct. Mol. Biol. 21, 269–276 (2014).
Article CAS PubMed Google Scholar
Pinto, D. & da Fonseca, R. R. Evolution of the extracytoplasmic function σ factor protein family. NAR Genom. Bioinform. 2, lqz026 (2020).
Article PubMed PubMed Central Google Scholar
Crane-Robinson, C., Dragan, A. I. & Privalov, P. L. The extended arms of DNA-binding domains: a tale of tails. Trends Biochem. Sci. 31, 547–552 (2006).
Article CAS PubMed Google Scholar
Rohs, R. et al. The role of DNA shape in protein–DNA recognition. Nature 461, 1248–1253 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Aravind, L., Anantharaman, V., Balaji, S., Babu, M. M. & Iyer, L. M. The many faces of the helix-turn-helix domain: transcription regulation and beyond. FEMS Microbiol. Rev. 29, 231–262 (2005).
Article CAS PubMed Google Scholar
Bayer, E. A., Morag, E. & Lamed, R. The cellulosome—a treasure-trove for biotechnology. Trends Biotechnol. 12, 379–386 (1994).
Article CAS PubMed Google Scholar
Feng, Y., Liu, Y.-J. & Cui, Q. Research progress in cellulosomes and their applications in synthetic biology. Synth. Biol. J. 3, 138–154 (2022).
Google Scholar
Ma, X., Ma, L. & Huo, Y. X. Reconstructing the transcription regulatory network to optimize resource allocation for robust biosynthesis. Trends Biotechnol. 40, 735–751 (2022).
Article CAS PubMed Google Scholar
Zong, Y. et al. Insulated transcriptional elements enable precise design of genetic circuits. Nat. Commun. 8, 52 (2017).
Article ADS PubMed PubMed Central Google Scholar
Zhang, J. et al. Efficient whole-cell-catalyzing cellulose saccharification using engineered Clostridium thermocellum. Biotechnol. Biofuels 10, 124 (2017).
Qi, K. et al. Coordinated β-glucosidase activity with the cellulosome is effective for enhanced lignocellulose saccharification. Bioresour. Technol. 337, 125441 (2021).
Article CAS PubMed Google Scholar
Mastronarde, D. N. Automated electron microscope tomography using robust prediction of specimen movements. J. Struct. Biol. 152, 36–51 (2005).
Article PubMed Google Scholar
Tegunov, D. & Cramer, P. Real-time cryo-electron microscopy data preprocessing with Warp. Nat. Methods 16, 1146–1152 (2019).
Article CAS PubMed PubMed Central Google Scholar
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article CAS PubMed Google Scholar
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166 (2018).
Zhong, E. D., Bepler, T., Berger, B. & Davis, J. H. CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks. Nat. Methods 18, 176–185 (2021).
Article CAS PubMed PubMed Central Google Scholar
Terwilliger, T. C., Ludtke, S. J., Read, R. J., Adams, P. D. & Afonine, P. V. Improvement of cryo-EM maps by density modification. Nat. Methods 17, 923–927 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sanchez-Garcia, R. et al. DeepEMhancer: a deep learning solution for cryo-EM volume post-processing. Commun. Biol. 4, 874 (2021).
Article PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF Chimera—visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. Sect. D. Biol. Crystallogr. 66, 486–501 (2010).
Article ADS CAS Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. Sect. D. Biol. Crystallogr. 66, 213–221 (2010).
Article ADS CAS Google Scholar
Goddard, T. D. et al. UCSF ChimeraX: meeting modern challenges in visualization and analysis. Protein Sci. 27, 14–25 (2018).
Article CAS PubMed Google Scholar
Härtl, B., Wehrl, W., Wiegert, T., Homuth, G. & Schumann, W. Development of a new integration site within the Bacillus subtilis chromosome and construction of compatible expression cassettes. J. Bacteriol. 183, 2696–2699 (2001).
Article PubMed PubMed Central Google Scholar
Guiziou, S. et al. A part toolbox to tune genetic expression in Bacillus subtilis. Nucleic Acids Res. 44, 7495–7508 (2016).
CAS PubMed PubMed Central Google Scholar
Radeck, J. et al. The Bacillus BioBrick Box: generation and evaluation of essential genetic building blocks for standardized work with Bacillus subtilis. J. Biol. Eng. 7, 29 (2013).
Article PubMed PubMed Central Google Scholar
Trachman, R. J. 3rd et al. Structural basis for high-affinity fluorophore binding and activation by RNA Mango. Nat. Chem. Biol. 13, 807–813 (2017).
Article CAS PubMed PubMed Central Google Scholar
He, D. et al. Pseudomonas aeruginosa SutA wedges RNAP lobe domain open to facilitate promoter DNA unwinding. Nat. Commun. 13, 4204 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Prof. Yu Zhang and Miss Linlin You (Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences) for their helpful suggestions during this study. We thank Mr. Hui Li (Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences) for providing the plasmid carrying the spectinomycin-resistance gene. We thank Professor Dawei Zhang and Dr. Gang Fu (Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences) for their helpful comments during the construction of the Bacillus subtilis reporter system. This work was supported by the National Natural Science Foundation of China [32070125 to Y.F., 3217005 to Q.C., 32070028 to Y.-J.L., 32241029 and 31730023 to P.Z.]; the National Key Research and Development Program of China [2021YFA1300100 to P.Z.]; Shandong Energy Institute [SEI S202106 to Q.C., SEI I202106 to Y.F.]; Qingdao Independent Innovation Major Project [21-1-2-23-hz to Y.-J.L.]; Strategic Priority Research Program of the Chinese Academy of Sciences [XDA21060201 to Q.C., XDB37010100 to P.Z.]. E.A.B. is the incumbent of The Maynard I. and Elaine Wishner Chair of Bio-organic Chemistry. The funders had no role in the study design, data collection, and interpretation, or the decision to submit the work for publication.

Author information

These authors contributed equally: Jie Li, Haonan Zhang.
These authors jointly supervised this work: Yingang Feng, Ping Zhu.

Authors and Affiliations

CAS Key Laboratory of Biofuels, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, 266101, Qingdao, Shandong, China
Jie Li, Ya-Jun Liu, Qiu Cui & Yingang Feng
Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, 266101, Qingdao, Shandong, China
Jie Li, Ya-Jun Liu, Qiu Cui & Yingang Feng
Shandong Engineering Laboratory of Single Cell Oil, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, 266101, Qingdao, Shandong, China
Jie Li, Ya-Jun Liu, Qiu Cui & Yingang Feng
Shandong Energy Institute, 266101, Qingdao, Shandong, China
Jie Li, Ya-Jun Liu, Qiu Cui & Yingang Feng
Qingdao New Energy Shandong Laboratory, 266101, Qingdao, Shandong, China
Jie Li, Ya-Jun Liu, Qiu Cui & Yingang Feng
University of Chinese Academy of Sciences, 100049, Beijing, China
Jie Li, Haonan Zhang, Dongyu Li, Ya-Jun Liu, Qiu Cui, Yingang Feng & Ping Zhu
National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, 100101, Beijing, China
Haonan Zhang, Dongyu Li & Ping Zhu
Department of Biomolecular Sciences, The Weizmann Institute of Science, 7610001, Rehovot, Israel
Edward A. Bayer
Department of Life Sciences and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, 8499000, Beer-Sheva, Israel
Edward A. Bayer

Authors

Jie Li
View author publications
You can also search for this author in PubMed Google Scholar
Haonan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Dongyu Li
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Jun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Edward A. Bayer
View author publications
You can also search for this author in PubMed Google Scholar
Qiu Cui
View author publications
You can also search for this author in PubMed Google Scholar
Yingang Feng
View author publications
You can also search for this author in PubMed Google Scholar
Ping Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: Y.F.; Methodology: J.L., H.Z., D.L.; Investigation: J.L., H.Z., D.L.; Visualization: J.L., H.Z., Y.F.; Funding acquisition: Y.J.L., Q.C., Y.F., P.Z.; Supervision: Y.F., P.Z.; Writing—original draft: J.L., H.Z., Y.F.; Writing—review & editing: Y.J.L., E.A.B., Q.C., Y.F., P.Z.

Corresponding authors

Correspondence to Yingang Feng or Ping Zhu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, J., Zhang, H., Li, D. et al. Structure of the transcription open complex of distinct σ^I factors. Nat Commun 14, 6455 (2023). https://doi.org/10.1038/s41467-023-41796-4

Download citation

Received: 22 March 2023
Accepted: 15 September 2023
Published: 13 October 2023
DOI: https://doi.org/10.1038/s41467-023-41796-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.