Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Nov 29:7:e40308.
doi: 10.7554/eLife.40308.

Evolutionary pathways of repeat protein topology in bacterial outer membrane proteins

Affiliations

Evolutionary pathways of repeat protein topology in bacterial outer membrane proteins

Meghan Whitney Franklin et al. Elife. .

Abstract

Outer membrane proteins (OMPs) are the proteins in the surface of Gram-negative bacteria. These proteins have diverse functions but a single topology: the β-barrel. Sequence analysis has suggested that this common fold is a β-hairpin repeat protein, and that amplification of the β-hairpin has resulted in 8-26-stranded barrels. Using an integrated approach that combines sequence and structural analyses, we find events in which non-amplification diversification also increases barrel strand number. Our network-based analysis reveals strand-number-based evolutionary pathways, including one that progresses from a primordial 8-stranded barrel to 16-strands and further, to 18-strands. Among these pathways are mechanisms of strand number accretion without domain duplication, like a loop-to-hairpin transition. These mechanisms illustrate perpetuation of repeat protein topology without genetic duplication, likely induced by the hydrophobic membrane. Finally, we find that the evolutionary trace is particularly prominent in the C-terminal half of OMPs, implicating this region in the nucleation of OMP folding.

Keywords: E. coli; beta barrels; beta hairpin; evolutionary biology; molecular biophysics; outer membrane beta barrels; outer membrane proteins; repeat proteins; structural biology.

PubMed Disclaimer

Conflict of interest statement

MF, SN, RF, RK, JS No competing interests declared, NB Reviewing editor, eLife

Figures

Figure 1.
Figure 1.. The prototypical barrels at E-values ≤ 10−3 to 10−12 colored by number of strands.
Each edge represents sequence similarity between two nodes (barrels) with an E-value less than or equal to the subcaption. Multiple edges between the same pair of nodes have been removed for clarity; the edge with the lowest E-value was kept for visualization. (A) E-values ≤ 10−3, (B) E-values ≤ 10−5, (C) E-values ≤ 10−7, (D) E-values ≤ 10−12. A gif toggling through the E-values is shown in Video 1. The equivalent of Figure 1 at 25% sequence similarity is shown in Figure 1—figure supplement 1. Supplementary file 1 lists the list the PDBs of the full OMBB dataset. Figure 1—source data 1 show all the alignments.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Prototypical barrels at 25% sequence similarity at an E-value ≤10–3 to 10–12 colored by number of strands.
Each edge represents an interaction between two nodes (barrels) with an E-value less than or equal to the subcaption. Multiple edges between the same pair of nodes have been removed for clarity; the edge with the lowest E-value was kept for visualization.
Figure 2.
Figure 2.. Similarity is higher among prototypical barrels with the same number of strands.
Probability of barrels aligning with other prototypical barrels of the same number of strands compared to the probability of barrels aligning with prototypical barrels with different numbers of strands. Only prototypical OMBBs with <25% sequence similarity were considered. Alignments of OMBBs shown at E-values < 10−3 (left), E-values < 10−5 (center), E-values < 10−7 (right). If multiple alignments between two proteins existed, we used only the alignment with the smallest E-value. Probability was calculated as the number of alignments in each group divided by the possible combinations of barrel alignments for each group. Moreover, we find that E-values are always lower among barrels of the same strand number than between barrels of a different strand number (Figure 2—source data 1). Although barrel residues increase in size as they increase in strand number (Figure 2—figure supplement 1) such differences do not mute the relationships between barrels of different sizes.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Average side chain volume by the size of the barrel in strands.
Inward-facing residues face towards the barrel pore, while the outward-facing residues face the membrane. R2 values for the line of best fit are shown.
Figure 3.
Figure 3.. Internal repeats of the prototypical barrels.
(A) Distribution of all prototypical barrels with an internal repeat. The blue bars represent the total counts, while the red bars represent the counts with an internal repeat. (B) Internal repeats are identified for the barrels that are circled. Red circles indicate a single hairpin shift; black indicates a double hairpin shift (See text for definition). (C,D) Strands are shown going right to left, N to C, following the orientation in the membrane, with up indicating extracellular and down indicating intracellular. Dark grey strands indicate that all barrels of that size share those strands in the repeat; light grey strands indicate that only some of the barrels have those strands involved in the internal repeat. White strands indicate that these strands are never observed to participate in an internal repeat. The top row of numbers represents the strand number and the bottom row of numbers represents the strand numbers that align with the top numbers in the internal repeat. (C) 8-stranded barrels, (D) 16-stranded barrels. Figure 3—figure supplement 1 shows the breakdown of Figure 3A by type of repeat as well as the repeat patterns for 12- and 14-stranded barrels. Figure 3—source data 1 show all internal alignments.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Additional details regarding internal repeats.
Strands are shown going right to left, N to C, following the orientation in the membrane, with up indicating extracellular and down indicating intracellular. Dark grey strands indicate that all barrels of that size share those strands in the repeat; light grey strands indicate that only some of the barrels have those strands involved in the internal repeat. White strands indicate that these strands are never observed to participate in an internal repeat. The top row of numbers represents the strand number and the bottom row of numbers represents the strand numbers that align with the top numbers in the internal repeat. (A) Distribution by barrel size of the types of internal repeats. Compare with Figure 3A. (B) Distribution of internal repeat type identified in the prototypical barrels. (C and D) Diagram of the internal repeat for the 12- and 14- stranded barrels.
Figure 4.
Figure 4.. Dominant mode of persistence of the eight strands in strand diversification from 8 to 22 strands at an E-value ≤10−4.
8-stranded proteins are found to align to proteins with 10, 12, 14 and 16 strands. The strands that initially align from the 8-stranded barrels in the 16-stranded barrels can be aligned from 16-stranded barrels to 18-stranded barrels. Moreover, the strands that align between the 8- and 14-stranded barrels align between the 14- and 22-stranded barrels. However, 8-stranded barrels themselves do not directly align with the 18- or 22- stranded barrels. Arrows indicate alignment between proteins of different strand numbers. Pink strands align with pink strands, red strands align with red strands, etc. An asterisk indicates multiple patterns are observed as described in the text though only the most prevalent one is shown. White strands are strands that are not found to align to the original eight strands between barrels of different sizes, although they may be part of the alignment between proteins of different strand numbers. Examples of sequence alignments between barrels of different sizes and the lengths of the alignments are in supplement Figure 4—figure supplement 1, Figure 4—source data 1 respectively.
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. Number of strands involved in alignments.
Number of strands involved in the alignments between barrels of different sizes at an E-value ≤10–5 at 85% sequence identity. The number of strands of the smaller barrel is counted.
Figure 5.
Figure 5.. The unusual alignments of the 16- to 18-stranded barrels.
(A) The subnetwork of unusual alignments with E-values shown above the arrow. 16-stranded barrels are shown in cyan; 18-stranded barrels are shown in blue. Full alignments with E-values are shown in Figure 5—source data 1. In (B) and (C) loops are colored to reflect the strands that they were or have become. Strand boundaries are determined by a combination of ϕ and φ angles and hydrogen bonding patterns as described in the materials and methods section. A representative alignment is shown below each diagram. Only strands are colored in the alignments to match the colors in the diagrams. (B) Loop to hairpin. A large loop in the left barrel (yellow/orange) becomes two strands in the right barrel. (C) Large rearrangement. In this rearrangement, the loop between a and b in the 16-stranded barrel becomes two strands in the 18-stranded barrel and strand b of the 16-stranded barrel becomes a loop in the 18-stranded barrel while the loop between c and d becomes a strand. Some barrels have alternative structures in which some amino acids in the extracellular region of the barrel may be defined as loop where in other structures they are defined as strand or vice versa. Only one of the 1653 alignments in the 10–3 E-value network changes strand number alignment based on alternative structure. That alignment is shown in Figure 5—figure supplement 1 and does not change any data shown in Figure 5.
Figure 5—figure supplement 1.
Figure 5—figure supplement 1.. Changed alignment using alternative structures.
The only alignment with a different number of strands involved using the alternative structures, is the alignment between PDB IDs 2wjr and 2 × 9 k. While the number of strands are identical, in the alternative alignment using the strand definitions of PDB ID 5mwv, it is only aligned with gap residues and therefore the alignment is considered to be one strand shorter.
Figure 6.
Figure 6.. Conservation of strands in the prototypical barrels at <85% sequence identity.
For each alignment involving two prototypical barrels, we determined the identity of the strands in each barrel. Each graph represents the distribution of the strands reused in that barrel size. Figure 6—figure supplement 1 shows a similar trend at 25% sequence identity. The length distribution of these alignment (at 85% sequence) is shown in Figure 6—figure supplement 2.
Figure 6—figure supplement 1.
Figure 6—figure supplement 1.. Identity of stands in alignments at 25% sequence identity.
Identity of strands involved in alignments at 25% sequence identity. For each alignment involving two prototypical barrels, we determined the identity of the strands in each barrel. Each graph represents the distribution of the strands reused in that barrel size.
Figure 6—figure supplement 2.
Figure 6—figure supplement 2.. Number of strands in alignments.
Length of the strands involved in the alignments between barrels at 85% sequence identity and an E-value ≤10–3.
Figure 7.
Figure 7.. Possible mechanisms of OMBB folding.
The C-terminal half of beta barrels evince a strong evolutionary trace. The strong evolutionary trace may be indicative of nucleation. (A) Nucleation on side of barrel: Such nucleation could occur with the second half of the nascent barrel (red squiggly lines) forming beta hairpins in the membrane (boundaries shown with gray planes) along strands of the outer membrane translocation machinery, BamA with POTRA domains (solid blue barrel with five attached spheres). The rest of the nascent barrel would use the formed strands as a template. (B) Nucleation in incorporation of barrel: the strong evolutionary trace may also be indicative of beta strand incorporation, whereby the second half of the beta barrel is incorporated into BamA. In this model, the conserved strands form new strands in the BamA barrel while the N-terminal half of the protein inserts through the center of the barrel. Ultimately the nascent barrel buds off of the BamA, creating a new OMBB.

Comment in

  • doi: 10.7554/eLife.44076

Similar articles

Cited by

References

    1. Arnold T, Poynor M, Nussberger S, Lupas AN, Linke D. Gene duplication of the eight-stranded beta-barrel OmpX produces a functional pore: a scenario for the evolution of transmembrane beta-barrels. Journal of Molecular Biology. 2007;366:1174–1184. doi: 10.1016/j.jmb.2006.12.029. - DOI - PubMed
    1. Dong C, Beis K, Nesper J, Brunkan-Lamontagne AL, Clarke BR, Whitfield C, Naismith JH. Wza the translocon for E. coli capsular polysaccharides defines a new class of membrane protein. Nature. 2006;444:226–229. doi: 10.1038/nature05267. - DOI - PMC - PubMed
    1. Drake JW, Charlesworth B, Charlesworth D, Crow JF. Rates of spontaneous mutation. Genetics. 1998;148:1667. - PMC - PubMed
    1. Franklin MW, Nepomnyachiy S, Feehan R, Ben-Tal N, Kolodny R, Slusky JSG. Efflux Pumps Represent Possible Evolutionary Convergence onto the β-Barrel Fold. Structure. 2018a;26:1266–1274. doi: 10.1016/j.str.2018.06.007. - DOI - PMC - PubMed
    1. Franklin MW, Slusky JSG. Tight Turns of Outer Membrane Proteins: An Analysis of Sequence, Structure, and Hydrogen Bonding. Journal of Molecular Biology. 2018b;430:3251–3265. doi: 10.1016/j.jmb.2018.06.013. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances