Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Jun:32:102-12.
doi: 10.1016/j.sbi.2015.03.008. Epub 2015 Apr 2.

Relating sequence encoded information to form and function of intrinsically disordered proteins

Affiliations
Review

Relating sequence encoded information to form and function of intrinsically disordered proteins

Rahul K Das et al. Curr Opin Struct Biol. 2015 Jun.

Abstract

Intrinsically disordered proteins (IDPs) showcase the importance of conformational plasticity and heterogeneity in protein function. We summarize recent advances that connect information encoded in IDP sequences to their conformational properties and functions. We focus on insights obtained through a combination of atomistic simulations and biophysical measurements that are synthesized into a coherent framework using polymer physics theories.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest

None declared.

Figures

Figure 1
Figure 1. Definitions of polar tracts, polyelectrolytes, and polyampholytes
Polar tracts shown here include polyQ (UniProt ID: P42858): Polyglutamine tracts are found in at least ten proteins associated with human neurodegenerative disorders including Huntington’s disease; Sup35 (UniProt ID: P05453): Residues 4–23 of S.cerevisiae Sup35 corresponding to a region of the N-terminal prion domain; EcSSB (UniProt ID: P0AGE0): Residues 117–136 of E.coli single stranded DNA binding protein corresponding to a region of the C-terminal tail; Nup42 (UniProt ID: P49686): Residues 181–200 of S.cerevisiae nucleoporin Nup42 corresponding to a region of the FG domain, which modulates gating of the nuclear pore complex. Polyampholytes shown here include: Nup60 (UniProt ID: P39705): Residues 412–431 of S.cerevisiae nucleoporin Nup60 corresponding to a region of the FG domain which modulates gating of the nuclear pore complex; PfSSB (UniProt ID: Q8I415): Residues 232–251 of P.falciparum single stranded DNA binding protein corresponding to a region of the Cterminal tail; Nsp1 (UniProt ID: P14907): Residues 359–378 of S.cerevisiae nucleoporin Nsp1 corresponding to a region of the FG domain which modulates gating of the nuclear pore complex; PQBP1 (UniProt ID: O60828): Residues 146–165 of H.sapiens polyglutamine-tract binding protein 1 corresponding to a region of the expanded linker, which connects the N-terminal WW domain and the C-terminal U5 15 kDa binding region. Polyelectrolytes shown here include: PRM2 (UniProt ID: Q9EP54): Residues 2–21 of the C.griseus DNA packaging protein protamine 2, which is involved in the chromatin condensation process during spermatogenesis [6]; PDE6G (UniProt ID: P18545): Residues 63–82 of H.sapiens retinal rod rhodopsin-sensitive cGMP 3′,5′-cyclic phosphodiesterase subunit gamma protein, which is involved in processing visual signal; NP1 (UniProt ID: O13030): Residues 5–24 of C.pyrrhogaster protamine 1 which is involved in the chromatin condensation process during spermatogenesis; RAG2 (UniProt ID: P21784): Residues 392–411 of C.griseus V(D)J recombination-activating protein 2 corresponding to a region of the “acidic hinge” which modulates DNA repair mechanisms.
Figure 2
Figure 2. Summary of the typical workflow used to extract quantitative CCRs and SCRs from computer simulations, in vitro biophysical experiments, or synergy between the two modes of investigation
Figure 3
Figure 3. Summary of readily calculated compositional parameters that help in quantitative assessments of CCRs for IDP sequences
Figure 4
Figure 4. Diagram-of-states classification depicting the distinct conformational classes for IDP sequences
Statistics for different regions (percentages) are from analysis of bona fide IDPs in DISPROT [61].
Figure 5
Figure 5. Illustrations of the impact of conserved versus altered CCRs on IDP functions
Figure 6
Figure 6. Calculation of κ and using it to distinguish the sequences with different linear patterns of oppositely charged residues
The top row shows how κ is calculated. The overall charge asymmetry σ is determined by the amino acid composition (see Figure 2). Each sequence is divided into nw sliding windows and the mean squared deviation δ helps quantify the deviation of the charge asymmetry across different sequence windows vis-à-vis the charge asymmetry encoded by the amino acid composition. The value of δ is calculated for all sequence variants that correspond to the amino acid composition and this is used to evaluate the value of κ, as shown, thus ensuring that κ is bounded between 0 and 1. As an illustration of the patterning that is quantified using κ, we show the sequence of the “polar rich domain” extracted from the sequence of the polyglutamine tract binding protein PQB-P1. The bottom two rows show two de novo designed sequences designated as sv1 and sv2 for sequence variants 1 and 2. These two sequences were derived from alterations to the linear sequence distribution of oppositely charged residues [38]. On each row, the values of κ are shown to the right.
Figure 7
Figure 7. Illustrating the impact of sequence patterns and their conservation / alteration on IDP functions

References

    1. Chothia C, Gough J, Vogel C, Teichmann SA. Evolution of the Protein Repertoire. Science. 2003;300:1701–1703. - PubMed
    1. Babu MM, Kriwacki RW, Pappu RV. Versatility from Protein Disorder. Science. 2012;337:1460–1461. - PubMed
    1. Wright PE, Dyson HJ. Intrinsically disordered proteins in cellular signalling and regulation. Nature Reviews: Molecular Cell Biology. 2015;16:18–29.. ● An updated account of the importance of IDPs in cell signaling and the control of cellular decisions and fates.

    1. van der Lee R, Buljan M, Lang B, Weatheritt RJ, Daughdrill GW, Dunker AK, Fuxreiter M, Gough J, Gsponer J, Jones DT, et al. Classification of Intrinsically Disordered Regions and Proteins. Chemical Reviews. 2014;114:6589–6631.. ● A comprehensive review of informatics and physical considerations that have enabled the classification of motifs and IDPs.

    1. Uversky VN. Natively unfolded proteins: A point where biology waits for physics. Protein Science. 2002;11:739–756. - PMC - PubMed

Publication types

Substances

LinkOut - more resources