Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 3;21(7):654.
doi: 10.3390/e21070654.

Sequence Versus Composition: What Prescribes IDP Biophysical Properties?

Affiliations

Sequence Versus Composition: What Prescribes IDP Biophysical Properties?

Jiří Vymětal et al. Entropy (Basel). .

Abstract

Intrinsically disordered proteins (IDPs) represent a distinct class of proteins and are distinguished from globular proteins by conformational plasticity, high evolvability and a broad functional repertoire. Some of their properties are reminiscent of early proteins, but their abundance in eukaryotes, functional properties and compositional bias suggest that IDPs appeared at later evolutionary stages. The spectrum of IDP properties and their determinants are still not well defined. This study compares rudimentary physicochemical properties of IDPs and globular proteins using bioinformatic analysis on the level of their native sequences and random sequence permutations, addressing the contributions of composition versus sequence as determinants of the properties. IDPs have, on average, lower predicted secondary structure contents and aggregation propensities and biased amino acid compositions. However, our study shows that IDPs exhibit a broad range of these properties. Induced fold IDPs exhibit very similar compositions and secondary structure/aggregation propensities to globular proteins, and can be distinguished from unfoldable IDPs based on analysis of these sequence properties. While amino acid composition seems to be a major determinant of aggregation and secondary structure propensities, sequence randomization does not result in dramatic changes to these properties, but for both IDPs and globular proteins seems to fine-tune the tradeoff between folding and aggregation.

Keywords: IDP; IDR; aggregation propensity; secondary structure prediction; sequence randomization.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Scatter plot showing predicted secondary structure content (x-axis) and aggregation propensity (y-axis) of the PDB (blue) and DisProt (orange) datasets. The Disprot dataset is further annotated for classified examples of induced fold (by squares) and unfoldable (by circles) intrinsically disordered proteins (IDPs) in (A), which belong also to the most disordered Disprot predicted proteins by MobiDB-lite score in (B).
Figure 2
Figure 2
Compositional distribution of the PDB and DisProt datasets. Projections to the principal components (PC1 and PC2) are presented in (A), the Disprot dataset has the induced fold (squares) and unfoldable (circles) representatives highlighted. Contributions of individual amino acids to PC1 and PC2 are visualized in (B).
Figure 3
Figure 3
Analysis of properties of the DisProt and PDB datasets, showing the (A) mean values and (B) z-score of aggregation and secondary structure propensity after 1000 sequence permutations, compared to the original sequence.

Similar articles

Cited by

References

    1. Babu M.M. The contribution of intrinsically disordered regions to protein function, cellular complexity, and human disease. Biochem. Soc. Trans. 2016;44:1185–1200. doi: 10.1042/BST20160172. - DOI - PMC - PubMed
    1. Van Der Lee R., Buljan M., Lang B., Weatheritt R.J., Daughdrill G.W., Dunker A.K., Fuxreiter M., Gough J., Gsponer J., Jones D.T., et al. Classification of intrinsically disordered regions and proteins. Chem. Rev. 2014;114:6589–6631. doi: 10.1021/cr400525m. - DOI - PMC - PubMed
    1. Theillet F.X., Kalmar L., Tompa P., Han K.H., Selenko P., Dunker A.K., Daughdrill G.W., Uversky V.N. The alphabet of intrinsic disorder: I. Act like a Pro: On the abundance and roles of proline residues in intrinsically disordered proteins. Intrinsically Disord. Proteins. 2013;1:e24360. doi: 10.4161/idp.24360. - DOI - PMC - PubMed
    1. Romero P., Obradovic Z., Li X., Garner E.C., Brown C.J., Dunker A.K. Sequence complexity of disordered protein. Proteins Struct. Funct. Bioinform. 2001;42:38–48. doi: 10.1002/1097-0134(20010101)42:1<38::AID-PROT50>3.0.CO;2-3. - DOI - PubMed
    1. Uversky V.N. Paradoxes and wonders of intrinsic disorder: Complexity of simplicity. Intrinsically Disord. Proteins. 2016;4:e1135015. doi: 10.1080/21690707.2015.1135015. - DOI - PMC - PubMed

LinkOut - more resources