Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 2;15(1):3727.
doi: 10.1038/s41467-024-47519-7.

Synthetic intrinsically disordered protein fusion tags that enhance protein solubility

Affiliations

Synthetic intrinsically disordered protein fusion tags that enhance protein solubility

Nicholas C Tang et al. Nat Commun. .

Abstract

We report the de novo design of small (<20 kDa) and highly soluble synthetic intrinsically disordered proteins (SynIDPs) that confer solubility to a fusion partner with minimal effect on the activity of the fused protein. To identify highly soluble SynIDPs, we create a pooled gene-library utilizing a one-pot gene synthesis technology to create a large library of repetitive genes that encode SynIDPs. We identify three small (<20 kDa) and highly soluble SynIDPs from this gene library that lack secondary structure and have high solvation. Recombinant fusion of these SynIDPs to three known inclusion body forming proteins rescue their soluble expression and do not impede the activity of the fusion partner, thereby eliminating the need for removal of the SynIDP tag. These findings highlight the utility of SynIDPs as solubility tags, as they promote the soluble expression of proteins in E. coli and are small, unstructured proteins that minimally interfere with the biological activity of the fused protein.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. The steps for identification of highly soluble SynIDPs.
A Generation of SynIDP gene library. A pool of 1020 ssDNA 72-nt long oligonucleotides was circularized by CircLigase and amplified using RCA. Substitution of dCTP with 10% 5-methyl-dCTP randomly incorporates 5mC into the RCA product, resulting in various DNA size products upon digestion by SexAI (see Supplementary Fig. 1). The products are separated by electrophoresis. Fragments of the desired size (360–576 bp) are then gel purified and ligated into a plasmid. B Restriction digestion of RCA products with 0% 5mCTP (left) vs. 10% 5mCTP (right) using the restriction enzyme SexAI shows that digests occur in multiples of 72 bp when 5mC is incorporated. Desired size (360–576 bp, marked with arrow) DNA are then gel purified (C). Illumina Miseq analysis of cloned library of plasmid DNA verifies the presence of 865 unique motifs (See Supplementary Fig. 2 for enlarged version). Source data are provided as a Source Data file (D). CoFi protein expression (see Supplementary Fig. 3) allows identification of E. coli colonies that express soluble protein by an anti-His6 western blot (left panel). Colonies were identified manually on a greyscale image (middle panel) and a black and white pixel-thresholded image overlaying the green channel (right panel), was printed and aligned underneath the plate of colonies as a visual aid for colony picking. Images represent a subset of the total set of agar plates that were analyzed (see Supplementary Fig). E Dot blot of insoluble and soluble lysate fractions to determine target protein solubility. The dot blot membrane was subjected to His6-tag antibody detection. The distinct dots marked by arrows represent an example target SynIDP—[GTHGTP]24—that was determined to be soluble. Solubility was determined by the intensity of the soluble fraction relative to the intensity of the insoluble fraction in both dot blot and PAGE gels (see Supplementary Figs. 6, 7).
Fig. 2
Fig. 2. Characterization of the physical and molecular properties of SynIDPs demonstrate that they are unstructured proteins.
A SDS-PAGE of purified SynIDPs. All SynIDPs run slightly higher than their expected molecular weight, which is characteristic of many IDPs. B CD spectra of the SynIDPs shows a random coil structure, which is deduced from the characteristic negative peak at 197 nm and a positive peak at 215 nm. C Kratky plots [I(q)·q2 as a function of q] of the SynIDP reveal a characteristic of flexible disordered chain for SynIDP-1 and SynIDP-2 while demonstrating extended chain conformation for SynIDP-3. D Scattering profiles [I(q) vs. q] of SynIDPs in a log-log plot. Black lines represent Fit of the PEV model to the SAXS data of SynIDP-1 and SynIDP-2. Curves were offset along Y-axis for better visibility. Source data are provided as a Source Data files.
Fig. 3
Fig. 3. Fusion to SynIDPs rescues soluble and functional expression of mTdT.
A Western blot of insoluble and soluble fractions of mTdT and SynIDP-1/2/3-mTdT using anti-TdT antibody. B Purified SynIDP-1/2/3-mTdT after IMAC and SEC visualized on SDS-PAGE. C mTdT activity assay showing elongation of Cy5-poly-T50 initiator on TBE-Urea PAGE gels. Low Range ssRNA ladder was used to quantify nucleotide addition. From left to right; ladder, initiator (negative control), Promega-TdT (positive control), SynIDP-1-mTdT 1X and 2X, SynIDP-2-mTdT 1X and 2X, SynIDP-3-mTdT 1X and 2X. D Insoluble (Ins) and soluble (S) fractions of mTdT, SynIDP-1-mTdT, SUMO-mTdT and MBP-mTdT visualized on SDS-PAGE. Large increase in soluble expression of mTdT are observed when fused to SynIDP-1 compared to fusion with SUMO or MBP. Arrows indicate the desired protein product. E TdT activity assay showing elongation of Cy5-poly-T50 initiator on TBE-Urea PAGE gels. Low Range ssRNA ladder was used to quantify nucleotide addition. From left to right; cleaved mTdT, SynIDP-1-mTdT, SUMO-mTdT, MBP-mTdT, Ladder. Image was processed from Supplementary Fig. 14D (see experimental). F ImageJ analysis of fluorescence intensity as measurement for dispersity and degree of polymerization of mTdT variants’ reaction products. The peak at a distance of ~100 pixels stems from the dye front and is not a real product. SynIDP-1-mTdT outperforms the other variants by displaying lower dispersity and a higher degree of polymerization. The calibration of nt size to the distance on the gel (shown in the line above) was obtained by similar image analysis of the ladder. Source data are provided as a Source Data file.
Fig. 4
Fig. 4. Fusion to SynIDPs rescues soluble and functional expression of Z2-LO10 and TEV proteins.
A Purified SynIDP-1/2/3-Z2-LO10 after IMAC and SEC. B Log-fold dilutions of SynIDP-Z2-LO10 and SynIDP controls (see Supplementary Fig. 15C) were incubated with CT-2A-EGFRviii, an EGFR positive murine glioma cell line for 48 h and tested for viability by an MTS assay. n = 3 replicates, error bars represent SD. C Purified SynIDP-1/2/3-TEV after His-purification and SEC. D MALDI-TOF MS of products of SynIDP-2-TEV reaction with ELP-tev-FGF21 substrate at 37 °C for 30 min confirms proteolytic activity. The absence of the intact substrate is evident by the lack of a peak at m/z 45,459. The cleaved FGF-21 (m/z of 20,038) and the ELP-tev (m/z of 25,439) can be seen in the MS spectra. The peak at m/z ~ 37,000 matches the expected Mw of SynIDP-2-TEV. Mass spectra for the control and products of SynIDP-1/3-TEV reaction with ELP-tev-FGF21 are shown in SI (see Supplementary Fig. 18). Source data are provided as a Source Data file.

Similar articles

Cited by

References

    1. Fahnert B, Lilie H, Neubauer P. Inclusion bodies: Formation and utilisation. Adv. Biochem. Engin/Biotechnol. 2004;89:93–142. - PubMed
    1. Baneyx F, Mujacic M. Recombinant protein folding and misfolding in Escherichia coli. Nat. Biotechnol. 2004;22:1399–1408. doi: 10.1038/nbt1029. - DOI - PubMed
    1. Costa S, Almeida A, Castro A, Domingues L. Fusion tags for protein solubility, purification and immunogenicity in Escherichia coli: the novel Fh8 system. Front. Microbiol. 2014;5:63. doi: 10.3389/fmicb.2014.00063. - DOI - PMC - PubMed
    1. Kapust RB, Waugh DS. Escherichia coli maltose-binding protein is uncommonly effective at promoting the solubility of polypeptides to which it is fused. Protein Sci. 1999;8:1668–1674. doi: 10.1110/ps.8.8.1668. - DOI - PMC - PubMed
    1. Hewitt SN, et al. Expression of proteins in Escherichia coli as fusions with maltose-binding protein to rescue non-expressed targets in a high-throughput protein-expression and purification pipeline. Acta Crystallogr. 2011;67:1006–1009. - PMC - PubMed

Publication types

MeSH terms

Substances