Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Dec;8(4):153-66.
doi: 10.1007/s10969-007-9032-5. Epub 2007 Nov 6.

Small-scale, semi-automated purification of eukaryotic proteins for structure determination

Affiliations

Small-scale, semi-automated purification of eukaryotic proteins for structure determination

Ronnie O Frederick et al. J Struct Funct Genomics. 2007 Dec.

Abstract

A simple approach that allows cost-effective automated purification of recombinant proteins in levels sufficient for functional characterization or structural studies is described. Studies with four human stem cell proteins, an engineered version of green fluorescent protein, and other proteins are included. The method combines an expression vector (pVP62K) that provides in vivo cleavage of an initial fusion protein, a factorial designed auto-induction medium that improves the performance of small-scale production, and rapid, automated metal affinity purification of His8-tagged proteins. For initial small-scale production screening, single colony transformants were grown overnight in 0.4 ml of auto-induction medium, produced proteins were purified using the Promega Maxwell 16, and purification results were analyzed by Caliper LC90 capillary electrophoresis. The yield of purified [U-15N]-His8-Tcl-1 was 7.5 microg/ml of culture medium, of purified [U-15N]-His8-GFP was 68 microg/ml, and of purified selenomethione-labeled AIA-GFP (His8 removed by treatment with TEV protease) was 172 microg/ml. The yield information obtained from a successful automated purification from 0.4 ml was used to inform the decision to scale-up for a second meso-scale (10-50 ml) cell growth and automated purification. 1H-15N NMR HSQC spectra of His8-Tcl-1 and of His8-GFP prepared from 50 ml cultures showed excellent chemical shift dispersion, consistent with well folded states in solution suitable for structure determination. Moreover, AIA-GFP obtained by proteolytic removal of the His8 tag was subjected to crystallization screening, and yielded crystals under several conditions. Single crystals were subsequently produced and optimized by the hanging drop method. The structure was solved by molecular replacement at a resolution of 1.7 A. This approach provides an efficient way to carry out several key target screening steps that are essential for successful operation of proteomics pipelines with eukaryotic proteins: examination of total expression, determination of proteolysis of fusion tags, quantification of the yield of purified protein, and suitability for structure determination.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Expression vector pVP62K. (a) Linear map showing key features of the vector and location of the Bar-CAT toxic cassette and 3′ homology region (3′-hmr) for Flexi Vector cloning. (b) Nucleotide and encoded protein sequence in the linker region near to the SgfI cloning site. The TVMV protease site is ETVRFQS, where proteolysis occurs between the Q and S residues. The fusion protein may be cleaved in the expression host due to the presence of a low level of TVMV protease produced by constitutive expression from pVP62K. The TEV protease site is ENLYFQA, where proteolysis occurs between the Q and A residues. After purification of the His8-tagged protein, the His8 tag can be removed by treatment with TEV protease to release an N-terminal AIA-target
Fig. 2
Fig. 2
Conditional methionine auxotrophy in E. coli B834. (a) Genome organization near to the metE gene in E. coli K12 [51]. (b) Genome organization near to the metE gene in E. coli B834. In this organism, DNA sequencing revealed a large insert in the metE gene, which caused the protein to be truncated to 56 amino acids (aa), non-functional peptide
Fig. 3
Fig. 3
Caliper LC90 analysis of His8-tagged proteins purified by Maxwell 16. Lanes LA and LB are molecular weight markers. Lanes A1–B2 are structural genomics target proteins (protein bands marked with ovals) with molecular weight ∼50–75 kDa. They were expressed in factorial evolved auto-induction medium containing selenomethionine [33] as an N-terminal fusion with MBP from pVP56K, a vector that does not give in vivo proteolysis of the fusion protein. Lane B3 contains His8-MBP (protein band marked with oval), while lane B4 (1.1 mg/ml) contains His8-GFP expressed from pVP62 after in vivo cleavage from MBP. Lanes with a purified expressed fusion protein with yield than 100 μg/ml are marked with a star (also see Table 2 )
Fig. 4
Fig. 4
Small-scale purification screening of human embryonic stem cell proteins. Human stem cell proteins were expressed in E. coli B834 by auto-induction, liberated by in vivo proteolysis, and purified by the Maxwell 16 purification system. Table 2 provides further information on these proteins. Lane 1, molecular weight markers. Lanes 2 and 3, total cell lysate and eluted sample from purification of CCNF. No purified protein was detected. Lanes 4 and 5, C10orf96 was obtained in detectable amounts, but not sufficient for scale-up, along with two higher molecular weight contaminants. Lanes 6 and 7, His8-Tcl-1 was expressed, proteolyzed, and successfully purified. Lanes 8 and 9, NPM2 was expressed and proteolyzed, but only a small amount of protein was purified. In addition, the purified protein appeared to be partially degraded. Lanes 10 and 11, His8-GFP
Fig. 5
Fig. 5
Replicate Maxwell 16 purification of human embryonic stem T-cell lymphoma-1 protein. Lane 1, molecular weight markers. Lanes 2–12, replicate purifications of His8-Tcl-1. Lane 13, His8-MBP-At2g34690.1, an Arabidopsis thaliana protein expression control
Fig. 6
Fig. 6
1H–15N HSQC NMR spectra of Maxwell-purified proteins Tcl-1 and GFP. (a) 750 MHz spectrum of His8-Tcl-1 obtained at 35°C (1.75 mg in 250 μl of 10 mM KHPO4, pH 7, containing 50 mM KCl). The total NMR time required to obtain this spectrum was 9.5 h. (b) 600 MHz spectrum of His8-GFP obtained at 35°C (5.6 mg in 250 μl of 10 mM KHPO4, pH 7, containing 50 mM KCl). The total NMR time required to obtain this spectrum was 1 h
Fig. 7
Fig. 7
X-ray structure of AIA–GFP. The chromophore is shown as green cylinders representing bonded atoms
Fig. 8
Fig. 8
Schematic of a purification screening protocol. Steps from obtaining a sequence-verified target in auto-cleavage vector pVP62K to identification of purified proteins. The transformed expression host is grown in auto-induction medium. Cells from production trials are loaded into the Maxwell 16 instrument for automated purification, and purified proteins are detected by Caliper LC90 capillary electrophoresis. Successful purification of a protein from auto-cleavage expression with yield exceeding 50 μg/ml of culture medium indicates feasibility of scale-up efforts

Similar articles

Cited by

References

    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1006/abio.2001.5331', 'is_inner': False, 'url': 'https://doi.org/10.1006/abio.2001.5331'}, {'type': 'PubMed', 'value': '11567530', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/11567530/'}]}
    2. Knaust RK, Nordlund P (2001) Screening for soluble expression of recombinant proteins in a 96-well format. Anal Biochem 297:79–85 - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1016/S0969-2126(00)00193-3', 'is_inner': False, 'url': 'https://doi.org/10.1016/s0969-2126(00)00193-3'}, {'type': 'PubMed', 'value': '10986469', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/10986469/'}]}
    2. Stevens RC (2000) Design of high-throughput methods of protein production for structural biology. Structure 8:R177–185 - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1006/bbrc.2000.4229', 'is_inner': False, 'url': 'https://doi.org/10.1006/bbrc.2000.4229'}, {'type': 'PubMed', 'value': '11162598', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/11162598/'}]}
    2. Kawasaki M, Inagaki F (2001) Random PCR-based screening for soluble domains using green fluorescent protein. Biochem Biophys Res Commun 280:842–844 - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1074/jbc.M512588200', 'is_inner': False, 'url': 'https://doi.org/10.1074/jbc.m512588200'}, {'type': 'PubMed', 'value': '16717101', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/16717101/'}]}
    2. King DA, Hall BE, Iwamoto MA, Win KZ, Chang JF, Ellenberger T (2006) Domain structure and protein interactions of the silent information regulator Sir3 revealed by screening a nested deletion library of protein fragments. J Biol Chem 281:20107–20119 - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1110/ps.062082606', 'is_inner': False, 'url': 'https://doi.org/10.1110/ps.062082606'}, {'type': 'PMC', 'value': 'PMC2242398', 'is_inner': False, 'url': 'https://pmc.ncbi.nlm.nih.gov/articles/PMC2242398/'}, {'type': 'PubMed', 'value': '17008718', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/17008718/'}]}
    2. Reich S, Puckey LH, Cheetham CL, Harris R, Ali AA, Bhattacharyya U, Maclagan K, Powell KA, Prodromou C, Pearl LH, Driscoll PC, Savva R (2006) Combinatorial domain hunting: an effective approach for the identification of soluble protein domains adaptable to high-throughput applications. Protein Sci 15:2356–2365 - PMC - PubMed

Publication types

MeSH terms