Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Oct;172(1):3-13.
doi: 10.1016/j.jsb.2010.06.008. Epub 2010 Jun 10.

High-throughput production of human proteins for crystallization: the SGC experience

Affiliations

High-throughput production of human proteins for crystallization: the SGC experience

Pavel Savitsky et al. J Struct Biol. 2010 Oct.

Abstract

Producing purified human proteins with high yield and purity remains a considerable challenge. We describe the methods utilized in the Structural Genomics Consortium (SGC) in Oxford, resulting in successful purification of 48% of human proteins attempted; of those, the structures of approximately 40% were solved by X-ray crystallography. The main driver has been the parallel processing of multiple (typically 9-20) truncated constructs of each target; modest diversity in vectors and host systems; and standardized purification procedures. We provide method details as well as data on the properties of the constructs leading to crystallized proteins and the impact of methodological variants. These can be used to formulate guidelines for initial approaches to expression of new eukaryotic proteins.

PubMed Disclaimer

Figures

Supplementary Figure 1
Supplementary Figure 1
(A) Target diversity; (B) Crystallization class distribution of proteins solved by the SCG (Oxford).
Fig. 1
Fig. 1
Cloning pipeline. Distribution of the number of constructs generated per each target (A) or each target domain combination (B). Productive targets/domains are those for which at least one construct led to soluble, purified protein; non-productive targets/domains could not be produced in substantial amounts from any of the constructs generated. (C) Distribution of cloned fragments generated per target/domain (same as panel B, but fragments cloned in different vectors are counted only once).
Fig. 2
Fig. 2
Schematic domain organization and construct design. Fes tyrosine-protein kinase: the black line represents the full-length gene. The three PFAM-annotated domains (FCH, SH2, and P-kinase) are depicted in boxes; the blue and red lines represent the gene fragments that were all cloned and tested for expression and crystallization. Note that the blue and red lines represent distinct constructs. HSD11B1 (11-beta-hydroxysteroid dehydrogenase 1): a demonstration of different definitions of structured domains.
Fig. 3
Fig. 3
Fraction of soluble constructs per target/domain. For each productive target/domain (as defined in Fig. 1), the fraction of constructs that tested positive for soluble expression was scored. The proportion of domains with different fractions of soluble constructs is displayed in histogram form.
Fig. 4
Fig. 4
Ends of successful constructs relative to the predicted and observed structured regions. The constructs leading to 282 unique structures were compared with structured regions predicted by PSIPRED (A), the observed structured region in the crystal structure (B), and the annotated PFAM domains (C). The length of extensions in the N-termini (white) and C-termini (full) are depicted as histograms. Note the different scale in panel C.
Fig. 5
Fig. 5
Presence of tags on crystallized proteins. Size distribution of proteins that were crystallized with the tag (open) or following tag cleavage (full).
Fig. 6
Fig. 6
Multiconstruct testing in baculovirus. Ten constructs of one gene (A) and five constructs of a second gene (B) were cloned in baculovirus and expression was tested following infection of 3-ml suspension cultures of SF9 cells (Shrestha et al., 2008). The cells were lysed after 48 h, the lysates were clarified by centrifugation and the recombinant His6-tagged proteins were purified by Ni-affinity. Aliquots of the uncentrifuged total lysate (T) and the NiNTA-eluate (E) were analyzed by SDS–PAGE. The positions of the recombinant proteins are marked with (*).

Similar articles

Cited by

References

    1. Abbott A. Protein structures hint at the shape of things to come. Nature. 2005;435:547. - PubMed
    1. Aslanidis C., de Jong P.J. Ligation-independent cloning of PCR products (LIC-PCR) Nucleic Acids Res. 1990;18:6069–6074. - PMC - PubMed
    1. Banci L., Bertini I., Cusack S., de Jong R.N., Heinemann U., Jones E.Y., Kozielski F., Maskos K., Messerschmidt A., Owens R., Perrakis A., Poterszman A., Schneider G., Siebold C., Silman I., Sixma T., Stewart-Jones G., Sussman J.L., Thierry J.C., Moras D. First steps towards effective methods in exploiting high-throughput technologies for the determination of human protein structures of high biomedical value. Acta Crystallogr. D Biol. Crystallogr. 2006;62:1208–1217. - PubMed
    1. Berrow N.S., Bussow K., Coutard B., Diprose J., Ekberg M., Folkers G.E., Levy N., Lieu V., Owens R.J., Peleg Y., Pinaglia C., Quevillon-Cheruel S., Salim L., Scheich C., Vincentelli R., Busso D. Recombinant protein expression and solubility screening in Escherichia coli: a comparative study. Acta Crystallogr. D Biol. Crystallogr. 2006;62:1218–1226. - PubMed
    1. Burgess-Brown N.A., Sharma S., Sobott F., Loenarz C., Oppermann U., Gileadi O. Codon optimization can improve expression of human genes in Escherichia coli: a multi-gene study. Protein Expr. Purif. 2008;59:94–102. - PubMed

Publication types