Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017:2017:1032432.
doi: 10.1155/2017/1032432. Epub 2017 Jan 15.

Structural Study of Heterogeneous Biological Samples by Cryoelectron Microscopy and Image Processing

Affiliations
Review

Structural Study of Heterogeneous Biological Samples by Cryoelectron Microscopy and Image Processing

H E White et al. Biomed Res Int. 2017.

Abstract

In living organisms, biological macromolecules are intrinsically flexible and naturally exist in multiple conformations. Modern electron microscopy, especially at liquid nitrogen temperatures (cryo-EM), is able to visualise biocomplexes in nearly native conditions and in multiple conformational states. The advances made during the last decade in electronic technology and software development have led to the revelation of structural variations in complexes and also improved the resolution of EM structures. Nowadays, structural studies based on single particle analysis (SPA) suggests several approaches for the separation of different conformational states and therefore disclosure of the mechanisms for functioning of complexes. The task of resolving different states requires the examination of large datasets, sophisticated programs, and significant computing power. Some methods are based on analysis of two-dimensional images, while others are based on three-dimensional studies. In this review, we describe the basic principles implemented in the various techniques that are currently used in the analysis of structural conformations and provide some examples of successful applications of these methods in structural studies of biologically significant complexes.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1
Overall diagram of the work flow of structural analysis by cryo-EM.
Figure 2
Figure 2
Multivariate Statistical Analysis. (a) Left: ten images, each consisting of 2 pixels. Right: each image is represented as a vector in 2-dimensional space according to their grey values. (b) Hierarchical Classification. The left panel shows the sequential combination of vectors according to their closeness. The initial classification of images starts by forming small classes which include images that are close to one another in multidimensional space and then the size of the group is progressively increased by merging with dimensional other surrounding smaller groups that are in close proximity to each other (see the text). Images that are too far from each other form new separate classes. In the example shown in panel (b) the process of forming two classes is represented by the blue and green ovals which have varying degrees of colour intensity. The light and dark coloured ovals correspond to the initial and final steps of classification, respectively. The right panel shows a tree of HAC. The starting point is 10 classes which correspond to the number of single images in the dataset. The cut-off point is shown by the dashed red line if 2 classes are required and this corresponds to the two classes shown in the left panel.
Figure 3
Figure 3
Eigenimages and Classification. (a) A set of raw images. (b) Four images (top) shown with a coarse pixilation similar to those in panel (a) with size K × K pixels. Images form a matrix where a single image is presented as a single row in it (bottom). Each pixel in row 1 of image 1 is laid out in the first row of the matrix. The second row of image 1 follows on after row 1 in the first row of the matrix. This continues until all K rows have been laid out in the first row of the matrix. The rows of image 2 are laid out in a similar manner in row 2 of the matrix and the process continues until all N images in the dataset have been placed into the matrix. (c) Eight eigenimages obtained from the set of aligned images in (a). (d) Classification of the dataset into 5 classes. (e) Classification of the dataset into 10 classes. (f) Raw unaligned rotated images. (g) Eigenimages from the unaligned dataset.
Figure 4
Figure 4
Eigenimages-Symmetry. (a) A model dataset with 4-fold symmetry. (b) Eigenimages from that dataset. Eigenvectors 2 and 3 have clear 4-fold symmetry and are rotated with respect to one another by 22.5° degrees. (c) Tetrameric α-latrotoxin raw images (top row), class averages (middle row), and eigenimages (bottom row). The eigenimages showing 4-fold and pseudo 8-fold symmetry are shown with numbers. (d) Class averages of top views from the connector of bacteriophage SPP1 are shown in the top panel. Classes where the symmetry is visible are highlighted with yellow circles. The eigenimages are in the bottom panel. Eigenimage 1 represents the total sum of the data and the 12-fold symmetry is revealed in eigenimages 2 and 3.
Figure 5
Figure 5
Eigenimages-Size Variation. (a) Eigenimages of Hsp26 are shown in the left panel. Eigenimage 1 represents the total sum of the dataset. Eigenimage 2 shows the continuous outer circle which indicates the characteristic size difference range within the dataset. The right panel shows the entire dataset separated into four classes via MSA by only using these first four eigenimages. The big class is highlighted with a white circle around its perimeter, the small class is highlighted with a dashed white circle, and the remaining two classes represent a mixture of large and small Hsp26 images. (b) Eigenimages of BSMV. The size difference is shown in images 11 and 12 (adapted from [18]). (c) A representative micrograph showing the heterogeneity of the SPP1 bacteriophage procapsids where different sizes are clearly seen [19]. (d) The classes of the procapsid images are labelled according to their size, big (B, in blue) and small (S, in yellow).
Figure 6
Figure 6
Eigenimages-Substrate Binding. (a) GroEL bound to the substrate rhodanese with the raw images (top) and eigenimages (bottom). Eigenimage 5, highlighted with a yellow box indicates heterogeneity in the trans-ring which is related to the binding of rhodanese (adapted from [5]). (b) Three of the 12 orientation classes (column 1) from GroEL-rhodanese complex after MSA based on the eigenimages, the first six of which are shown in (a). The eigenimages of these classes are shown in columns 2–5 and the heterogeneity in the trans-ring is highlighted with a yellow box (from [5]).
Figure 7
Figure 7
ML procedure in the analysis of conformational changes of biocomplexes. Raw images are firstly assigned initial orientation angles using the initial model. That is typically done by projection matching. Then the ML approach is used to obtain 6 to 8 reconstructions. Each 3D model is visually examined in the area of interest; for a ligand presence, in this case the bound tRNA is highlighted in red. Images which were used to obtain the models with tRNA are extracted and subjected to the next round of classification. The following step involves extracting images corresponding to one or another conformation and then followed by refinement. The percentages below the structures in the top row indicate fractions of images from the entire dataset used to calculate these models, while in the second row the percentages are taken from the number of images supposedly containing the bound tRNA.
Figure 8
Figure 8
K-Means Clustering. (a) Two initial seeds are randomly placed within the data. (b) Step  2 indicates positions of the averages of images that are nearest to the seeds. (c) The averages are then recalculated based on the assignments in step  2. Steps  2 and 3 are reiterated; (d) shows the final classes.
Figure 9
Figure 9
Bootstrapping. A representative set of chickens with different tails and head positions. During step one each of L subsets of M images was picked to make L reconstructions. During step  2 the variance within L reconstructions determines the most significant differences in the head (green) and tail (red) positions. The result of the classification of images shown in step  3 is done by analysing the level of variance in areas defined in step  2 (highlighted by red and blue circles). The two reconstructions generated are then used as the input to carry out the refinement using the focused classification (step  4).

References

    1. Wikoff W. R., Conway J. F., Tang J., et al. Time-resolved molecular dynamics of bacteriophage HK97 capsid maturation interpreted by electron cryo-microscopy and X-ray crystallography. Journal of Structural Biology. 2006;153(3):300–306. doi: 10.1016/j.jsb.2005.11.009. - DOI - PubMed
    1. Blanchard S. C., Kim H. D., Gonzalez R. L., Jr., Puglisi J. D., Chu S. tRNA dynamics on the ribosome during translation. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(35):12893–12898. doi: 10.1073/pnas.0403884101. - DOI - PMC - PubMed
    1. Cornish P. V., Ermolenko D. N., Noller H. F., Ha T. Spontaneous intersubunit rotation in single ribosomes. Molecular Cell. 2008;30(5):578–588. doi: 10.1016/j.molcel.2008.05.004. - DOI - PMC - PubMed
    1. Marino J., von Heijne G., Beckmann R. Small protein domains fold inside the ribosome exit tunnel. FEBS Letters. 2016;590(5):655–660. doi: 10.1002/1873-3468.12098. - DOI - PubMed
    1. Elad N., Clare D. K., Saibil H. R., Orlova E. V. Detection and separation of heterogeneity in molecular complexes by statistical analysis of their two-dimensional projections. Journal of Structural Biology. 2008;162(1):108–120. doi: 10.1016/j.jsb.2007.11.007. - DOI - PubMed

LinkOut - more resources