Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jul 3;13(7):2728-2736.
doi: 10.1021/cg3016029.

Real-Time Protein Crystallization Image Acquisition and Classification System

Affiliations

Real-Time Protein Crystallization Image Acquisition and Classification System

Madhav Sigdel et al. Cryst Growth Des. .

Abstract

In this paper, we describe the design and implementation of a stand-alone real-time system for protein crystallization image acquisition and classification with a goal to assist crystallographers in scoring crystallization trials. In-house assembled fluorescence microscopy system is built for image acquisition. The images are classified into three categories as non-crystals, likely leads, and crystals. Image classification consists of two main steps - image feature extraction and application of classification based on multilayer perceptron (MLP) neural networks. Our feature extraction involves applying multiple thresholding techniques, identifying high intensity regions (blobs), and generating intensity and blob features to obtain a 45-dimensional feature vector per image. To reduce the risk of missing crystals, we introduce a max-class ensemble classifier which applies multiple classifiers and chooses the highest score (or class). We performed our experiments on 2250 images consisting 67% non-crystal, 18% likely leads, and 15% clear crystal images and tested our results using 10-fold cross validation. Our results demonstrate that the method is very efficient (< 3 seconds to process and classify an image) and has comparatively high accuracy. Our system only misses 1.2% of the crystals (classified as non-crystals) most likely due to low illumination or out of focus image capture and has an overall accuracy of 88%.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Layout of assembled microscopy system
Figure 2
Figure 2
Basic flowchart of protein image acquisition process
Figure 3
Figure 3
Sample images under non-crystal category
Figure 4
Figure 4
Sample images under likely leads category
Figure 5
Figure 5
Sample images under crystals category
Figure 6
Figure 6
Flowchart of the integrated algorithm for feature extraction
Figure 7
Figure 7
a, c, and e: median-filtered images; b, d, and f: the Otsu thresholded images for a, c, and e, respectively
Figure 8
Figure 8
Results of showing the application of the 3 thresholding techniques for a sample image. a) Original b) Otsu’s threshold c) 90th per threshold d) Max green threshold
Figure 9
Figure 9
Separating background and foreground region. a) M (Image after noise removal) b) Binary image c) Background pixels d) Foreground pixels
Figure 10
Figure 10
a) M (image after noise removal) b) Binary image, B c) O:Objects (regions) using connected component labeling d) Ω: the skeletonization of object
Figure 11
Figure 11
a) Boundary uniformity b) Measure of symmetry
Figure 12
Figure 12
Precision-recall plot (NC: non-crystal; LL: likely leads; C: crystal, O: Overall) a) Using MLP classifier with all features b) Using max-class ensemble method
Figure 13
Figure 13
Crystals classified as non-crystal using max-class ensemble classifier

References

    1. Bern M, et al. Automatic classification of protein crystallization images using a curve-tracking algorithm. J Appl Cryst. 2004;37:279–287.
    1. Berry IM, et al. SPINE high-throughput crystallization, crystal imaging and recognition techniques: current state, performance analysis, new technologies and future aspects. Acta Cryst. 2006;D62:1137–1149. - PubMed
    1. Cumbaa CA, et al. Automatic classification of sub-microlitre protein-crystallization trials in 1536-well plates. Acta Cryst. 2003;D59:1619–1627. - PubMed
    1. Cumbaa CA, Jurisica I. Automatic classification and pattern discovery in high-throughput protein crystallization trials. J Struct Funct Gen. 2005;6:195–202. - PubMed
    1. Cumbaa CA, Jurisica I. Protein Crystallization Analysis on the World Community Grid. J Struct Funct Genomics. 2010;11:61–69. - PMC - PubMed