Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar:2014:10.1109/SECON.2014.6950744.
doi: 10.1109/SECON.2014.6950744.

Evaluation of Normalization and PCA on the Performance of Classifiers for Protein Crystallization Images

Affiliations

Evaluation of Normalization and PCA on the Performance of Classifiers for Protein Crystallization Images

İmren Dinç et al. Proc IEEE Southeastcon. 2014 Mar.

Abstract

In this paper, we investigate the performance of classification of protein crystallization images captured during protein crystal growth process. We group protein crystallization images into 3 categories: noncrystals, likely leads (conditions that may yield formation of crystals) and crystals. In this research, we only consider the subcategories of noncrystal and likely leads protein crystallization images separately. We use 5 different classifiers to solve this problem and we applied some data preprocessing methods such as principal component analysis (PCA), min-max (MM) normalization and z-score (ZS) normalization methods to our datasets in order to evaluate their effects on classifiers for the noncrystal and likely leads datasets. We performed our experiments on 1606 noncrystal and 245 likely leads images independently. We had satisfactory results for both datasets. We reached 96.8% accuracy for noncrystal dataset and 94.8% accuracy for likely leads dataset. Our target is to investigate the best classifiers with optimal preprocessing techniques on both noncrystal and likely leads datasets.

Keywords: classification; normalization; principal component analysis; protein crystallization.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Clear Drop
Fig. 2
Fig. 2
Regular Granular Precipitate
Fig. 3
Fig. 3
Granular Precipitate or Microcrystals
Fig. 4
Fig. 4
Unclear Bright Regions
Fig. 5
Fig. 5
Effects of data normalization on classifiers for noncrystal dataset
Fig. 6
Fig. 6
Effects of data normalization on classifiers for likely leads dataset

References

    1. Pan S, et al. Automated classification of protein crystallization images using support vector machines with scale-invariant texture and Gabor features. Acta Crystallographica Section D. 2006 Mar;62(3):271–279. - PubMed
    1. Rupp B, Wang J. Predictive Models for Protein Crystallization. Methods. 2004;34(3):390–407. - PubMed
    1. Cumbaa CA, Jurisica I. Protein crystallization analysis on the World Community Grid. Journal of Structural and Functional Genomics. 2010 Jan;11(1):61–69. - PMC - PubMed
    1. Sigdel M, Pusey ML, Aygun RS. Real-Time Protein Crystallization Image Acquisition and Classification System. Crystal Growth & Design. 2013;13(7):2728–2736. - PMC - PubMed
    1. Bern M, Goldberg D, Stevens RC, Kuhn P. Automatic classification of protein crystallization images using a curve-tracking algorithm. Journal of Applied Crystallography. 2007 Apr;37(2):279–-287.