Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Apr;41(2):264-71.
doi: 10.1016/j.jbi.2007.06.008. Epub 2007 Jul 10.

Automated identification of analyzable metaphase chromosomes depicted on microscopic digital images

Affiliations

Automated identification of analyzable metaphase chromosomes depicted on microscopic digital images

Xingwei Wang et al. J Biomed Inform. 2008 Apr.

Abstract

Visual search and identification of analyzable metaphase chromosomes using optical microscopes is a very tedious and time-consuming task that is routinely performed in genetic laboratories to detect and diagnose cancers and genetic diseases. The purpose of this study is to develop and test a computerized scheme that can automatically identify chromosomes in metaphase stage and classify them into analyzable and un-analyzable groups. Two independent datasets involving 170 images are used to train and test the scheme. The scheme uses image filtering, threshold, and labeling algorithms to detect chromosomes, followed by computing a set of features for each individual chromosome as well as for each identified metaphase cell. Two machine learning classifiers including a decision tree (DT) based on the features of individual chromosomes and an artificial neural network (ANN) using the features of the metaphase cells are optimized and tested to classify between analyzable and un-analyzable cells. Using the DT based classifier the Kappa coefficients for agreement between the cytogeneticist and the scheme are 0.83 and 0.89 for the training and testing datasets, respectively. We apply an independent testing and a 2-fold cross-validation method to assess the performance of the ANN-based classifier. The area under and receiver operating characteristic (ROC) curve is 0.93 for the complete dataset. This preliminary study demonstrates the feasibility of developing a computerized scheme to automatically identify and classify metaphase chromosomes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Digital images of two metaphase chromosome cells in which (a) is considered un-analyzable cells that will be deleted and (b) is an analyzable cell that will be selected to perform karyotyping.
Figure 2
Figure 2
A flow diagram of a computerized scheme to segment chromosomes and classify metaphase cells into analyzable and un-analyzable cells.
Figure 3
Figure 3
A five-feature based DT for recognizing analyzable and un-analyzable metaphase chromosome cells. Note: F1 – Average size of each region; F2 – Circularity of each region; F3 – Average gray value of each region; F4 – Radial length of each region; Th1 - Number of regions is between Nmin = 19 and Nmax = 46; Th2 – Average size of each region is between Smin = 90 and Smax = 5000; Th3 – Circularity of each region is < CT = 0.9; Th4 - Average gray value of each region is ≥ IT = 75; Th5 - Number of regions is either < Nmin = 19 or > N max = 46; Th6 - Average size of each region is either <Smin = 90 or >S max = 5000; Th7 - Circularity of each region is ≥ CT = 0.9; Th8 - Average gray value of each region is < IT = 75; Th9 - Radial length of each region is ≥ Lk (the maximum radial length – standard deviation of radial length of all labeled regions); and Th10 - Radial length of each region is < Lk.
Figure 4
Figure 4
A scatter diagram between two features of 100 training samples including 35 analyzable (“positive”) and 65 un-analyzable (“negative”) cells.
Figure 5
Figure 5
A scatter diagram between two features of 70 testing samples including 37 analyzable (“positive”) and 33 un-analyzable (“negative”) cells.
Figure 6
Figure 6
The distribution of the computed performance points for two data subsets (“training” and “test” subsets) and the complete dataset. A ROC-type performance curve was generated based on fitting of the complete dataset using ROCFIT program.

References

    1. Tjio JH, Levan A. The chromosome number in man. Hereditas. 1956;42:1–6.
    1. Nowell PC, Hungerford DA. A minute chromosome in human chronic granulocytic leukemia. Science. 1960;132:1497. - PubMed
    1. Piper J, Granum E, Rutovitz D, Ruttledge H. Automation of chromosome analysis. Signal Processing. 1980;2:203–221.
    1. Truong K, Gibaud A, Zalcman G, Guilly M. Quantitative fish determination of chromosome 3 arm imbalances in lung tumors by automated image cytometry. Med Sci Monit. 2004;10:426–432. - PubMed
    1. Shih LM, Wang TL. Apply innovative technologies to explore cancer genome. Curr Opin Oncol. 2005;17:33–38. - PubMed

Publication types

MeSH terms

LinkOut - more resources