Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 23;10(1):104.
doi: 10.1038/s41597-023-02003-7.

An Open Dataset of Annotated Metaphase Cell Images for Chromosome Identification

Affiliations

An Open Dataset of Annotated Metaphase Cell Images for Chromosome Identification

Jenn-Jhy Tseng et al. Sci Data. .

Abstract

Chromosomes are a principal target of clinical cytogenetic studies. While chromosomal analysis is an integral part of prenatal care, the conventional manual identification of chromosomes in images is time-consuming and costly. This study developed a chromosome detector that uses deep learning and that achieved an accuracy of 98.88% in chromosomal identification. Specifically, we compiled and made available a large and publicly accessible database containing chromosome images and annotations for training chromosome detectors. The database contains five thousand 24 chromosome class annotations and 2,000 single chromosome annotations. This database also contains examples of chromosome variations. Our database provides a reference for researchers in this field and may help expedite the development of clinical applications.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Example of a raw chromosome image with three annotated datasets. (a) Original chromosome image taken from fetal amniotic fluid; (b) annotation of single chromosomes; (c) annotations of 24 chromosome categories.
Fig. 2
Fig. 2
Karyotype (46, XY) produced by an expert processing from the original chromosome map.
Fig. 3
Fig. 3
Examples of difficult image according to three definitions. (a) Multiple chromosomal overlaps; (b) suboptimal dark and light banding; (c) excessively elongated chromosomes.
Fig. 4
Fig. 4
Two images containing detected chromosomes. (a) Simple and (b) difficult images. Detection accuracy was 100% with a simple image. Multiple overlapping and adherent chromosomes make detection more difficult. Chromosomes not captured correctly were those that fell between three overlapping chromosomes.
Fig. 5
Fig. 5
Curve of number of images and model accuracy (%).

References

    1. Wapner RJ, et al. Chromosomal microarray versus karyotyping for prenatal diagnosis. New England Journal of Medicine. 2012;367:2175–2184. doi: 10.1056/NEJMoa1203382. - DOI - PMC - PubMed
    1. Carlson LM, Vora NL. Prenatal diagnosis: screening and diagnostic tools. Obstetrics and Gynecology Clinics. 2017;44:245–256. - PMC - PubMed
    1. Theisen A, Shaffer LG. Disorders caused by chromosome abnormalities. The application of clinical genetics. 2010;3:159. - PMC - PubMed
    1. Jindal, S., Gupta, G., Yadav, M., Sharma, M. & Vig, L. in Proceedings of the IEEE international conference on computer vision workshops. 72–81.
    1. Karvelis, P. S., Fotiadis, D. I., Georgiou, I. & Syrrou, M. in 2006 International Conference of the IEEE Engineering in Medicine and Biology Society. 3009–3012 (IEEE). - PubMed