Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov 23;5(1):66-75.
doi: 10.1002/mgg3.262. eCollection 2017 Jan.

A comprehensive global genotype-phenotype database for rare diseases

Affiliations

A comprehensive global genotype-phenotype database for rare diseases

Daniel Trujillano et al. Mol Genet Genomic Med. .

Abstract

Background: The ability to discover genetic variants in a patient runs far ahead of the ability to interpret them. Databases with accurate descriptions of the causal relationship between the variants and the phenotype are valuable since these are critical tools in clinical genetic diagnostics. Here, we introduce a comprehensive and global genotype-phenotype database focusing on rare diseases.

Methods: This database (CentoMD ®) is a browser-based tool that enables access to a comprehensive, independently curated system utilizing stringent high-quality criteria and a quickly growing repository of genetic and human phenotype ontology (HPO)-based clinical information. Its main goals are to aid the evaluation of genetic variants, to enhance the validity of the genetic analytical workflow, to increase the quality of genetic diagnoses, and to improve evaluation of treatment options for patients with hereditary diseases. The database software correlates clinical information from consented patients and probands of different geographical backgrounds with a large dataset of genetic variants and, when available, biomarker information. An automated follow-up tool is incorporated that informs all users whenever a variant classification has changed. These unique features fully embedded in a CLIA/CAP-accredited quality management system allow appropriate data quality and enhanced patient safety.

Results: More than 100,000 genetically screened individuals are documented in the database, resulting in more than 470 million variant detections. Approximately, 57% of the clinically relevant and uncertain variants in the database are novel. Notably, 3% of the genetic variants identified and previously reported in the literature as being associated with a particular rare disease were reclassified, based on internal evidence, as clinically irrelevant.

Conclusions: The database offers a comprehensive summary of the clinical validity and causality of detected gene variants with their associated phenotypes, and is a valuable tool for identifying new disease genes through the correlation of novel genetic variants with specific, well-defined phenotypes.

Keywords: Clinical diagnostics; HPO; rare disease; variant database.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of confirmed cases and carriers by age at genetic diagnosis.
Figure 2
Figure 2
Annual evolution of screened and positive cases in CentoMD ®.
Figure 3
Figure 3
Classification of genetic variants according to their clinical significance in CentoMD ®. “Other” CRV variants include risk factors and modifiers. CRV, clinically relevant variant, CIV, clinically irrelevant variant, VUS, variant of unknown significance.
Figure 4
Figure 4
Viewing results in CentoMD database. A) GLA c.644A>G variant as displayed by the Genotype‐Phenotype module. Black arrows indicate the two search modules: Genotype to Phenotype and Phenotype to Genotype. The result table contains variant detailed information and four additional options: rationale (1), curated individuals; (2), statistics and individual view. Gray arrow: gene‐based statistics. B) Phenotype search using HPO terms: polycystic kidney disease, renal cysts, and abnormality of the liver, as displayed by Phenotype‐Genotype module. Black arrows indicate the search field using HPO terms. Gray arrow: table with similar cases. White arrow: candidate genes most likely explaining the clinical symptoms used to initiate the search. Arrow: result table of variants (CRV and VUS) identified in similar cases sharing the HPO terms used to initiate the search. By variant activation, rationale, curated individuals, statistics, and individual view are available for user.

References

    1. Abecasis, G. R. , Auton A., Brooks L. D., DePristo M. A., Durbin R. M., Handsaker R. E., et al. 2012. An integrated map of genetic variation from 1,092 human genomes. Nature 491:56–65. - PMC - PubMed
    1. Adams, D. R. , Sincan M., Fuentes Fajardo K., Mullikin J. C., Pierson T. M., Toro C., et al. 2012. Analysis of DNA sequence variants detected by high‐throughput sequencing. Hum. Mutat. 33:599–608. - PMC - PubMed
    1. Amberger, J. S. , Bocchini C. A., Schiettecatte F., Scott A. F., and Hamosh A.. 2015. OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43(Database issue):D789–D798. - PMC - PubMed
    1. Baglioni, C. 1962. The fusion of two peptide chains in hemoglobin Lepore and its interpretation as a genetic deletion. Proc. Natl Acad. Sci. USA 48:1880–1886. - PMC - PubMed
    1. Conter, C. , Rolland M. O., Cheillan D., Bonnet V., Maire I., and Froissart R.. 2006. Genetic heterogeneity of the GLDC gene in 28 unrelated patients with glycine encephalopathy. J. Inherit. Metab. Dis. 29:135–142. - PubMed