Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep 5;105(3):448-455.
doi: 10.1016/j.ajhg.2019.07.011.

Mendelian Gene Discovery: Fast and Furious with No End in Sight

Affiliations

Mendelian Gene Discovery: Fast and Furious with No End in Sight

Michael J Bamshad et al. Am J Hum Genet. .

Abstract

Gene discovery for Mendelian conditions (MCs) offers a direct path to understanding genome function. Approaches based on next-generation sequencing applied at scale have dramatically accelerated gene discovery and transformed genetic medicine. Finding the genetic basis of ∼6,000-13,000 MCs yet to be delineated will require both technical and computational innovation, but will rely to a larger extent on meaningful data sharing.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Annualized Metrics of Gene Discovery for Mendelian Conditions (A) Approximate rates of reported gene discoveries for Mendelian conditions (MCs) and of delineation of MCs over time (1900–2017). Trends in reported (i.e., published) delineations of new MCs, including the so-called “Golden Age” of syndrome delineation in the 1970s, leading to a peak throughout that decade. These data also show the impact of technical and methodological advances that fueled gene discovery, namely the impact of positional cloning in 1986; the development of dense, genome-wide linkage maps in the early 1990s; and increasing knowledge, gained via the Human Genome Project (1990–2001), of the physical locations and sequence content of genes. Rates shown for gene discoveries and syndrome delineations reflect publications, not unpublished discoveries or syndrome delineations, as recorded in OMIM and extracted by text analysis. The dashed line represents the number of genes for MCs reported discovered by the Centers of Mendelian Genomics, most of which are unpublished. (B) Approximate number of gene discoveries per year for MCs made by exome sequencing (ES) and next-generation sequencing (NGS) versus conventional approaches. Following the introduction of positional cloning in 1986 and of ES in 2010, there were rapid increases in the rate of gene discovery for MCs. Each new approach made gene discovery possible for MCs that had otherwise been intractable to prior approaches, and this added to the baseline rate of gene discovery. Since 2010, NGS-based approaches (blue) have been used to make nearly all gene discoveries for MCs compared to conventional approaches (red). (C) Approximate number of gene discoveries per year for MCs by mode of inheritance. The estimated proportion of gene discoveries for MCs due to de novo variants (DNVs, red) has increased since 2010 as NGS made routine identification of such variants possible (see Supplemental Methods in Supplemental Data). However, the proportions of gene discoveries for autosomal recessive (green), dominant (orange), and X-linked (blue) MCs each continue to be equal to or greater than the number of discoveries for MCs due to de novo (red) variants. Moreover, until 2010, the vast majority of gene discoveries for MCs were for inherited conditions (∼97% before 2010; ∼89% from 2010–2016; and ∼79% in 2017), so still, most MCs known to date (∼90%–93%) are predominately due to inherited variants. MCs assessed as being attributable to DNVs were excluded from the autosomal dominant and X-linked groups and vice versa, and MCs attributed to both dominant and recessive variants are not shown. Modes of inheritance were inferred by text analysis of OMIM entries. (D) Impact of ES and NGS on the rate and method of syndrome delineation. Classical syndrome delineation (orange) is phenotype-driven and proceeds by ascertaining multiple individuals with overlapping clinical findings and then identifying of the underlying gene. In contrast, for genotype-driven syndrome delineation (teal), persons with overlapping clinical findings are identified only after discovery that they share pathogenic variants in the same candidate gene. Introduction of NGS-based approaches rapidly extinguished phenotype-driven syndrome delineation, and as of 2017, new MCs have been reported only after discovery of the underlying gene. MCs for which the gene was discovered in the same year as the first publication of data from an individual with the MC were categorized as genotype-driven; MCs for which data on the first individual with the MC was published one year or more prior to gene discovery were categorized as phenotype-driven (see Supplemental Methods in Supplemental Data).
Figure 2
Figure 2
Estimated Number of Genes for Mendelian Conditions (A) Priority candidate genes for Mendelian conditions (MCs) yet to be discovered or delineated can be identified based on detecting a deficit of variation in healthy human controls due to purifying selection (i.e., human data) or the existence of at least one mutant mouse line with an abnormal phenotype (i.e., mouse data). Each stacked bar illustrates, from left to right, the number of human genes known to underlie an MC but not supported by a given type of evidence (dark blue); known to underlie an MC and supported by evidence (light blue); not known to underlie an MC (i.e., novel) and not supported by evidence (burnt orange); and not known to underlie an MC but evidence suggests that it does and it is therefore a priority candidate gene (bright orange) supported by evidence. Selecting the intersection of genes supported by both mouse and human data yields 4,450 priority candidate genes that are likely to underlie one or more novel MCs. (B) Alternatively, the union of genes supported by either mouse or human data suggests there are at least 10,467 priority candidate genes likely to underlie one or more new MCs. Both of these estimates are probably conservative for several reasons: ∼25% of genes underlie two or more distinct MCs (adjusting for this yields ∼6,100–14,400 potential novel MCs), mutant phenotypes for >12,000 mouse genes have not yet been assessed, and current human constraint metrics still lack power to detect constraint in ∼30% of all genes and are underpowered to detect constraint against homozygous loss of function. (See Supplemental Data for details.)

Similar articles

Cited by

References

    1. Antonarakis S.E., Beckmann J.S. Mendelian disorders deserve more attention. Nat. Rev. Genet. 2006;7:277–282. - PubMed
    1. Starita L.M., Islam M.M., Banerjee T., Adamovich A.I., Gullingsrud J., Fields S., Shendure J., Parvin J.D. A multiplex homology-directed DNA repair assay reveals the impact of more than 1,000 BRCA1 missense substitution variants on protein function. Am. J. Hum. Genet. 2018;103:498–508. - PMC - PubMed
    1. Ramsey B.W., Davies J., McElvaney N.G., Tullis E., Bell S.C., Dřevínek P., Griese M., McKone E.F., Wainwright C.E., Konstan M.W., VX08-770-102 Study Group A CFTR potentiator in patients with cystic fibrosis and the G551D mutation. N. Engl. J. Med. 2011;365:1663–1672. - PMC - PubMed
    1. Ng S.B., Buckingham K.J., Lee C., Bigham A.W., Tabor H.K., Dent K.M., Huff C.D., Shannon P.T., Jabs E.W., Nickerson D.A. Exome sequencing identifies the cause of a mendelian disorder. Nat. Genet. 2010;42:30–35. - PMC - PubMed
    1. Ng S.B., Bigham A.W., Buckingham K.J., Hannibal M.C., McMillin M.J., Gildersleeve H.I., Beck A.E., Tabor H.K., Cooper G.M., Mefford H.C. Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat. Genet. 2010;42:790–793. - PMC - PubMed

Publication types

MeSH terms