Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 6;390(21):1985-1997.
doi: 10.1056/NEJMoa2314761.

Genome Sequencing for Diagnosing Rare Diseases

Monica H Wojcik  1 Gabrielle Lemire  1 Eva Berger  1 Maha S Zaki  1 Mariel Wissmann  1 Wathone Win  1 Susan M White  1 Ben Weisburd  1 Dagmar Wieczorek  1 Leigh B Waddell  1 Jeffrey M Verboon  1 Grace E VanNoy  1 Ana Töpf  1 Tiong Yang Tan  1 Steffen Syrbe  1 Vincent Strehlow  1 Volker Straub  1 Sarah L Stenton  1 Hana Snow  1 Moriel Singer-Berk  1 Josh Silver  1 Shirlee Shril  1 Eleanor G Seaby  1 Ronen Schneider  1 Vijay G Sankaran  1 Alba Sanchis-Juan  1 Kathryn A Russell  1 Karit Reinson  1 Gianina Ravenscroft  1 Maximilian Radtke  1 Denny Popp  1 Tilman Polster  1 Konrad Platzer  1 Eric A Pierce  1 Emily M Place  1 Sander Pajusalu  1 Lynn Pais  1 Katrin Õunap  1 Ikeoluwa Osei-Owusu  1 Henry Opperman  1 Volkan Okur  1 Kaisa Teele Oja  1 Melanie O'Leary  1 Emily O'Heir  1 Chantal F Morel  1 Andreas Merkenschlager  1 Rhett G Marchant  1 Brian E Mangilog  1 Jill A Madden  1 Daniel MacArthur  1 Alysia Lovgren  1 Jordan P Lerner-Ellis  1 Jasmine Lin  1 Nigel Laing  1 Friedhelm Hildebrandt  1 Julia Hentschel  1 Emily Groopman  1 Julia Goodrich  1 Joseph G Gleeson  1 Roula Ghaoui  1 Casie A Genetti  1 Janina Gburek-Augustat  1 Hanna T Gazda  1 Vijay S Ganesh  1 Mythily Ganapathi  1 Lyndon Gallacher  1 Jack M Fu  1 Emily Evangelista  1 Eleina England  1 Sandra Donkervoort  1 Stephanie DiTroia  1 Sandra T Cooper  1 Wendy K Chung  1 John Christodoulou  1 Katherine R Chao  1 Liam D Cato  1 Kinga M Bujakowska  1 Samantha J Bryen  1 Harrison Brand  1 Carsten G Bönnemann  1 Alan H Beggs  1 Samantha M Baxter  1 Tobias Bartolomaeus  1 Pankaj B Agrawal  1 Michael Talkowski  1 Christina Austin-Tse  1 Rami Abou Jamra  1 Heidi L Rehm  1 Anne O'Donnell-Luria  1
Affiliations

Genome Sequencing for Diagnosing Rare Diseases

Monica H Wojcik et al. N Engl J Med. .

Abstract

Background: Genetic variants that cause rare disorders may remain elusive even after expansive testing, such as exome sequencing. The diagnostic yield of genome sequencing, particularly after a negative evaluation, remains poorly defined.

Methods: We sequenced and analyzed the genomes of families with diverse phenotypes who were suspected to have a rare monogenic disease and for whom genetic testing had not revealed a diagnosis, as well as the genomes of a replication cohort at an independent clinical center.

Results: We sequenced the genomes of 822 families (744 in the initial cohort and 78 in the replication cohort) and made a molecular diagnosis in 218 of 744 families (29.3%). Of the 218 families, 61 (28.0%) - 8.2% of families in the initial cohort - had variants that required genome sequencing for identification, including coding variants, intronic variants, small structural variants, copy-neutral inversions, complex rearrangements, and tandem repeat expansions. Most families in which a molecular diagnosis was made after previous nondiagnostic exome sequencing (63.5%) had variants that could be detected by reanalysis of the exome-sequence data (53.4%) or by additional analytic methods, such as copy-number variant calling, to exome-sequence data (10.8%). We obtained similar results in the replication cohort: in 33% of the families in which a molecular diagnosis was made, or 8% of the cohort, genome sequencing was required, which showed the applicability of these findings to both research and clinical environments.

Conclusions: The diagnostic yield of genome sequencing in a large, diverse research cohort and in a small clinical cohort of persons who had previously undergone genetic testing was approximately 8% and included several types of pathogenic variation that had not previously been detected by means of exome sequencing or other techniques. (Funded by the National Human Genome Research Institute and others.).

PubMed Disclaimer

Figures

Figure 1.
Figure 1.. Examples of Variants or Regions Requiring Genome Sequencing.
Shown are some of the reasons that variants require genome sequencing for detection. These include deep intronic variants (Panel A, arrow), which are unlikely to be reliably detected by typical exome-sequencing methods (>20 bp upstream or downstream from the beginning or end of an exon); variants that are missed with previous exome sequencing because of poor coverage of the region (Panel B, arrow pointing to a region of lower exome coverage in blue as compared with genome coverage in green for the PCSK9 gene, on the gnomAD, version 2, browser); tandem repeat expansions and some structural variants such as copy-neutral inversions; and small (Panel C, approximately 50 to 2000 bp), largely intronic copy-number variants, or complex events involving more than one type of structural variant.
Figure 2.
Figure 2.. Diagnostic Yield According to Phenotype and Genetic Ancestry.
Probands were categorized according to the most prominent phenotypic features (Panel A) and genetic ancestry (Panel B). The percentage of families in which diagnoses were made is shown. Genetic ancestry was determined by gnomAD, version 2, random forest model, and a genetic ancestry was assigned to all samples for which the probability of that ancestry was more than 90%. All samples that did not meet the 90% threshold were imputed as “multiple or unassigned.”
Figure 3.
Figure 3.. Classification of Families with a Molecular Diagnosis.
Among the 218 families in which a molecular diagnosis or probable molecular diagnosis was obtained by genome sequencing, the types of variants that required genome sequencing for identification (61 families) are shown. A total of 21 (34%) were structural variants, including 11 deletions (2 in noncoding regions), 2 duplications, 4 inversions, 3 complex deletion–duplication events, and 1 mobile element insertion in a noncoding region. In addition, 6 tandem repeat expansions were identified. Of these variants, 19 (31%) were coding variants that had been missed in previous exome sequencing because of poor coverage of the region, which resulted in either no variant at that site or a poor-quality variant that was filtered out by variant quality control and hence overlooked in the analysis process. Most of these variants (13 of 19) were small indels (usually 10 to 15 bp in size), of which 8 were deletions. Fifteen (25%) of the diagnoses requiring genome sequencing involved deep intronic noncoding variants (complemented by RNA sequencing for 8 persons). Two persons had variants that were missed on exome sequencing for more than one reason (a structural variant in a new gene in one case and another case involving a promoted variant plus a new gene contributing to a blended phenotype). Of the 157 families in which the causal variants were detectable by exome sequencing, 94 (59.9%) had had previous exome sequencing that missed the variant, whereas the remaining 63 had not had previous exome sequencing but were diagnosed through the identification of variants expected to be detectable by current exome-sequencing methods. Two diagnostic mitochondrial DNA (mtDNA) variants were identified by genome sequencing and missed with previous exome sequencing, although these variants were also identified by mitochondrial genome sequencing. CNV denotes copy-number variant.
Figure 4.
Figure 4.. Reasons for Lack of Diagnosis by Previous Exome Sequencing.
The number of families with diagnoses missed for each reason is shown. The 14 families with deep intronic variants required genome sequencing for identification of diagnoses, and 10 families required additional validation (8 required RNA sequencing and 2 required other functional validation).

References

    1. Yang Y, Muzny DM, Reid JG, et al. Clinical whole-exome sequencing for the diagnosis of mendelian disorders. N Engl J Med 2013; 369:1 502–11. - PMC - PubMed
    1. Kim J, Hu C, Moufawad El Achkar C, et al. Patient-customized oligonucleotide therapy for a rare genetic disease. N Engl J Med 2019; 381:1 644–52. - PMC - PubMed
    1. Kim J, Woo S, de Gusmao CM, et al. A framework for individualized splice-switching oligonucleotide therapy. Nature 2023; 619:8 28–36. - PMC - PubMed
    1. Shashi V, Schoch K, Spillmann R, et al. A comprehensive iterative approach is highly effective in diagnosing individuals who are exome negative. Genet Med 2019; 21:1 61–72. - PMC - PubMed
    1. Splinter K, Adams DR, Bacino CA, et al. Effect of genetic diagnosis on patients with previously undiagnosed disease. N Engl J Med 2018; 379: 2131–9. - PMC - PubMed

Publication types

MeSH terms

Grants and funding