Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jan;15(1):36-44.
doi: 10.1038/gim.2012.112. Epub 2012 Sep 20.

An informatics approach to analyzing the incidentalome

Affiliations

An informatics approach to analyzing the incidentalome

Jonathan S Berg et al. Genet Med. 2013 Jan.

Abstract

Purpose: Next-generation sequencing has transformed genetic research and is poised to revolutionize clinical diagnosis. However, the vast amount of data and inevitable discovery of incidental findings require novel analytic approaches. We therefore implemented for the first time a strategy that utilizes an a priori structured framework and a conservative threshold for selecting clinically relevant incidental findings.

Methods: We categorized 2,016 genes linked with Mendelian diseases into "bins" based on clinical utility and validity, and used a computational algorithm to analyze 80 whole-genome sequences in order to explore the use of such an approach in a simulated real-world setting.

Results: The algorithm effectively reduced the number of variants requiring human review and identified incidental variants with likely clinical relevance. Incorporation of the Human Gene Mutation Database improved the yield for missense mutations but also revealed that a substantial proportion of purported disease-causing mutations were misleading.

Conclusion: This approach is adaptable to any clinically relevant bin structure, scalable to the demands of a clinical laboratory workflow, and flexible with respect to advances in genomics. We anticipate that application of this strategy will facilitate pretest informed consent, laboratory analysis, and posttest return of results in a clinical context.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1. Selection of variants based on allele frequency and predicted effect on the translated protein
(A) The initial informatics analysis resulted in an average of ~13,000 variants per person in Bin 1 genes, ~175,000 variants per person in Bin 2b genes, and ~9000 variants per person in Bin 2c genes. (B) Limiting these variants to <5% AF or <1% AF reduces these counts ~10–15 fold, respectively. (C) Restricting to protein-coding variants (missense, nonsense, frameshift, splice site) at <5% AF results in ~10 variants per person in Bin 1 genes and 100–200 variants per person in Bin 2b genes. At <1% AF there were ~5 variants per person in bin 1 genes and 50–100 variants per person in Bin 2b genes. (D) Restricting only to truncating variants (nonsense, frameshift, splice site) results in only a small number of variants to be analyzed by the reviewer. Interestingly, the AF cut-off (<5% vs. <1%) does not dramatically affect the number of truncating variants that are selected.
FIGURE 2
FIGURE 2. Analysis of mutations annotated as “DM” in HGMD
(A) All variants were queried against the HGMD to identify variants classified as “DM”. The numbers of rare (<5% AF) missense variants in all bins and the numbers of HGMD “DM” variants per person are depicted as a box-whisker plot with whiskers indicating the 5th and 95th percentiles and outliers shown as filled circles. Homozygous variants are counted twice. (B) The overlap between the rare missense variants and “DM” variants is depicted as a Venn diagram. (C) The maximum 1000 Genomes allele frequencies were determined for each variant identified in the 80 whole genomes and histograms of allele frequencies were generated for each person. These histograms were then combined to depict the average number of variants per person within each range of allele frequencies (depicted as a bar plot with standard deviations). (D) The maximum 1000 Genomes allele frequencies were determined for all “DM” variants in HGMD and graphed as a histogram. The inset shows the distribution for “DM” variants with >1% allele frequencies.
FIGURE 3
FIGURE 3. Results of the manual review of variants selected by the informatics algorithm
After individual review of the 906 unique variants returned by the final informatics algorithm, 45% were reassigned or removed from consideration. The graphs depict the variants initially selected within a given “bin” and the stacked segments represent the proportions of those variants that were confirmed, reassigned, or removed after review (see legend). Figure (A) shows all 906 unique variants, (B) shows the 392 rare truncating variants identified by the algorithm and (C) shows the 514 rare “DM” variants from HGMD. A higher proportion of “DM” variants in each bin category were removed from consideration compared to novel truncating variants.

Comment in

  • Preserving personal autonomy in a genomic testing era.
    Lindor NM, Johnson KJ, McCormick JB, Klee EW, Ferber MJ, Farrugia G. Lindor NM, et al. Genet Med. 2013 May;15(5):408-9. doi: 10.1038/gim.2013.24. Genet Med. 2013. PMID: 23649380 No abstract available.
  • Response to Lindor et al.
    Berg JS. Berg JS. Genet Med. 2013 May;15(5):409-10. doi: 10.1038/gim.2013.25. Genet Med. 2013. PMID: 23649381 No abstract available.

References

    1. Lupski JR, Reid JG, Gonzaga-Jauregui C, et al. Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy. N Engl J Med. 2010;362:1181–1191. - PMC - PubMed
    1. Ashley EA, Butte AJ, Wheeler MT, et al. Clinical assessment incorporating a personal genome. Lancet. 2010;375:1525–1535. - PMC - PubMed
    1. Worthey EA, Mayer AN, Syverson GD, et al. Making a definitive diagnosis: successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease. Genet Med. 2011;13:255–262. - PubMed
    1. Bick D, Dimmock D. Whole exome and whole genome sequencing. Curr Opin Pediatr. 2011;23:594–600. - PubMed
    1. Kohane IS, Masys DR, Altman RB. The incidentalome: a threat to genomic medicine. JAMA. 2006;296:212–215. - PubMed

Publication types