Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 3;105(1):177-188.
doi: 10.1016/j.ajhg.2019.05.017. Epub 2019 Jun 27.

Analyzing and Reanalyzing the Genome: Findings from the MedSeq Project

Affiliations

Analyzing and Reanalyzing the Genome: Findings from the MedSeq Project

Kalotina Machini et al. Am J Hum Genet. .

Abstract

Although genome sequencing is increasingly available in clinical and research settings, many questions remain about the interpretation of sequencing data. In the MedSeq Project, we explored how much effort is required to evaluate and report on more than 4,500 genes reportedly associated with monogenic conditions, as well as pharmacogenomic (PGx) markers, blood antigen serotyping, and polygenic risk scores in 100 individuals (50 with cardiomyopathy and 50 healthy) randomized to the sequencing arm. We defined the quality thresholds for determining the need for Sanger confirmation. Finally, we examined the effort needed and new findings revealed by reanalyzing each genome (6-23 months after initial analysis; mean 13 months). Monogenic disease risk and carrier status were reported in 21% and 94% of participants, respectively. Only two participants had no monogenic disease risk or carrier status identified. For the PGx results (18 genotypes in six genes for five drugs), the identified diplotypes prompted recommendation for non-standard dosing of at least one of the analyzed drugs in 95% of participants. For blood antigen studies, we found that 31% of participants had a rare blood antigen genotype. In the cardiomyopathy cohort, an explanation for disease was identified in 48% of individuals. Over the course of the study, 14 variants were reclassified and, upon reanalysis, 18 new variants met criteria for reporting. These findings highlight the quantity of medically relevant findings from a broad analysis of genomic sequencing data as well as the need for periodic reinterpretation and reanalysis of data for both diagnostic indications and secondary findings.

Keywords: MedSeq; clinical genomes; genome; genomic interpretation; reanalysis; secondary findings; sequencing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
General Filtration Approach and Overall Outcome (A) Schematic representation of the filters used for rare-variant interpretation of the MedSeq genomes. HGMD = Human Gene Mutation Database; LoF = loss of function. (B) Pie chart showing the results of gene curation performed for all genes that had at least one variant that came through the MedSeq genome filter. (C) Bar graph showing the breakdown of unique variants queued for review and their sources (HGMD versus LoF). Variants were excluded either for insufficient gene-disease validity (gene disease validity) or for high MedSeq allele frequency (platform-specific frequency). Please note that several variants were excluded on the basis of both gene disease validity and platform-specific frequency. (D) Graph showing the reduction of variants that required review as a function of the number of genomes reviewed.
Figure 2
Figure 2
Framework for Predicted LoF Variant Assessment and Results of Manual Variant Curation (A) Final variant classifications with filter source noted. Blue bars represent variants from the HGMD filter, whereas orange bars illustrate novel predicted LoF variants. (B) Schematic depiction of the LoF checklist designed to assist in the rapid and accurate classification of predicted LoF variants. (C) Application of the predicted LoF checklist on novel as well as previously reported predicted LoF variants.
Figure 3
Figure 3
Summary of Genome Reanalysis Findings and Variant Reclassification (A) The middle pie chart depicts the overall findings, including the number of variants added onto the report upon genome reanalysis (shown in blue), and variants reclassified over the course of the study (shown in green). The pie chart on the left represents reportable variants newly discovered as a result of pipeline updates (light blue) or existing variants newly reported based on new evidence during variant reanalysis (blue). The pie chart on the right shows the consequences of reclassification of previously reported variants; variants removed from reports are shown in dark green, and variants with a category change only (but still reportable) are shown in light green. (B) Schematic illustration of variant reclassification categories. The middle pie chart shows the number of variants reclassified (relevant to indication in green, monogenic disease risk in red, and carrier status in blue). The pie chart on the left illustrates the various classification changes in carrier status variants, whereas the pie chart on the right focuses on variant reclassifications relative to cardiomyopathy indication. Please note that one monogenic disease variant was reclassified (from VUS-FP to N/A) and was removed from the report because new evidence disputed the gene’s association with disease.
Figure 4
Figure 4
Results of Sanger Confirmation with Scatterplot of Variants That Underwent Sanger Testing (A and B) Blue circles represent true positive (TP) indels and gray circles represent TP SNPs. Orange crosses represent false positive (FP) indels and yellow crosses represent FP SNPs. (A)The y axis corresponds to the quality by depth (QD) and the x axis to the mapping quality (MQ). The right plot is a magnification showing variants with QD < 6. The red line indicates the minimum QD threshold, below which we stopped follow-up (QD < 4). The two TP variants with QD < 4 are circled in black. (B) The y axis corresponds to the Fisher strand bias (FS) and the x axis to the MQ. The right plot is a magnification showing variants with FS < 35. The red line indicates the minimum FS threshold, above which we stopped follow-up (FS > 30). The six FP variants that also had QD > 4 are circled in black.

References

    1. Richards S., Aziz N., Bale S., Bick D., Das S., Gastier-Foster J., Grody W.W., Hegde M., Lyon E., Spector E., ACMG Laboratory Quality Assurance Committee Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 2015;17:405–424. - PMC - PubMed
    2. Richards, S., Aziz, N., Bale, S., Bick, D., Das, S., Gastier-Foster, J., Grody, W.W., Hegde, M., Lyon, E., Spector, E., et al.; ACMG Laboratory Quality Assurance Committee (2015). Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405-424. - PMC - PubMed
    1. Harrison S.M., Dolinsky J.S., Knight Johnson A.E., Pesaran T., Azzariti D.R., Bale S., Chao E.C., Das S., Vincent L., Rehm H.L. Clinical laboratories collaborate to resolve differences in variant interpretations submitted to ClinVar. Genet. Med. 2017;19:1096–1104. - PMC - PubMed
    2. Harrison, S.M., Dolinsky, J.S., Knight Johnson, A.E., Pesaran, T., Azzariti, D.R., Bale, S., Chao, E.C., Das, S., Vincent, L., and Rehm, H.L. (2017). Clinical laboratories collaborate to resolve differences in variant interpretations submitted to ClinVar. Genet. Med. 19, 1096-1104. - PMC - PubMed
    1. Landrum M.J., Lee J.M., Benson M., Brown G., Chao C., Chitipiralla S., Gu B., Hart J., Hoffman D., Hoover J. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 2016;44(D1):D862–D868. - PMC - PubMed
    2. Landrum, M.J., Lee, J.M., Benson, M., Brown, G., Chao, C., Chitipiralla, S., Gu, B., Hart, J., Hoffman, D., Hoover, J., et al. (2016). ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 44 (D1), D862-D868. - PMC - PubMed
    1. Stenson P.D., Mort M., Ball E.V., Evans K., Hayden M., Heywood S., Hussain M., Phillips A.D., Cooper D.N. The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Hum. Genet. 2017;136:665–677. - PMC - PubMed
    2. Stenson, P.D., Mort, M., Ball, E.V., Evans, K., Hayden, M., Heywood, S., Hussain, M., Phillips, A.D., and Cooper, D.N. (2017). The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Hum. Genet. 136, 665-677. - PMC - PubMed
    1. Pinard A., Miltgen M., Blanchard A., Mathieu H., Desvignes J.P., Salgado D., Fabre A., Arnaud P., Barré L., Krahn M. Actionable genes, core databases, and locus-specific databases. Hum. Mutat. 2016;37:1299–1307. - PubMed
    2. Pinard, A., Miltgen, M., Blanchard, A., Mathieu, H., Desvignes, J.P., Salgado, D., Fabre, A., Arnaud, P., Barre, L., Krahn, M., et al. (2016). Actionable genes, core databases, and locus-specific databases. Hum. Mutat. 37, 1299-1307. - PubMed

Publication types

LinkOut - more resources