A clinically validated whole genome pipeline for structural variant detection and analysis
- PMID: 31307387
- PMCID: PMC6631445
- DOI: 10.1186/s12864-019-5866-z
A clinically validated whole genome pipeline for structural variant detection and analysis
Abstract
Background: With the continuing decrease in cost of whole genome sequencing (WGS), we have already approached the point of inflection where WGS testing has become economically feasible, facilitating broader access to the benefits that are helping to define WGS as the new diagnostic standard. WGS provides unique opportunities for detection of structural variants; however, such analyses, despite being recognized by the research community, have not previously made their way into routine clinical practice.
Results: We have developed a clinically validated pipeline for highly specific and sensitive detection of structural variants basing on 30X PCR-free WGS. Using a combination of breakpoint analysis of split and discordant reads, and read depth analysis, the pipeline identifies structural variants down to single base pair resolution. False positives are minimized using calculations for loss of heterozygosity and bi-modal heterozygous variant allele frequencies to enhance heterozygous deletion and duplication detection respectively. Compound and potential compound combinations of structural variants and small sequence changes are automatically detected. To facilitate clinical interpretation, identified variants are annotated with phenotype information derived from HGMD Professional and population allele frequencies derived from public and Variantyx allele frequency databases. Single base pair resolution enables easy visual inspection of potentially causal variants using the IGV genome browser as well as easy biochemical validation via PCR. Analytical and clinical sensitivity and specificity of the pipeline has been validated using analysis of Genome in a Bottle reference genomes and known positive samples confirmed by orthogonal sequencing technologies.
Conclusion: Consistent read depth of PCR-free WGS enables reliable detection of structural variants of any size. Annotation both on gene and variant level allows clinicians to match reported patient phenotype with detected variants and confidently report causative finding in all clinical cases used for validation.
Keywords: Break point; CNV; Clinical validation; Deletion; Diagnostic console; Duplication; Pipeline; Structural variants; WGS; Whole genome sequencing.
Conflict of interest statement
NN, GF, NM, SM and AK are affiliated with Variantyx Inc.
Figures




Similar articles
-
From cytogenetics to cytogenomics: whole-genome sequencing as a first-line test comprehensively captures the diverse spectrum of disease-causing genetic variation underlying intellectual disability.Genome Med. 2019 Nov 7;11(1):68. doi: 10.1186/s13073-019-0675-1. Genome Med. 2019. PMID: 31694722 Free PMC article.
-
ONT long-read WGS for variant discovery and orthogonal confirmation of short read WGS derived genetic variants in clinical genetic testing.Front Genet. 2023 Apr 21;14:1145285. doi: 10.3389/fgene.2023.1145285. eCollection 2023. Front Genet. 2023. PMID: 37152986 Free PMC article.
-
Evaluation of the performance of copy number variant prediction tools for the detection of deletions from whole genome sequencing data.J Biomed Inform. 2019 Jun;94:103174. doi: 10.1016/j.jbi.2019.103174. Epub 2019 Apr 6. J Biomed Inform. 2019. PMID: 30965134 Review.
-
Test development, optimization and validation of a WGS pipeline for genetic disorders.BMC Med Genomics. 2023 Apr 5;16(1):74. doi: 10.1186/s12920-023-01495-x. BMC Med Genomics. 2023. PMID: 37020281 Free PMC article.
-
Analytical demands to use whole-genome sequencing in precision oncology.Semin Cancer Biol. 2022 Sep;84:16-22. doi: 10.1016/j.semcancer.2021.06.009. Epub 2021 Jun 10. Semin Cancer Biol. 2022. PMID: 34119643 Review.
Cited by
-
Familial Early-Onset Alzheimer's Caused by Novel Genetic Variant and APP Duplication: A Cross-Sectional Study.Curr Alzheimer Res. 2022;19(10):694-707. doi: 10.2174/1567205020666221020095257. Curr Alzheimer Res. 2022. PMID: 36278440
-
Long read sequencing on its way to the routine diagnostics of genetic diseases.Front Genet. 2024 Mar 6;15:1374860. doi: 10.3389/fgene.2024.1374860. eCollection 2024. Front Genet. 2024. PMID: 38510277 Free PMC article. Review.
-
Reanalysis of Trio Whole-Genome Sequencing Data Doubles the Yield in Autism Spectrum Disorder: De Novo Variants Present in Half.Int J Mol Sci. 2024 Jan 18;25(2):1192. doi: 10.3390/ijms25021192. Int J Mol Sci. 2024. PMID: 38256266 Free PMC article.
-
Long read sequencing enhances pathogenic and novel variation discovery in patients with rare diseases.Nat Commun. 2025 Mar 14;16(1):2500. doi: 10.1038/s41467-025-57695-9. Nat Commun. 2025. PMID: 40087273 Free PMC article.
-
Novel Missense Variant in the SMARCD1 Gene as the Cause of Coffin-Siris Syndrome 11 in a Fetus With Ambiguous Genitalia and Multiple Dysmorphic Features.Prenat Diagn. 2024 Nov;44(12):1522-1525. doi: 10.1002/pd.6683. Epub 2024 Oct 10. Prenat Diagn. 2024. PMID: 39389935 Free PMC article. No abstract available.
References
-
- Miller DT, Adam MP, Aradhya S, Biesecker LG, Brothman AR, Carter NP, Church DM, Crolla JA, Eichler EE, Epstein CJ, Faucett WA, Feuk L, Friedman JM, Hamosh A, Jackson L, Kaminsky EB, Kok K, Krantz ID, Kuhn RM, Lee C, Ostell JM, Rosenberg C, Scherer SW, Spinner NB, Stavropoulos DJ, Tepperberg JH, Thorland EC, Vermeesch JR, Waggoner DJ, Watson MS, Martin CL, Ledbetter DH. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am J Hum Genet. 2010;86:749–764. doi: 10.1016/j.ajhg.2010.04.006. - DOI - PMC - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources