Test development, optimization and validation of a WGS pipeline for genetic disorders
- PMID: 37020281
- PMCID: PMC10077614
- DOI: 10.1186/s12920-023-01495-x
Abstract
Background: With advances in massively parallel sequencing (MPS) technology, whole-genome sequencing (WGS) has gradually evolved into a first-tier diagnostic test for genetic disorders. However, established deployment practices and pipeline testing for clinical WGS are lacking.
Methods: In this study, we introduce a complete WGS pipeline for genetic disorders that covers the entire process from sample acquisition to clinical reporting. Libraries for all samples that underwent WGS were constructed using polymerase chain reaction (PCR)-free preparation protocols and sequenced on the MGISEQ-2000 platform. Bioinformatics pipelines were developed for the simultaneous detection of multiple variant types, including single nucleotide variants (SNVs), insertions and deletions (indels), copy number variants (CNVs), balanced rearrangements, mitochondrial (MT) variants, and other complex variants such as repeat expansions, pseudogene-related variants and absence of heterozygosity (AOH). A semiautomatic pipeline was developed for the interpretation of potential SNVs and CNVs. Forty-five samples with known variants (14 commercially available positive samples, 23 laboratory-held positive cell lines and 8 clinical cases) were used to validate the whole pipeline.
Results: A complete WGS pipeline for genetic disorders was developed and optimized. Its effectiveness was validated on forty-five samples with known variants (6 with SNVs and indels, 3 with MT variants, 5 with aneuploidies, 1 with triploidy, 23 with CNVs, 5 with balanced rearrangements, 2 with repeat expansions, 1 with AOH, and 1 with a deletion of exons 7-8 of the SMN1 gene).
Conclusions: This study serves as a pilot for the test development, optimization, and validation of a WGS pipeline for genetic disorders. A set of best practices is recommended based on our pipeline, along with a dataset of positive samples for benchmarking.
Keywords: Bioinformatics pipelines; Clinical diagnosis; Genetic disorders; Whole genome sequencing.
© 2023. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.