A reference human genome dataset of the BGISEQ-500 sequencer
- PMID: 28379488
- PMCID: PMC5467036
- DOI: 10.1093/gigascience/gix024
A reference human genome dataset of the BGISEQ-500 sequencer
Erratum in
-
Erratum to: A reference human genome dataset of the BGISEQ-500 sequencer.Gigascience. 2018 Dec 1;7(12):giy144. doi: 10.1093/gigascience/giy144. Gigascience. 2018. PMID: 30500904 Free PMC article. No abstract available.
Abstract
BGISEQ-500 is a new desktop sequencer developed by BGI. Using DNA nanoball and combinational probe anchor synthesis developed from Complete Genomics™ sequencing technologies, it generates short reads at a large scale. Here, we present the first human whole-genome sequencing dataset of BGISEQ-500. The dataset was generated by sequencing the widely used cell line HG001 (NA12878) in two sequencing runs of paired-end 50 bp (PE50) and two sequencing runs of paired-end 100 bp (PE100). We also include examples of the raw images from the sequencer for reference. Finally, we identified variations using this dataset, estimated the accuracy of the variations, and compared to that of the variations identified from similar amounts of publicly available HiSeq2500 data. We found similar single nucleotide polymorphism (SNP) detection accuracy for the BGISEQ-500 PE100 data (false positive rate [FPR] = 0.00020%, sensitivity = 96.20%) compared to the PE150 HiSeq2500 data (FPR = 0.00017%, sensitivity = 96.60%) better SNP detection accuracy than the PE50 data (FPR = 0.0006%, sensitivity = 94.15%). But for insertions and deletions (indels), we found lower accuracy for BGISEQ-500 data (FPR = 0.00069% and 0.00067% for PE100 and PE50 respectively, sensitivity = 88.52% and 70.93%) than the HiSeq2500 data (FPR = 0.00032%, sensitivity = 96.28%). Our dataset can serve as the reference dataset, providing basic information not just for future development, but also for all research and applications based on the new sequencing platform.
Keywords: BGISEQ-500; genomics; next-generation sequencing; sequencing.
© The Authors 2017. Published by Oxford University Press.
Figures




Similar articles
-
Comparative analysis of 7 short-read sequencing platforms using the Korean Reference Genome: MGI and Illumina sequencing benchmark for whole-genome sequencing.Gigascience. 2021 Mar 12;10(3):giab014. doi: 10.1093/gigascience/giab014. Gigascience. 2021. PMID: 33710328 Free PMC article.
-
Germline and somatic variant identification using BGISEQ-500 and HiSeq X Ten whole genome sequencing.PLoS One. 2018 Jan 10;13(1):e0190264. doi: 10.1371/journal.pone.0190264. eCollection 2018. PLoS One. 2018. PMID: 29320538 Free PMC article.
-
Comparative analysis of novel MGISEQ-2000 sequencing platform vs Illumina HiSeq 2500 for whole-genome sequencing.PLoS One. 2020 Mar 16;15(3):e0230301. doi: 10.1371/journal.pone.0230301. eCollection 2020. PLoS One. 2020. PMID: 32176719 Free PMC article.
-
Evaluation of the performance of copy number variant prediction tools for the detection of deletions from whole genome sequencing data.J Biomed Inform. 2019 Jun;94:103174. doi: 10.1016/j.jbi.2019.103174. Epub 2019 Apr 6. J Biomed Inform. 2019. PMID: 30965134 Review.
-
Current state-of-art of sequencing technologies for plant genomics research.Brief Funct Genomics. 2012 Jan;11(1):3-11. doi: 10.1093/bfgp/elr045. Brief Funct Genomics. 2012. PMID: 22345601 Review.
Cited by
-
Kinome-Wide RNAi Screen Uncovers Role of Ballchen in Maintenance of Gene Activation by Trithorax Group in Drosophila.Front Cell Dev Biol. 2021 Mar 5;9:637873. doi: 10.3389/fcell.2021.637873. eCollection 2021. Front Cell Dev Biol. 2021. PMID: 33748127 Free PMC article.
-
Whole-exome sequencing identified recurrent and novel variants in benzene-induced leukemia.BMC Med Genomics. 2023 Jan 26;16(1):13. doi: 10.1186/s12920-023-01442-w. BMC Med Genomics. 2023. PMID: 36703207 Free PMC article.
-
A Novel Prognostic Risk Model for Cervical Cancer Based on Immune Checkpoint HLA-G-Driven Differentially Expressed Genes.Front Immunol. 2022 Jul 18;13:851622. doi: 10.3389/fimmu.2022.851622. eCollection 2022. Front Immunol. 2022. PMID: 35924232 Free PMC article.
-
Molecular digitization of a botanical garden: high-depth whole-genome sequencing of 689 vascular plant species from the Ruili Botanical Garden.Gigascience. 2019 Apr 1;8(4):giz007. doi: 10.1093/gigascience/giz007. Gigascience. 2019. PMID: 30689836 Free PMC article.
-
Deconvolution of single-cell multi-omics layers reveals regulatory heterogeneity.Nat Commun. 2019 Jan 28;10(1):470. doi: 10.1038/s41467-018-08205-7. Nat Commun. 2019. PMID: 30692544 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources