Extensive sequencing of seven human genomes to characterize benchmark reference materials
- PMID: 27271295
- PMCID: PMC4896128
- DOI: 10.1038/sdata.2016.25
Extensive sequencing of seven human genomes to characterize benchmark reference materials
Abstract
The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.
Conflict of interest statement
F.C., E.J., A.M. are employees of Illumina. K.P., W.S., T.L., M.S., Z.D., A.H., and H.C. are employees of BioNano Genomics. P.M., S.K.-P., G.S.Y.Z., M.S.-L., H.S.O., and P.A.M. are employees of 10X Genomics. R.M.T., C.C.C., and N.G. are employees of BGI-Complete Genomics. K.Z., S.G., F.H., and Y.F. are employees of Thermo Fisher Scientific.
Figures





Comment on
-
Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.Nat Biotechnol. 2014 Mar;32(3):246-51. doi: 10.1038/nbt.2835. Epub 2014 Feb 16. Nat Biotechnol. 2014. PMID: 24531798
-
svclassify: a method to establish benchmark structural variant calls.BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2. BMC Genomics. 2016. PMID: 26772178 Free PMC article.
References
Data Citations
-
- Zook J. M. 2015. NCBI SRA. SRX1049768–SRX1049855
-
- Zook J. M. 2015. NCBI SRA. SRX847862–SRX848317
-
- Zook J. M. 2015. NCBI SRA. SRX1388368–SRX1388459
-
- Zook J. M. 2015. NCBI SRA. SRX1388732–SRX138874359
-
- Sheng Y. 2015. NCBI SRA. SRP047086
References
-
- Rasberry S. D. & Gills T. E. The certification, development and use of standard reference materials. Spectrochim. Acta Part B At. Spectrosc. 46, 1577–1582 (1991).
-
- Mackey E. A. et al. Certification of NIST Standard Reference Material 1575a Pine Needles and Results of an International Laboratory Comparison. NIST Special Publication 260–156 (2004).
-
- Zook J. M. et al. Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls. Nat. Biotechnol. 32, 246–251 (2014). - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials