The sequence read archive
- PMID: 21062823
- PMCID: PMC3013647
- DOI: 10.1093/nar/gkq1019
The sequence read archive
Abstract
The combination of significantly lower cost and increased speed of sequencing has resulted in an explosive growth of data submitted into the primary next-generation sequence data archive, the Sequence Read Archive (SRA). The preservation of experimental data is an important part of the scientific record, and increasing numbers of journals and funding agencies require that next-generation sequence data are deposited into the SRA. The SRA was established as a public repository for the next-generation sequence data and is operated by the International Nucleotide Sequence Database Collaboration (INSDC). INSDC partners include the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). The SRA is accessible at http://www.ncbi.nlm.nih.gov/Traces/sra from NCBI, at http://www.ebi.ac.uk/ena from EBI and at http://trace.ddbj.nig.ac.jp from DDBJ. In this article, we present the content and structure of the SRA, detail our support for sequencing platforms and provide recommended data submission levels and formats. We also briefly outline our response to the challenge of data growth.
Similar articles
-
The Sequence Read Archive: explosive growth of sequencing data.Nucleic Acids Res. 2012 Jan;40(Database issue):D54-6. doi: 10.1093/nar/gkr854. Epub 2011 Oct 18. Nucleic Acids Res. 2012. PMID: 22009675 Free PMC article.
-
Archiving next generation sequencing data.Nucleic Acids Res. 2010 Jan;38(Database issue):D870-1. doi: 10.1093/nar/gkp1078. Epub 2009 Dec 3. Nucleic Acids Res. 2010. PMID: 19965774 Free PMC article.
-
DDBJ new system and service refactoring.Nucleic Acids Res. 2013 Jan;41(Database issue):D25-9. doi: 10.1093/nar/gks1152. Epub 2012 Nov 24. Nucleic Acids Res. 2013. PMID: 23180790 Free PMC article.
-
The DNA Data Bank of Japan launches a new resource, the DDBJ Omics Archive of functional genomics experiments.Nucleic Acids Res. 2012 Jan;40(Database issue):D38-42. doi: 10.1093/nar/gkr994. Epub 2011 Nov 22. Nucleic Acids Res. 2012. PMID: 22110025 Free PMC article.
-
Efficient compression of SARS-CoV-2 genome data using Nucleotide Archival Format.Patterns (N Y). 2022 Sep 9;3(9):100562. doi: 10.1016/j.patter.2022.100562. Epub 2022 Jul 7. Patterns (N Y). 2022. PMID: 35818472 Free PMC article. Review.
Cited by
-
Geological processes mediate a microbial dispersal loop in the deep biosphere.Sci Adv. 2022 Aug 26;8(34):eabn3485. doi: 10.1126/sciadv.abn3485. Epub 2022 Aug 26. Sci Adv. 2022. PMID: 36026445 Free PMC article.
-
A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches.J Clin Microbiol. 2021 May 19;59(6):e00202-21. doi: 10.1128/JCM.00202-21. Print 2021 May 19. J Clin Microbiol. 2021. PMID: 33789960 Free PMC article.
-
Improving the filtering of false positive single nucleotide variations by combining genomic features with quality metrics.Bioinformatics. 2023 Dec 1;39(12):btad694. doi: 10.1093/bioinformatics/btad694. Bioinformatics. 2023. PMID: 38019945 Free PMC article.
-
Connecting thiamine availability to the microbial community composition in Chinook salmon spawning habitats of the Sacramento River basin.Appl Environ Microbiol. 2024 Jan 24;90(1):e0176023. doi: 10.1128/aem.01760-23. Epub 2023 Dec 12. Appl Environ Microbiol. 2024. PMID: 38084986 Free PMC article.
-
Rust expression browser: an open source database for simultaneous analysis of host and pathogen gene expression profiles with expVIP.BMC Genomics. 2021 Mar 9;22(1):166. doi: 10.1186/s12864-021-07488-3. BMC Genomics. 2021. PMID: 33750297 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources