Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jan;40(Database issue):D54-6.
doi: 10.1093/nar/gkr854. Epub 2011 Oct 18.

The Sequence Read Archive: explosive growth of sequencing data

Affiliations

The Sequence Read Archive: explosive growth of sequencing data

Yuichi Kodama et al. Nucleic Acids Res. 2012 Jan.

Abstract

New generation sequencing platforms are producing data with significantly higher throughput and lower cost. A portion of this capacity is devoted to individual and community scientific projects. As these projects reach publication, raw sequencing datasets are submitted into the primary next-generation sequence data archive, the Sequence Read Archive (SRA). Archiving experimental data is the key to the progress of reproducible science. The SRA was established as a public repository for next-generation sequence data as a part of the International Nucleotide Sequence Database Collaboration (INSDC). INSDC is composed of the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). The SRA is accessible at www.ncbi.nlm.nih.gov/sra from NCBI, at www.ebi.ac.uk/ena from EBI and at trace.ddbj.nig.ac.jp from DDBJ. In this article, we present the content and structure of the SRA and report on updated metadata structures, submission file formats and supported sequencing platforms. We also briefly outline our various responses to the challenge of explosive data growth.

PubMed Disclaimer

Similar articles

  • The sequence read archive.
    Leinonen R, Sugawara H, Shumway M; International Nucleotide Sequence Database Collaboration. Leinonen R, et al. Nucleic Acids Res. 2011 Jan;39(Database issue):D19-21. doi: 10.1093/nar/gkq1019. Epub 2010 Nov 9. Nucleic Acids Res. 2011. PMID: 21062823 Free PMC article.
  • Archiving next generation sequencing data.
    Shumway M, Cochrane G, Sugawara H. Shumway M, et al. Nucleic Acids Res. 2010 Jan;38(Database issue):D870-1. doi: 10.1093/nar/gkp1078. Epub 2009 Dec 3. Nucleic Acids Res. 2010. PMID: 19965774 Free PMC article.
  • DDBJ new system and service refactoring.
    Ogasawara O, Mashima J, Kodama Y, Kaminuma E, Nakamura Y, Okubo K, Takagi T. Ogasawara O, et al. Nucleic Acids Res. 2013 Jan;41(Database issue):D25-9. doi: 10.1093/nar/gks1152. Epub 2012 Nov 24. Nucleic Acids Res. 2013. PMID: 23180790 Free PMC article.
  • The evolution of dbSNP: 25 years of impact in genomic research.
    Phan L, Zhang H, Wang Q, Villamarin R, Hefferon T, Ramanathan A, Kattman B. Phan L, et al. Nucleic Acids Res. 2025 Jan 6;53(D1):D925-D931. doi: 10.1093/nar/gkae977. Nucleic Acids Res. 2025. PMID: 39530225 Free PMC article. Review.
  • Efficient compression of SARS-CoV-2 genome data using Nucleotide Archival Format.
    Kryukov K, Jin L, Nakagawa S. Kryukov K, et al. Patterns (N Y). 2022 Sep 9;3(9):100562. doi: 10.1016/j.patter.2022.100562. Epub 2022 Jul 7. Patterns (N Y). 2022. PMID: 35818472 Free PMC article. Review.

Cited by

References

    1. Shumway M, Cochrane G, Sugawara H. Archiving next generation sequencing data. Nucleic Acids Res. 2010;38:D870–D871. - PMC - PubMed
    1. Leinonen R, Sugawara H, Shumway M. The sequence read archive. Nucleic Acids Res. 2011;39:D19–D21. - PMC - PubMed
    1. Karsch-Mizrachi I, Nakamura Y, Cochrane G. The International Nucleotide Sequence Database Collaboration. Nucleic Acids Res. 2012;40:D33–D37. - PMC - PubMed
    1. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. - PMC - PubMed
    1. Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, et al. NCBI GEO: archive for functional genomics data sets–10 years on. Nucleic Acids Res. 2011;39:D1005–D1010. - PMC - PubMed

Publication types