Nucleic Acids Res. 2025 Jan 6;53(D1):D56-D61.
doi: 10.1093/nar/gkae1114.

GenBank 2025 update

Eric W Sayers et al. Nucleic Acids Res. 2025.

Abstract

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public data repository that contains 34 trillion base pairs from over 4.7 billion nucleotide sequences for 581 000 formally described species. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. We summarize the content of the database in 2025 and recent updates such as accelerated processing of influenza sequences and the ability to upload feature tables to Submission Portal for messenger RNA sequences. We provide an overview of the web, application programming and command-line interfaces that allow users to access GenBank data. We also discuss the importance of creating BioProject and BioSample records during submissions, particularly for viruses and metagenomes. Finally, we summarize educational materials and recent community outreach efforts.
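As one illustration of the application programming interfaces mentioned above, the following is a minimal sketch of retrieving a GenBank flat file through the NCBI E-utilities `efetch` endpoint. The helper names `build_efetch_url` and `fetch_record` are hypothetical and not taken from the article, and the accession U49845 is used only as an example input; the article may describe other access routes.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Base URL of the NCBI E-utilities efetch service.
EUTILS_EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession, rettype="gb"):
    """Build an efetch URL for one nucleotide record (hypothetical helper).

    rettype="gb" requests the GenBank flat file; "fasta" requests FASTA.
    """
    params = {
        "db": "nuccore",      # nucleotide database that holds GenBank records
        "id": accession,
        "rettype": rettype,
        "retmode": "text",
    }
    return f"{EUTILS_EFETCH}?{urlencode(params)}"


def fetch_record(accession, rettype="gb"):
    """Download the record as text. Requires network access."""
    with urlopen(build_efetch_url(accession, rettype), timeout=30) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # Only prints the URL; call fetch_record() to download the record itself.
    print(build_efetch_url("U49845"))
```

Separating URL construction from the download keeps the request easy to inspect before any network call is made.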

Figures

Graphical Abstract
Figure 1. Growth of GenBank recorded in both base pairs (circles) and the number of sequence records (triangles). Each point represents the GenBank release in August of each year, starting with release 173 (August 2009).
Figure 2. Taxonomy page in NCBI Datasets for the family Suidae providing easy access to available genomes, nucleotide and protein sequences, SRA data, BioProjects and more, in addition to the other taxonomic nodes in the lineage of Suidae.
