Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Jan 1;29(1):137-40.
doi: 10.1093/nar/29.1.137.

RefSeq and LocusLink: NCBI gene-centered resources

Affiliations

RefSeq and LocusLink: NCBI gene-centered resources

K D Pruitt et al. Nucleic Acids Res. .

Abstract

Thousands of genes have been painstakingly identified and characterized a few genes at a time. Many thousands more are being predicted by large scale cDNA and genomic sequencing projects, with levels of evidence ranging from supporting mRNA sequence and comparative genomics to computing ab initio models. This, coupled with the burgeoning scientific literature, makes it critical to have a comprehensive directory for genes and reference sequences for key genomes. The NCBI provides two resources, LocusLink and RefSeq, to meet these needs. LocusLink organizes information around genes to generate a central hub for accessing gene-specific information for fruit fly, human, mouse, rat and zebrafish. RefSeq provides reference sequence standards for genomes, transcripts and proteins; human, mouse and rat mRNA RefSeqs, and their corresponding proteins, are discussed here. Together, RefSeq and LocusLink provide a non-redundant view of genes and other loci to support research on genes and gene families, variation, gene expression and genome annotation. Additional information about LocusLink and RefSeq is available at http://www.ncbi.nlm.nih.gov/LocusLink/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
RefSeq and LocusLink growth over a 1-year period. (A) The number of RefSeq records for human, mouse and rat for mid-September of 1999 and 2000. These numbers include genes for which multiple reference sequences are provided to represent splice variants and their products. As of September 2000, 10 301 human reference sequence mRNAs (and their corresponding proteins) are provided for 9954 genes. The number of human RefSeq records in three categories is also indicated (Hs, human); Genome Annotation reference sequence numbers are not included. Rat, mouse and predicted human RefSeq records became available after September 1999. (B) Loci available in LocusLink for September of 1999 and 2000. Mouse, rat, zebrafish and fruit fly genomes were added after September 1999. Of the 17 214 known human loci, 14 103 have some associated sequence data (including ESTs), and 9954 loci have at least one mRNA and protein reference sequence.
Figure 1
Figure 1
RefSeq and LocusLink growth over a 1-year period. (A) The number of RefSeq records for human, mouse and rat for mid-September of 1999 and 2000. These numbers include genes for which multiple reference sequences are provided to represent splice variants and their products. As of September 2000, 10 301 human reference sequence mRNAs (and their corresponding proteins) are provided for 9954 genes. The number of human RefSeq records in three categories is also indicated (Hs, human); Genome Annotation reference sequence numbers are not included. Rat, mouse and predicted human RefSeq records became available after September 1999. (B) Loci available in LocusLink for September of 1999 and 2000. Mouse, rat, zebrafish and fruit fly genomes were added after September 1999. Of the 17 214 known human loci, 14 103 have some associated sequence data (including ESTs), and 9954 loci have at least one mRNA and protein reference sequence.

References

    1. White J.A., McAlpine,P.J., Antonarakis,S., Cann,H., Eppig,J.T., Frazer,K., Frezal,J., Lancet,D., Nahmias,J., Pearson,P., Peters,J., Scott,A., Scott,H., Spurr,N., Talbot,C.,Jr. and Povey,S. (1997) Guidelines for human gene nomenclature (1997). HUGO Nomenclature Committee. Genomics, 45, 468–471. - PubMed
    1. Hamosh A., Scott,A.F., Amberger,J., Valle,D. and McKusick,V.A. (2000) Online Mendelian Inheritance in Man (OMIM). Hum. Mutat., 15, 57–61. - PubMed
    1. Blake J.A., Eppig,J.T., Richardson,J.E. and Davisson,M.T. (2000) The Mouse Genome Database (MGD): expanding genetic and genomic resources for the laboratory mouse. The Mouse Genome Database Group. Nucleic Acids Res., 28, 108–111. Updated article in this issue: Nucleic Acids Res. (2001), 29, 91–94. - PMC - PubMed
    1. FlyBase Consortium (1999) The FlyBase database of the Drosophila Genome Projects and community literature. The FlyBase Consortium. Nucleic Acids Res., 27, 85–88. - PMC - PubMed
    1. Westerfield M., Doerry,E., Kirkpatric,A.E. and Douglas,S.A. (1999) Zebrafish informatics and the ZFIN database. Methods Cell Biol., 60, 339–355. - PubMed