Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Oct;39(10):1181-6.
doi: 10.1038/ng1007-1181.

The NCBI dbGaP database of genotypes and phenotypes

Affiliations

The NCBI dbGaP database of genotypes and phenotypes

Matthew D Mailman et al. Nat Genet. 2007 Oct.

Abstract

The National Center for Biotechnology Information has created the dbGaP public repository for individual-level phenotype, exposure, genotype and sequence data and the associations between them. dbGaP assigns stable, unique identifiers to studies and subsets of information from those studies, including documents, individual phenotypic variables, tables of trait data, sets of genotype data, computed phenotype-genotype associations, and groups of study subjects who have given similar consents for use of their data.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Accession numbers in dbGaP
are created separately for a study and its subordinate objects: phenotype variables, phenotype trait tables, documents, and genotype datasets, prefixed a phs, phv, pht, phd, and phg, respectively. Accession numbers have suffixes “v” for data version, “p” for participant set version, and “c” for consent group version. Phenotype and genotype data are distributed as both public summary records and individual-level data that require authorization to use. Associations between genotypes and a phenotype trait of interest are not subordinate objects of a study, as an analysis may include data components from several studies simultaneously.

References

    1. The GAIN Collaborative Research Group. New Models of Collaboration in Genome-Wide Association Studies: The Genetic Association Information Network (GAIN) Nat Genet. 2007;39:1045–1051. - PubMed
    1. Lowrance WW, Collins FS. ETHICS: Identifiability in Genomic Research. Genomic data are unique to the individual and must be managed with care to maintain public trust. Science. 2007;317:600–602. - PubMed
    1. Harold ER. XML Bible. 2. Hungry Minds; New York, Indianapolis, Cleveland: 2001.
    1. Bray Tim, Paoli Jean, Sperberg-McQueen CM, Maler Eve, Yergeau Francois., editors. World Wide Web Consortium (W3C) Extensible Markup Language (XML) 1.0. Fourth. 2006. http://www.w3.org/TR/REC-xml/
    1. Clayton DG, Walker NM, Smyth DJ, Pask R, Cooper JD, Maier LM, Smink LJ, Lam AC, Ovington NR, Stevens HE, Nutland S, Howson JM, Faham M, Moorhead M, Jones HB, Falkowski M, Hardenbol P, Willis TD, Todd JA. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet. 2005;37:1243–1246. - PubMed

Publication types