A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog
- PMID: 29448949
- PMCID: PMC5815218
- DOI: 10.1186/s13059-018-1396-2
A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog
Abstract
The accurate description of ancestry is essential to interpret, access, and integrate human genomics data, and to ensure that these benefit individuals from all ancestral backgrounds. However, there are no established guidelines for the representation of ancestry information. Here we describe a framework for the accurate and standardized description of sample ancestry, and validate it by application to the NHGRI-EBI GWAS Catalog. We confirm known biases and gaps in diversity, and find that African and Hispanic or Latin American ancestry populations contribute a disproportionately high number of associations. It is our hope that widespread adoption of this framework will lead to improved analysis, interpretation, and integration of human genomics data.
Keywords: Ancestry; Diversity; GWAS Catalog; Genome-wide association studies; Genomics; Population genetics.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
PF is a member of the Scientific Advisory Board of Omicia, Inc.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures
References
-
- GWAS Catalog. http://www.ebi.ac.uk/gwas/. Accessed 4 Aug 2017.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
