Public human microbiome data are dominated by highly developed countries
- PMID: 35167588
- PMCID: PMC8846514
- DOI: 10.1371/journal.pbio.3001536
Abstract
The importance of sampling from globally representative populations has been well established in human genomics. In human microbiome research, however, we lack a full understanding of the global distribution of sampling in research studies. This information is crucial to better understand global patterns of microbiome-associated diseases and to extend the health benefits of this research to all populations. Here, we analyze the country of origin of all 444,829 human microbiome samples that are available from the world's 3 largest genomic data repositories, including the Sequence Read Archive (SRA). The samples are from 2,592 studies of 19 body sites, including 220,017 samples of the gut microbiome. We show that more than 71% of samples with a known origin come from Europe, the United States, and Canada, including 46.8% from the US alone, despite the country representing only 4.3% of the global population. We also find that central and southern Asia is the most underrepresented region: Countries such as India, Pakistan, and Bangladesh account for more than a quarter of the world population but make up only 1.8% of human microbiome samples. These results demonstrate a critical need to ensure more global representation of participants in microbiome studies.
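The percentages above can be turned into a rough measure of over- or underrepresentation. The sketch below is illustrative only: the "representation ratio" (share of samples divided by share of world population) and the function name `representation_ratio` are assumptions introduced here, not a metric defined in the abstract. Because the abstract gives the population share of central and southern Asia only as "more than a quarter", the 25% figure is a lower bound, so the resulting ratio is an upper bound.

```python
def representation_ratio(sample_share_pct: float, population_share_pct: float) -> float:
    """Share of samples divided by share of world population.

    A value of 1.0 would mean sampling proportional to population.
    This metric is an illustrative assumption, not one taken from the paper.
    """
    if population_share_pct <= 0:
        raise ValueError("population share must be positive")
    return sample_share_pct / population_share_pct


# Figures quoted in the abstract.
us_ratio = representation_ratio(46.8, 4.3)

# "More than a quarter" of world population: 25.0 is a lower bound on the
# population share, so this ratio is an upper bound.
csa_ratio_upper = representation_ratio(1.8, 25.0)

# Fraction of all samples that are gut microbiome samples.
gut_fraction = 220_017 / 444_829

print(f"US representation ratio: {us_ratio:.1f}")
print(f"Central/southern Asia ratio (upper bound): {csa_ratio_upper:.3f}")
print(f"Gut samples as fraction of all samples: {gut_fraction:.3f}")
```

Under these assumptions, the US is sampled at roughly eleven times its population share, while central and southern Asia is sampled at less than a tenth of its share. Note that the 46.8% figure refers to samples with a known origin, so the two ratios may not share exactly the same denominator.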
Conflict of interest statement
The authors have declared that no competing interests exist.
Similar articles
- Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle. Cell. 2019 Jan 24;176(3):649-662.e20. doi: 10.1016/j.cell.2019.01.001. Epub 2019 Jan 17. PMID: 30661755. Free PMC article.
- Human reference gut microbiome catalog including newly assembled genomes from under-represented Asian metagenomes. Genome Med. 2021 Aug 27;13(1):134. doi: 10.1186/s13073-021-00950-7. PMID: 34446072. Free PMC article.
- Predicting drug-metagenome interactions: Variation in the microbial β-glucuronidase level in the human gut metagenomes. PLoS One. 2021 Jan 7;16(1):e0244876. doi: 10.1371/journal.pone.0244876. eCollection 2021. PMID: 33411719. Free PMC article.
- Genome-resolved metagenomics: a game changer for microbiome medicine. Exp Mol Med. 2024 Jul;56(7):1501-1512. doi: 10.1038/s12276-024-01262-7. Epub 2024 Jul 1. PMID: 38945961. Free PMC article. Review.
- Metagenomics: a path to understanding the gut microbiome. Mamm Genome. 2021 Aug;32(4):282-296. doi: 10.1007/s00335-021-09889-x. Epub 2021 Jul 14. PMID: 34259891. Free PMC article. Review.
Cited by
- Pangenome comparison of Bacteroides fragilis genomospecies unveils genetic diversity and ecological insights. mSystems. 2024 Jul 23;9(7):e0051624. doi: 10.1128/msystems.00516-24. Epub 2024 Jun 27. PMID: 38934546. Free PMC article.
- The gut microbiome and hypertension. Nat Rev Nephrol. 2023 Mar;19(3):153-167. doi: 10.1038/s41581-022-00654-0. Epub 2023 Jan 11. PMID: 36631562. Review.
- Maternal Psychosocial Stress Is Associated with Reduced Diversity in the Early Infant Gut Microbiome. Microorganisms. 2023 Apr 8;11(4):975. doi: 10.3390/microorganisms11040975. PMID: 37110398. Free PMC article.
- Microbial community-scale metabolic modelling predicts personalized short-chain fatty acid production profiles in the human gut. Nat Microbiol. 2024 Jul;9(7):1700-1712. doi: 10.1038/s41564-024-01728-4. Epub 2024 Jun 24. PMID: 38914826. Free PMC article.
- The African Human Microbiome Portal: a public web portal of curated metagenomic metadata. Database (Oxford). 2024 Jan 10;2024:baad092. doi: 10.1093/database/baad092. PMID: 38204360. Free PMC article.