Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Oct 2:2023.09.30.560263.
doi: 10.1101/2023.09.30.560263.

Admix-kit: An Integrated Toolkit and Pipeline for Genetic Analyses of Admixed Populations

Affiliations

Admix-kit: An Integrated Toolkit and Pipeline for Genetic Analyses of Admixed Populations

Kangcheng Hou et al. bioRxiv. .

Update in

  • Admix-kit: an integrated toolkit and pipeline for genetic analyses of admixed populations.
    Hou K, Gogarten S, Kim J, Hua X, Dias JA, Sun Q, Wang Y, Tan T; Polygenic Risk Methods in Diverse Populations (PRIMED) Consortium Methods Working Group; Atkinson EG, Martin A, Shortt J, Hirbo J, Li Y, Pasaniuc B, Zhang H. Hou K, et al. Bioinformatics. 2024 Mar 29;40(4):btae148. doi: 10.1093/bioinformatics/btae148. Bioinformatics. 2024. PMID: 38490256 Free PMC article.

Abstract

Admixed populations, with their unique and diverse genetic backgrounds, are often underrepresented in genetic studies. This oversight not only limits our understanding but also exacerbates existing health disparities. One major barrier has been the lack of efficient tools tailored for the special challenges of genetic study of admixed populations. Here, we present admix-kit, an integrated toolkit and pipeline for genetic analyses of admixed populations. Admix-kit implements a suite of methods to facilitate genotype and phenotype simulation, association testing, genetic architecture inference, and polygenic scoring in admixed populations.

PubMed Disclaimer

Conflict of interest statement

Competing interests The authors declare no competing interests.

Figures

Figure 1:
Figure 1:. Overview of admix-kit’s data structure, functionality, and illustrative analyses using a simulated dataset
(a) Local ancestry and phased genotypes are stored in matrix format. Individual-specific and SNP-specific covariates are stored as two tables with matching orders. (b) Analysis based on ancestry-specific genotype dosage. Starting with a phased genotype for an individual (0/1 denotes presence of minor allele, blue/red color denotes European/African local ancestry), genotypes are separated into ancestry-specific dosages. Local ancestry-informed downstream analyses can be subsequently performed. (c) visualization of local ancestry tracts. (d) Consistency of genome-wide genetic ancestry of simulated dataset (e) Consistency of allele frequencies from the simulated admixed genotypes.

References

    1. Atkinson E.G. et al. (2021) Tractor uses local ancestry to enable the inclusion of admixed individuals in GWAS and to boost power. Nat. Genet., 53, 195–204. - PMC - PubMed
    1. Ding Y. et al. (2023) Polygenic scoring accuracy varies across the genetic ancestry continuum. Nature, 618, 774–781. - PMC - PubMed
    1. Hou K. et al. (2023) Causal effects on complex traits are similar for common variants across segments of different continental ancestries within admixed individuals. Nat. Genet., 55, 549–558. - PMC - PubMed
    1. Hou K. et al. (2021) On powerful GWAS in admixed populations. Nat. Genet., 53, 1631–1633. - PMC - PubMed
    1. Kachuri L. et al. (2023) Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. - PMC - PubMed

Publication types