Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 29;40(4):btae148.
doi: 10.1093/bioinformatics/btae148.

Admix-kit: an integrated toolkit and pipeline for genetic analyses of admixed populations

Collaborators, Affiliations

Admix-kit: an integrated toolkit and pipeline for genetic analyses of admixed populations

Kangcheng Hou et al. Bioinformatics. .

Abstract

Summary: Admixed populations, with their unique and diverse genetic backgrounds, are often underrepresented in genetic studies. This oversight not only limits our understanding but also exacerbates existing health disparities. One major barrier has been the lack of efficient tools tailored for the special challenges of genetic studies of admixed populations. Here, we present admix-kit, an integrated toolkit and pipeline for genetic analyses of admixed populations. Admix-kit implements a suite of methods to facilitate genotype and phenotype simulation, association testing, genetic architecture inference, and polygenic scoring in admixed populations.

Availability and implementation: Admix-kit package is open-source and available at https://github.com/KangchengHou/admix-kit. Additionally, users can use the pipeline designed for admixed genotype simulation available at https://github.com/UW-GAC/admix-kit_workflow.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
Overview of admix-kit’s data structure, functionality, and illustrative analyses using a simulated dataset. (a) Local ancestry and phased genotypes are stored in matrix format. Individual-specific and SNP-specific covariates are stored as two tables with matching orders. (b) Analysis based on ancestry-specific genotype dosage. Starting with a phased genotype for an individual (0/1 denotes presence of minor allele), genotypes are separated into ancestry-specific dosages. Local ancestry-informed downstream analyses can be subsequently performed. (c) visualization of local ancestry tracts. (d) Consistency of genome-wide genetic ancestry of simulated dataset using YRI and CEU in 1000 Genomes as reference populations. (e) Consistency of allele frequencies from the simulated admixed genotypes.

Update of

References

    1. Atkinson EG, Maihofer AX, Kanai M et al. Tractor uses local ancestry to enable the inclusion of admixed individuals in GWAS and to boost power. Nat Genet 2021;53:195–204. - PMC - PubMed
    1. Auton A, Brooks LD, Durbin RM et al.; 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 2015;526:68–74. - PMC - PubMed
    1. Bitarello BD, Mathieson I. Polygenic scores for height in admixed populations. G3 (Bethesda) 2020;10:4027–36. - PMC - PubMed
    1. Cai M, Xiao J, Zhang S et al. A unified framework for cross-population trait prediction by leveraging the genetic correlation of polygenic traits. Am J Hum Genet 2021;108:632–55. - PMC - PubMed
    1. Ding Y, Hou K, Xu Z et al. Polygenic scoring accuracy varies across the genetic ancestry continuum. Nature 2023;618:774–81. - PMC - PubMed