Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 29:9:e56029.
doi: 10.7554/eLife.56029.

Recent shifts in the genomic ancestry of Mexican Americans may alter the genetic architecture of biomedical traits

Affiliations

Recent shifts in the genomic ancestry of Mexican Americans may alter the genetic architecture of biomedical traits

Melissa L Spear et al. Elife. .

Abstract

People in the Americas represent a diverse continuum of populations with varying degrees of admixture among African, European, and Amerindigenous ancestries. In the United States, populations with non-European ancestry remain understudied, and thus little is known about the genetic architecture of phenotypic variation in these populations. Using genotype data from the Hispanic Community Health Study/Study of Latinos, we find that Amerindigenous ancestry increased by an average of ~20% spanning 1940s-1990s in Mexican Americans. These patterns result from complex interactions between several population and cultural factors which shaped patterns of genetic variation and influenced the genetic architecture of complex traits in Mexican Americans. We show for height how polygenic risk scores based on summary statistics from a European-based genome-wide association study perform poorly in Mexican Americans. Our findings reveal temporal changes in population structure within Hispanics/Latinos that may influence biomedical traits, demonstrating a need to improve our understanding of admixed populations.

Keywords: admixture; complex traits; evolutionary biology; genetic architecture; human; population genetics.

PubMed Disclaimer

Conflict of interest statement

MS, AD, EZ, JY, SG, DT, RH No competing interests declared

Figures

Figure 1.
Figure 1.. Genomic ancestry and population structure in HCHS/SOL.
(A) Ternary plot of HCHS/SOL (n = 10,268) colored by admixture proportions. (B) Ternary plot of global ancestry proportions colored by population for 10,268 HCHS/SOL individuals (C) Uniform Manifold Approximation and Projection (UMAP) plot depicting the genetic diversity of HCHS/SOL and the reference panel (n = 10,591) using three principal components, colored by admixture proportions Within the legend, AFR, EUR, and AI refer to African, European, and Amerindigenous global ancestries, respectively. (D) UMAP plot of HCHS/SOL and the reference panel (n = 10,591) using three principal components, colored by HCHS/SOL population.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Ancestral diversity of HCHS/SOL populations.
(A) Scree plot of principal component analysis of HCHS/SOL and the reference panel (n = 10,591). (B) Uniform Manifold Approximation and Projection (UMAP) plot of HCHS/SOL and the reference panel (n = 10,591) using three principal components, colored by reference population. (C) Uniform Manifold Approximation and Projection (UMAP) plot of HCHS/SOL only (n = 10,591) using three principal components, colored by population. (D) Uniform Manifold Approximation and Projection (UMAP) plot of HCHS/SOL and the larger reference panel (n = 11,567) using three principal components, colored by HCHS/SOL population (E) Uniform Manifold Approximation and Projection (UMAP) plot of HCHS/SOL and the larger reference panel (n = 11,567) using three principal components, colored by African population (F) Uniform Manifold Approximation and Projection (UMAP) plot of HCHS/SOL and the larger reference panel (n = 11,567) using three principal components, colored by Amerindigenous population (G) Uniform Manifold Approximation and Projection (UMAP) plot of HCHS/SOL and the larger reference panel (n = 11,567) using three principal components, colored by European population.
Figure 2.
Figure 2.. Amerindigenous ancestry has increased over time in Mexican Americans.
(A) Global Amerindigenous ancestry proportions plotted by birth year for Mexican Americans (n = 3,622). Fitted line is multiple regression of Amerindigenous ~ birth year + sampling weight. Bars represent 95% confidence intervals for individuals grouped by decade. (B) Bootstrap resampling (n = 1000 iterations) of Amerindigenous global ancestry for the Mexican American individuals with a fitted LOESS curve for each iteration. Dashed lines represent the 95% quantile range of LOESS curves and the blue line represents the fitted regression line from A.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Concordance of ADMIXTURE and RFMix global ancestry estimates.
(A) Amerindigenous ancestry (B) African ancestry and (C) European ancestry.
Figure 2—figure supplement 2.
Figure 2—figure supplement 2.. Amerindigenous ancestry has increased over time in Mexican Americans.
RFMix inferred Amerindigenous (AI) global ancestry proportions plotted by birth year for Mexican Americans (n = 3,622). Fitted line is multiple regression of AI global ancestry ~birth year + sampling weight (β=0.0022; SE = 0.0002, p<2E-16). Bars represent 95% confidence intervals for individuals grouped by decade.
Figure 2—figure supplement 3.
Figure 2—figure supplement 3.. Distributions of Amerindigenous global ancestry means for HCHS/SOL Mexican Americans (n = 3622) generated by 1000 bootstrap resampling iterations within each decade of binned birth years.
Figure 2—figure supplement 4.
Figure 2—figure supplement 4.. Replication in the Health and Retirement Study for 705 self-identified Mexican Americans.
(A) Ancestry over time (B) Distribution of regression slopes after 1000 bootstrap resampling iterations (C) Distribution of bootstrap regression p-values (D) ECDF of bootstrap regression p-values.
Figure 2—figure supplement 5.
Figure 2—figure supplement 5.. The increase in estimated AI ancestry over time is conditional on the number of US-born parents.
AI ancestry vs birth year with interaction between birth year and number of US-born parents for 634 HCHS/SOL Mexicans.
Figure 3.
Figure 3.. Architecture of genetic diversity in Mexican American Genomes.
(A) Genetic diversity (π) in Amerindigenous ancestry tracts stratified by US-born/not US-born status, and calculated between pairs of individuals born within each decade (with shaded envelopes showing 95% confidence intervals for each group). (B) Proportion of total Amerindigenous (AI) ancestral tracts in the HCHS/SOL Mexican American population by decade. (C) Variation in ROH by birth year. Solid lines show LOESS of the proportion of the genome with AI ancestry that overlap ROH of different lengths, while dotted lines show LOESS of the proportion of the genome with European ancestry that overlap ROH of different lengths. (D) Scatter plot of parents’ inferred global Amerindigenous (AI) ancestries using ANCESTOR.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. FST within Amerindigenous ancestral tracts.
FST estimates calculated between each decade group. Bars represent the 95% CI.
Figure 3—figure supplement 2.
Figure 3—figure supplement 2.. Admixture mapping in HCHS/SOL Mexicans (n = 3622) for Amerindigenous ancestry and (A) birth year and (B) generation.
Ancestry association testing was performed at 211,151 markers using (A) linear regression and (B) logistic regression, both including global Amerindigenous ancestry, sampling weight and center as covariates.
Figure 3—figure supplement 3.
Figure 3—figure supplement 3.. Runs of homozygosity (ROH) in HCHS/SOL Mexican Americans.
(A) ROH (summed per person) across all ancestries separated by ROH class (B) ROH (summed per person) overlapping Amerindigenous (AI) haplotypes separated by ROH class. (C) Proportion of the genome covered by total AI ROH separated by ROH class.
Figure 3—figure supplement 4.
Figure 3—figure supplement 4.. Ancestry-related assortative mating in HCHS/SOL Mexican Americans.
Each distribution represents the difference in inferred parental Amerindigenous (AI) ancestry for each decade for (A) All (B) US-born and (c) Non-US-born. Within each segment is the correlation of parents inferred AI ancestry. Parental ancestry was inferred using ANCESTOR.
Figure 3—figure supplement 5.
Figure 3—figure supplement 5.. Standard neutral model simulations result in no change in ancestry proportions over time.
Blue lines show forward simulations while gray lines reproduce the LOESS curves from the observed data shown in Figure 2B.
Figure 3—figure supplement 6.
Figure 3—figure supplement 6.. Population growth does not affect the mean ancestry proportions in a population.
(A) The exponential growth rates evaluated. (B–E) The effect of increasing growth rates G={0, 0.1, 0.5, 1} on ancestry proportions.
Figure 3—figure supplement 7.
Figure 3—figure supplement 7.. Ancestry-based assortative mating does not change mean ancestry proportions, though variance in ancestry proportions can increase.
(A–B) Low effect of assortative mating, (C–D) moderate effect (with similar correlation to that seen in HCHS Mexican Americans, see main text), and (E–F) extreme assortative mating.
Figure 3—figure supplement 8.
Figure 3—figure supplement 8.. Ancestry-based fecundity differences can induce systematic changes in ancestry proportions in a population.
(A) We model the probability of reproducing (‘Prob Reprod’) using a Beta distribution over the ranked ancestry proportions in the population using parameter FAI. (B–E) As ancestry-based fecundity increases, the mean ancestry proportion in the population increases. (F–I) Ancestry-based assortative mating magnifies the effects of ancestry-based fecundity differences (here AM = 0.75, see Figure 3—figure supplement 7).
Figure 3—figure supplement 9.
Figure 3—figure supplement 9.. The ancestry proportions in the migrant population are modeled as a Beta distribution, with mean given by a weighted average between the domestic population at one with weight mAI.
When mAI=0, migrants have the same distribution of ancestry proportions as the domestic population. When mAI=1, all migrants have 100% Amerindigenous ancestry.
Figure 3—figure supplement 10.
Figure 3—figure supplement 10.. Simulating the effects of migration on changing ancestry proportions.
We show how the ancestry proportions in the domestic population change as we increase M (the probability that a new individual is migrant) and mAI (the parameter that governs the ancestry proportions in the migrant population, see Figure 3—figure supplement 9).
Figure 3—figure supplement 11.
Figure 3—figure supplement 11.. Similar to Figure 3—figure supplement 10, but adding assortative mating (AM=0.75, consistent with our data) and ancestry-based fecundity differences (FAI=0.1, see Figure 3—figure supplement 8A).
Figure 3—figure supplement 12.
Figure 3—figure supplement 12.. Similar to Figure 3—figure supplement 10, but adding assortative mating (AM=0.75, consistent with our data) and ancestry-based fecundity differences (FAI=0.2, see Figure 3—figure supplement 8A).
Figure 3—figure supplement 13.
Figure 3—figure supplement 13.. Similar to Figure 3—figure supplement 10, but adding assortative mating (AM=0.75, consistent with our data) and ancestry-based fecundity differences (FAI=0.4, see Figure 3—figure supplement 8A).
Figure 3—figure supplement 14.
Figure 3—figure supplement 14.. Similar to Figure 3—figure supplement 10, but adding assortative mating (AM=0.75, consistent with our data) and ancestry-based fecundity differences (FAI=0.8, see Figure 3—figure supplement 8A).
Figure 4.
Figure 4.. Global Amerindigenous ancestry and biomedical traits in HCHS/SOL Mexican Americans.
(A) The effect size of global AI ancestry on each of 69 quantile normalized traits (see Materials and methods) while controlling for birth year, center, gender, sampling weight, educational attainment, US-born status, and number of US-born parents. (B–C) The relationship between (B) Birth year and height and (C) Height and polygenic height score (PHS). The black line indicates the fitted linear model for all individuals. Each color represents a different quartile of Amerindigenous global ancestry. Polygenic height scores were assessed utilizing UKBB summary statistics for 1,078 SNPs.
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. Distribution of variable effects associated with quantile normalized traits.
For 69 biomedical traits we used a multiple linear regression model to analyze the effects of global AI ancestry on each trait while controlling for birth year, center, gender, sampling weight, educational attainment, US-born status, and number of US-born parents. Variables significantly associated with the traits (Bonferroni correction p<6.6E-5) are highlighted in red.
Figure 4—figure supplement 2.
Figure 4—figure supplement 2.. Comparison of allele frequencies used in polygenic height score calculations.
Plotted are the allele frequencies of the non-reference allele in UKBB vs. 1000 Genomes Americas (AMR) population for the 1078 SNPs used to calculate the polygenic height score for the HCHS/SOL Mexican Americans. Colors indicated whether the non-reference allele has a positive or negative effect.
Figure 4—figure supplement 3.
Figure 4—figure supplement 3.. Polygenic height scores over time.
The relationship between birth year and polygenic height score; the black line indicates the fitted linear model for all individuals. Each color represents a different quartile of Amerindigenous global ancestry. Polygenic height scores were assessed utilizing UKBB summary statistics for 1,078 SNPs.
Figure 4—figure supplement 4.
Figure 4—figure supplement 4.. Correlation of 69 p-values for Amerindigenous effect sizes of untransformed vs quantile normalized traits.
Author response image 1.
Author response image 1.

References

    1. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA, 1000 Genomes Project Consortium An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. doi: 10.1038/nature11632. - DOI - PMC - PubMed
    1. Agarwala V, Flannick J, Sunyaev S, Altshuler D, GoT2D Consortium Evaluating empirical bounds on complex disease genetic architecture. Nature Genetics. 2013;45:1418–1427. doi: 10.1038/ng.2804. - DOI - PMC - PubMed
    1. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Research. 2009;19:1655–1664. doi: 10.1101/gr.094052.109. - DOI - PMC - PubMed
    1. Altshuler DM, Gibbs RA, Peltonen L, Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, Yu F, Peltonen L, Dermitzakis E, Bonnen PE, Altshuler DM, Gibbs RA, de Bakker PI, Deloukas P, Gabriel SB, Gwilliam R, Hunt S, Inouye M, Jia X, Palotie A, Parkin M, Whittaker P, Yu F, Chang K, Hawes A, Lewis LR, Ren Y, Wheeler D, Gibbs RA, Muzny DM, Barnes C, Darvishi K, Hurles M, Korn JM, Kristiansson K, Lee C, McCarrol SA, Nemesh J, Dermitzakis E, Keinan A, Montgomery SB, Pollack S, Price AL, Soranzo N, Bonnen PE, Gibbs RA, Gonzaga-Jauregui C, Keinan A, Price AL, Yu F, Anttila V, Brodeur W, Daly MJ, Leslie S, McVean G, Moutsianas L, Nguyen H, Schaffner SF, Zhang Q, Ghori MJ, McGinnis R, McLaren W, Pollack S, Price AL, Schaffner SF, Takeuchi F, Grossman SR, Shlyakhter I, Hostetter EB, Sabeti PC, Adebamowo CA, Foster MW, Gordon DR, Licinio J, Manca MC, Marshall PA, Matsuda I, Ngare D, Wang VO, Reddy D, Rotimi CN, Royal CD, Sharp RR, Zeng C, Brooks LD, McEwen JE, International HapMap 3 Consortium Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467:52–58. doi: 10.1038/nature09298. - DOI - PMC - PubMed
    1. Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR, 1000 Genomes Project Consortium A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. - DOI - PMC - PubMed

Publication types