Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 12;3(3):100117.
doi: 10.1016/j.xhgg.2022.100117. eCollection 2022 Jul 14.

Accounting for population structure in genetic studies of cystic fibrosis

Affiliations

Accounting for population structure in genetic studies of cystic fibrosis

Hanley Kingston et al. HGG Adv. .

Abstract

CFTR F508del (c.1521_1523delCTT, p.Phe508delPhe) is the most common pathogenic allele underlying cystic fibrosis (CF), and its frequency varies in a geographic cline across Europe. We hypothesized that genetic variation associated with this cline is overrepresented in a large cohort (N > 5,000) of persons with CF who underwent whole-genome sequencing and that this pattern could result in spurious associations between variants correlated with both the F508del genotype and CF-related outcomes. Using principal-component (PC) analyses, we showed that variation in the CFTR region disproportionately contributes to a PC explaining a relatively high proportion of genetic variance. Variation near CFTR was correlated with population structure among persons with CF, and this correlation was driven by a subset of the sample inferred to have European ancestry. We performed genome-wide association studies comparing persons with CF with one versus two copies of the F508del allele; this allowed us to identify genetic variation associated with the F508del allele and to determine that standard PC-adjustment strategies eliminated the significant association signals. Our results suggest that PC adjustment can adequately prevent spurious associations between genetic variants and CF-related traits and are therefore effective tools to control for population structure even when population structure is confounded with disease severity and a common pathogenic variant.

Keywords: CFTR F508del; genome-wide association study; population structure.

PubMed Disclaimer

Conflict of interest statement

M.B. is the editor-in-chief and J.X.C. (member of the Cystic Fibrosis Genome Project) is the deputy editor of HGG Advances. The authors declare no other competing interests.

Figures

Figure 1
Figure 1
Population structure within the entire CFGP (n = 4,939) Pairwise principal-component (PC) plots are shown for PCs 1–4 with frequency distributions and percentage of variance explained by each PC on the diagonal. Ancestry estimates indicate the ancestry with the highest estimated proportion using Somalier. Abbreviations: AFR, sub-Saharan African; AMR, Native American; EAS, East Asian; EUR, European; SAS, South Asian.
Figure 2
Figure 2
Correlation between PCs and genomic position (A and B) The correlation between PCs (Y axis) and genomic position (X axis) are shown for the (A) CFGP (n = 4,939) and (B) CFGP participants with estimated European ancestry >80% (n = 4,567). The number of PCs shown is the number used to calculate the genetic relatedness matrix and, for the total CFGP dataset, used in the PC-adjusted GWAS analysis. Color-coded regions include 7q21.31 (CFTR, pink) and three regions that have previously shown evidence of long-range LD: 2q21.1-2q22.1 (LCT, teal), 6p22.3-6p21.2 (the major histocompatibility complex, orange), and the 8p23 inversion polymorphism (purple).
Figure 3
Figure 3
GWASs for CFTR F508del heterozygosity versus homozygosity (Top) The baseline model adjusted for site and relatedness. (Bottom) The PC-adjusted model. Association signals are measured as -log10(p values). Plot is truncated at p = 1 × 10-10, as the peak at CFTR on chr7 reaches p < 1 × 10-300 under both models. The genome-wide significance level, p < 5 × 10-8, is indicated by the horizontal line.

References

    1. Bell S.C., Mall M.A., Gutierrez H., Macek M., Madge S., Davies J.C., Burgel P.R., Tullis E., Castanos C., Castellani C., et al. The future of cystic fibrosis care: a global perspective. Lancet Respir. Med. 2020;8:65–124. doi: 10.1016/S2213-2600(19)30337-6. - DOI - PMC - PubMed
    1. Lopes-Pacheco M. CFTR modulators: the changing face of cystic fibrosis in the era of precision medicine. Front. Pharmacol. 2020;10:1662. doi: 10.3389/fphar.2019.01662. - DOI - PMC - PubMed
    1. Drumm M.L., Konstan M.W., Schluchter M.D., Handler A., Pace R., Zou F., Zariwala M., Fargo D., Xu A., Dunn J.M., et al. Genetic modifiers of lung disease in cystic fibrosis. N. Engl. J. Med. 2005;353:1443–1453. doi: 10.1056/NEJMoa051469. - DOI - PubMed
    1. Karczewski K.J., Francioli L.C., MacArthur D.G., Cummings B.B., Alföldi J., Wang Q., Collins R.L., Laricchia K.M., Ganna A., Birnbaum D.P., et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–443. doi: 10.1530/ey.17.14.3. - DOI - PMC - PubMed
    1. Mateu E., Calafell F., Ramos M.D., Casals T., Bertranpetit J. Can a place of origin of the main cystic fibrosis mutations be identified? Am. J. Hum. Genet. 2002;70:257–264. doi: 10.1086/338243. - DOI - PMC - PubMed