Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Apr 17;11(4):e1004219.
doi: 10.1371/journal.pcbi.1004219. eCollection 2015 Apr.

MAGMA: generalized gene-set analysis of GWAS data

Affiliations

MAGMA: generalized gene-set analysis of GWAS data

Christiaan A de Leeuw et al. PLoS Comput Biol. .

Abstract

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Comparison of gene analysis results for different test-statistics.
Gene -log10 p-values from the CD data gene analysis in MAGMA for three different gene test-statistics, comparing analyses using (A) the mean χ 2 statistic with the top χ 2 statistic, (B) the mean χ 2 statistic and the PC regression model and (C) the top χ 2 statistic and the PC regression model. P-values below 10–8 are truncated to 10–8 (grey points) to preserve the visibility of the other points.
Fig 2
Fig 2. Comparison of self-contained gene-set analysis results.
Gene set—log10 p-values from the CD data self-contained gene-set analysis for MAGMA and PLINK. Panel (A) shows the PLINK-avg (no pruning) results compared with the MAGMA-main analysis, panel (B) the PLINK-prune results compared with the MAGMA-main analysis and (C) the two PLINK analyses compared to each other. P-values below 10–8 are truncated to 10–8 (grey points) to preserve the visibility of the other points.
Fig 3
Fig 3. Comparison of competitive gene-set analysis results.
Gene set -log10 p-values from the CD data competitive gene-set analysis for MAGMA, ALIGATOR, INRICH and MAGENTA. Results for ALIGATOR and INRICH are shown for each for the SNP p-value cutoff that yielded the highest observed power (0.01 and 0.0001 respectively), MAGENTA at the advised 5th percentile cutoff. P-values for gene sets not evaluated by one of the methods are shown in grey. The shown correlations are for the -log10 p-values for gene-sets evaluated by both methods.
Fig 4
Fig 4. Comparison of competitive gene-set analysis results at different SNP cut-offs.
Comparison of gene set -log10 p-values from the CD data competitive gene-set analysis at different SNP p-value cut-offs for ALIGATOR (top row), INRICH (middle row) and MAGENTA (bottom row). The highest cut-off on the horizontal axis is compared to each of the lower cut-offs. P-values for gene sets not evaluated at the lower cut-off are shown in grey. The shown correlations are for the -log10 p-values for gene-sets evaluated at both cut-offs. Horizontal and vertical grey dotted lines demarcate the p = 0.05 nominal significance threshold.

Similar articles

Cited by

  • Single cell transcriptomes and multiscale networks from persons with and without Alzheimer's disease.
    Wang Q, Antone J, Alsop E, Reiman R, Funk C, Bendl J, Dudley JT, Liang WS, Karr TL, Roussos P, Bennett DA, De Jager PL, Serrano GE, Beach TG, Van Keuren-Jensen K, Mastroeni D, Reiman EM, Readhead BP. Wang Q, et al. Nat Commun. 2024 Jul 10;15(1):5815. doi: 10.1038/s41467-024-49790-0. Nat Commun. 2024. PMID: 38987616 Free PMC article.
  • Common risk variants in NPHS1 and TNFSF15 are associated with childhood steroid-sensitive nephrotic syndrome.
    Jia X, Yamamura T, Gbadegesin R, McNulty MT, Song K, Nagano C, Hitomi Y, Lee D, Aiba Y, Khor SS, Ueno K, Kawai Y, Nagasaki M, Noiri E, Horinouchi T, Kaito H, Hamada R, Okamoto T, Kamei K, Kaku Y, Fujimaru R, Tanaka R, Shima Y; Research Consortium on Genetics of Childhood Idiopathic Nephrotic Syndrome in Japan; Baek J, Kang HG, Ha IS, Han KH, Yang EM; Korean Consortium of Hereditary Renal Diseases in Children; Abeyagunawardena A, Lane B, Chryst-Stangl M, Esezobor C, Solarin A; Midwest Pediatric Nephrology Consortium (Genetics of Nephrotic Syndrome Study Group); Dossier C, Deschênes G; NEPHROVIR; Vivarelli M, Debiec H, Ishikura K, Matsuo M, Nozu K, Ronco P, Cheong HI, Sampson MG, Tokunaga K, Iijima K. Jia X, et al. Kidney Int. 2020 Nov;98(5):1308-1322. doi: 10.1016/j.kint.2020.05.029. Epub 2020 Jun 14. Kidney Int. 2020. PMID: 32554042 Free PMC article.
  • A Genome-Wide Association Study of Metabolic Syndrome in the Taiwanese Population.
    Ho CY, Lee JI, Huang SP, Chen SC, Geng JH. Ho CY, et al. Nutrients. 2023 Dec 25;16(1):77. doi: 10.3390/nu16010077. Nutrients. 2023. PMID: 38201907 Free PMC article.
  • Independent replications and integrative analyses confirm TRANK1 as a susceptibility gene for bipolar disorder.
    Li W, Cai X, Li HJ, Song M, Zhang CY, Yang Y, Zhang L, Zhao L, Liu W, Wang L, Shao M, Zhang Y, Zhang C, Cai J, Zhou DS, Li X, Hui L, Jia QF, Qu N, Zhong BL, Zhang SF, Chen J, Xia B, Li Y, Song X, Fan W, Tang W, Tang W, Tang J, Chen X, Yue W, Zhang D, Fang Y, Xiao X, Li M, Lv L, Chang H. Li W, et al. Neuropsychopharmacology. 2021 May;46(6):1103-1112. doi: 10.1038/s41386-020-00788-4. Epub 2020 Aug 13. Neuropsychopharmacology. 2021. PMID: 32791513 Free PMC article.
  • The neocortical infrastructure for language involves region-specific patterns of laminar gene expression.
    Wong MMK, Sha Z, Lütje L, Kong XZ, van Heukelum S, van de Berg WDJ, Jonkman LE, Fisher SE, Francks C. Wong MMK, et al. Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2401687121. doi: 10.1073/pnas.2401687121. Epub 2024 Aug 12. Proc Natl Acad Sci U S A. 2024. PMID: 39133845 Free PMC article.

References

    1. Visscher PM, Brown MA, McCarthy MI, Yang J (2012) Five years of GWAS discovery. Am J Hum Genet 90: 7–24. 10.1016/j.ajhg.2011.11.029 - DOI - PMC - PubMed
    1. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, et al. (2009) Finding the missing heritability of complex diseases. Nature 461: 747–753. 10.1038/nature08494 - DOI - PMC - PubMed
    1. Lee SH, Wray NR, Goddard ME, Visscher PM (2011) Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet 88: 294–305. 10.1016/j.ajhg.2011.02.002 - DOI - PMC - PubMed
    1. Lango Allen H, Estrada K, Lettre G, Berndt SI, Weedon MN, et al. (2010) Hundreds of variants clustered in genomic loci and biological pahtways affect human height. Nature 467: 832–838. 10.1038/nature09410 - DOI - PMC - PubMed
    1. Ripke S, O’Dushlaine C, Chambert K, Moran JL, Kähler AK, et al. (2013) Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nat Genet 45: 1150–1159. 10.1038/ng.2742 - DOI - PMC - PubMed

Publication types