Ranking metrics in gene set enrichment analysis: do they matter?

Joanna Zyla¹, Michal Marczyk², January Weiner³, Joanna Polanska¹

Affiliations

¹ Data Mining Group, Institute of Automatic Control, Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Akademicka 16, Gliwice, 44-100, Poland.
² Data Mining Group, Institute of Automatic Control, Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Akademicka 16, Gliwice, 44-100, Poland. michal.marczyk@polsl.pl.
³ Max Planck Institute for Infection Biology, Charitéplatz 1, Berlin, 10117, Germany.

PMID: 28499413
PMCID: PMC5427619
DOI: 10.1186/s12859-017-1674-0

Ranking metrics in gene set enrichment analysis: do they matter?

Joanna Zyla et al. BMC Bioinformatics. 2017.

. 2017 May 12;18(1):256.

doi: 10.1186/s12859-017-1674-0.

Authors

Joanna Zyla¹, Michal Marczyk², January Weiner³, Joanna Polanska¹

Affiliations

¹ Data Mining Group, Institute of Automatic Control, Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Akademicka 16, Gliwice, 44-100, Poland.
² Data Mining Group, Institute of Automatic Control, Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Akademicka 16, Gliwice, 44-100, Poland. michal.marczyk@polsl.pl.
³ Max Planck Institute for Infection Biology, Charitéplatz 1, Berlin, 10117, Germany.

PMID: 28499413
PMCID: PMC5427619
DOI: 10.1186/s12859-017-1674-0

Abstract

Background: There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results.

Methods and results: In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA .

Conclusions: Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner-Weiss-Schindler test statistic gives better outcomes. Also, it finds more enriched pathways than other tested metrics, which may induce new biological discoveries.

Keywords: Functional genomics; GSEA; Pathway analysis; Ranking metrics.

PubMed Disclaimer

Figures

**Fig. 1**
Boxplots of surrogate sensitivity and FPR of gene set analysis. Panel a represents the distribution of target pathways enrichment p-value to each metric presented in logarithmic scale - the lower the better; Panel b represents the results of FPR estimation, where the red line represents the expected outcome - the closer to 5% the better

**Fig. 2**
Results of k-means cluster analysis based on three performance criteria. Results highlighted with *green colour* show good performance, *red colour* represents poor performance and *yellow colour* represents medium performance

**Fig. 3**
Results of k-means cluster analysis based on two performance criteria. The best results have those metrics, where FPR estimation is closest to 0, and sensitivity estimation (1- ${\hat{π}}_{0}$ ) is closest to 1

**Fig. 4**
Robustness of ranking metrics to sample size. Panel a represents surrogate sensitivity assessment of four best metrics for different sample size. Panel b represents FPR estimates under tested sample size

**Fig. 5**
Results of detecting significant gene sets across various thresholds. Panel a represents percentage of significantly enriched pathways. *Solid lines* represent average value across analysed data sets whereas dashed lines represent its confidence intervals. Panel b represents percentage of significantly enriched pathways in experiment design dedicated to FPR evaluation. *Red dashed line* represents the expected outcome

See this image and copyright information in PMC

References

1. Tavazoie S, Hughes JD, Campbell MJ, Cho RJ, Church GM. Systematic determination of genetic network architecture. Nat Genet. 1999;22(3):281–5. doi: 10.1038/10343. - DOI - PubMed
1. Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics. 2007;23(2):257–8. doi: 10.1093/bioinformatics/btl567. - DOI - PubMed
1. Huang DW, Sherman BT, Tan Q, Kir J, Liu D, Bryant D, Guo Y, Stephens R, Baseler MW, Lane HC, et al. DAVID bioinformatics resources: expanded annotation database and novel algorithms to better extract biology from large gene lists. Nucleic Acids Res. 2007;35(suppl 2):169–75. doi: 10.1093/nar/gkm415. - DOI - PMC - PubMed
1. Gruca A, Sikora M, Polanski A. RuleGO: a logical rules-based tool for description of gene groups by means of Gene Ontology. Nucleic Acids Res. 2011;39(suppl 2):293–301. doi: 10.1093/nar/gkr507. - DOI - PMC - PubMed
1. Mootha VK, Lindgren CM, Eriksson KF, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstråle M, Laurila E, et al. PGC-1 α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267–73. doi: 10.1038/ng1180. - DOI - PubMed

MeSH terms

Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Ranking metrics in gene set enrichment analysis: do they matter?

Affiliations

Ranking metrics in gene set enrichment analysis: do they matter?

Authors

Affiliations

Abstract

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources