Nine quick tips for pathway enrichment analysis
- PMID: 35951505
- PMCID: PMC9371296
- DOI: 10.1371/journal.pcbi.1010348
Nine quick tips for pathway enrichment analysis
Abstract
Pathway enrichment analysis (PEA) is a computational biology method that identifies biological functions that are overrepresented in a group of genes more than would be expected by chance and ranks these functions by relevance. The relative abundance of genes pertinent to specific pathways is measured through statistical methods, and associated functional pathways are retrieved from online bioinformatics databases. In the last decade, along with the spread of the internet, higher availability of computational resources made PEA software tools easy to access and to use for bioinformatics practitioners worldwide. Although it became easier to use these tools, it also became easier to make mistakes that could generate inflated or misleading results, especially for beginners and inexperienced computational biologists. With this article, we propose nine quick tips to avoid common mistakes and to out a complete, sound, thorough PEA, which can produce relevant and robust results. We describe our nine guidelines in a simple way, so that they can be understood and used by anyone, including students and beginners. Some tips explain what to do before starting a PEA, others are suggestions of how to correctly generate meaningful results, and some final guidelines indicate some useful steps to properly interpret PEA results. Our nine tips can help users perform better pathway enrichment analyses and eventually contribute to a better understanding of current biology.
Conflict of interest statement
The authors have declared that no competing interests exist.
Similar articles
-
Ten quick tips for bioinformatics analyses using an Apache Spark distributed computing environment.PLoS Comput Biol. 2023 Jul 20;19(7):e1011272. doi: 10.1371/journal.pcbi.1011272. eCollection 2023 Jul. PLoS Comput Biol. 2023. PMID: 37471333 Free PMC article.
-
Seven quick tips for gene-focused computational pangenomic analysis.BioData Min. 2024 Sep 3;17(1):28. doi: 10.1186/s13040-024-00380-2. BioData Min. 2024. PMID: 39227987 Free PMC article.
-
Ten quick tips for clinical electroencephalographic (EEG) data acquisition and signal processing.PeerJ Comput Sci. 2024 Sep 3;10:e2256. doi: 10.7717/peerj-cs.2256. eCollection 2024. PeerJ Comput Sci. 2024. PMID: 39314688 Free PMC article.
-
Bioinformatics software resources.Brief Bioinform. 2004 Sep;5(3):300-4. doi: 10.1093/bib/5.3.300. Brief Bioinform. 2004. PMID: 15383216 Review.
-
Ten quick tips for machine learning in computational biology.BioData Min. 2017 Dec 8;10:35. doi: 10.1186/s13040-017-0155-3. eCollection 2017. BioData Min. 2017. PMID: 29234465 Free PMC article. Review.
Cited by
-
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning.BioData Min. 2022 Nov 3;15(1):28. doi: 10.1186/s13040-022-00312-y. BioData Min. 2022. PMID: 36329531 Free PMC article.
-
Genes Differentially Expressed Across Major Arteries Are Enriched in Endothelial Dysfunction-Related Gene Sets: Implications for Relative Inter-artery Atherosclerosis Risk.Bioinform Biol Insights. 2024 May 16;18:11779322241251563. doi: 10.1177/11779322241251563. eCollection 2024. Bioinform Biol Insights. 2024. PMID: 38765020 Free PMC article.
-
Plasma Proteomic Signatures of Physical Activity Provide Insights into Biological Impacts of Physical Activity and its Protective Role Against Dementia.medRxiv [Preprint]. 2025 Jan 31:2025.01.16.25320290. doi: 10.1101/2025.01.16.25320290. medRxiv. 2025. PMID: 39867359 Free PMC article. Preprint.
-
Combined inhibition of EZH2 and CDK4/6 perturbs endoplasmic reticulum-mitochondrial homeostasis and increases antitumor activity against glioblastoma.NPJ Precis Oncol. 2024 Jul 25;8(1):156. doi: 10.1038/s41698-024-00653-3. NPJ Precis Oncol. 2024. PMID: 39054369 Free PMC article.
-
Cluefish: mining the dark matter of transcriptional data series with over-representation analysis enhanced by aggregated biological prior knowledge.NAR Genom Bioinform. 2025 Jul 30;7(3):lqaf103. doi: 10.1093/nargab/lqaf103. eCollection 2025 Sep. NAR Genom Bioinform. 2025. PMID: 40740691 Free PMC article.
References
-
- Trupp M, Altman T, Fulcher CA, Caspi R, Krummenacker M, Paley S, et al.. Beyond the genome (BTG) is a (PGDB) pathway genome database: HumanCyc. Genome Biol. 2010;11(1):1–1.
-
- Acevedo A, Durán C, Ciucci S, Gerl M, Cannistraci CV. LIPEA: lipid pathway enrichment analysis. bioRxiv. 2018;274969:1–5.
MeSH terms
LinkOut - more resources
Full Text Sources