Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Mar 3:16:1474026.
doi: 10.3389/fphar.2025.1474026. eCollection 2025.

Genome-wide functional annotation of variants: a systematic review of state-of-the-art tools, techniques and resources

Affiliations
Review

Genome-wide functional annotation of variants: a systematic review of state-of-the-art tools, techniques and resources

Eleftherios Pilalis et al. Front Pharmacol. .

Abstract

The recent advancement of sequencing technologies marks a significant shift in the character and complexity of the digital genomic data universe, encompassing diverse types of molecular data, screened through manifold technological platforms. As a result, a plethora of fully assembled genomes are generated that span vertically the evolutionary scale. Notwithstanding the tsunami of thriving innovations that accomplish unprecedented, nucleotide-level, structural and functional annotation, an exhaustive, systemic, massive genome-wide functional annotation remains elusive, particularly when the criterion is automation and efficiency in data-agnostic interpretation. The latter is of paramount importance for the elaboration of strategies for sophisticated, data-driven genome-wide annotation, which aim to impart a sustainable and comprehensive systemic approach to addressing whole genome variation. Therefore, it is essential to develop methods and tools that promote systematic functional genomic annotation, with emphasis on mechanistic information exceeding the limits of coding regions, and exploiting the chunks of pertinent information residing in non-coding regions, including promoter and enhancer sequences, non-coding RNAs, DNA methylation sites, transcription factor binding sites, transposable elements and more. This review provides an overview of the current state-of-the-art in genome-wide functional annotation of genetic variation, including existing bioinformatic tools, resources, databases and platforms currently available or reported in the literature. Particular emphasis is placed on the functional annotation of variants that lie outside protein-coding genomic regions (intronic or intergenic), their potential co-localization with regulatory element areas, such as putative non-coding RNA regions, and the assessment of their functional impact on the investigated phenotype. In addition, state-of-the-art tools that leverage data obtained from WGS and GWAS-based analyses are discussed, along with future bioinformatics directions and developments. These future directions emphasize efficient, comprehensive, and largely automated functional annotation of both coding and non-coding genomic variants, as well as their optimal evaluation.

Keywords: GWAS; genome-wide; genomic variation; genotyping; intergenic; variant annotation; whole-exome sequencing; whole-genome sequencing.

PubMed Disclaimer

Conflict of interest statement

TK, MA and TS are full-time employees of Pfizer. EP and AC are co-founders and DZ and CA are full-time employees of e-NIOS Applications PC. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision. This research was conducted as a collaboration between e-NIOS Applications and Pfizer. Pfizer is the research sponsor. Pfizer has been involved in the Validation, Writing-review & editing and Project administration.

Figures

FIGURE 1
FIGURE 1
PRISMA flowchart that shows the step-by-step process of the application of inclusion and exclusion criteria to generate a final number of articles for analysis in the literature review.
FIGURE 2
FIGURE 2
Selection of publications attributed to three distinct categories, namely major computational approaches (fundamental annotation tools), databases and repositories (annotation resources) and aggregator tools and platforms.
FIGURE 3
FIGURE 3
Classification of functional annotation tools (A) Venn diagram of annotation tools, classified by genomic region of focus (B) Venn diagram of aggregator platforms, classified by genomic region of focus (C) Proportions of aggregator tools based on the prediction of the systemic impact of variants.

References

    1. Adzhubei I. A., Schmidt S., Peshkin L., Ramensky V. E., Gerasimova A., Bork P., et al. (2010). A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249. 10.1038/nmeth0410-248 - DOI - PMC - PubMed
    1. Ashburner M., Ball C. A., Blake J. A., Botstein D., Butler H., Cherry J. M., et al. (2000). Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29. 10.1038/75556 - DOI - PMC - PubMed
    1. Balvert M., Cooper-Knock J., Stamp J., Byrne R. P., Mourragui S., Van Gils J., et al. (2024). Considerations in the search for epistasis. Genome Biol. 25, 296. 10.1186/s13059-024-03427-z - DOI - PMC - PubMed
    1. Bernstein B. E., Stamatoyannopoulos J. A., Costello J. F., Ren B., Milosavljevic A., Meissner A., et al. (2010). The NIH Roadmap Epigenomics mapping Consortium. Nat. Biotechnol. 28, 1045–1048. 10.1038/nbt1010-1045 - DOI - PMC - PubMed
    1. Biggs H., Parthasarathy P., Gavryushkina A., Gardner P. P. (2020). ncVarDB: a manually curated database for pathogenic non-coding variants and benign controls. Database J. Biol. Databases Curation 2020, baaa105. 10.1093/database/baaa105 - DOI - PMC - PubMed

LinkOut - more resources