Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Oct 14.
doi: 10.1007/s00239-025-10272-6. Online ahead of print.

Quest for Orthologs in the era of Data Deluge and AI: Challenges and Innovations in Orthology Prediction and Data Integration

Collaborators, Affiliations
Review

Quest for Orthologs in the era of Data Deluge and AI: Challenges and Innovations in Orthology Prediction and Data Integration

Sina Majidian et al. J Mol Evol. .

Abstract

The rapid advancement of DNA sequencing technologies and computational algorithms has led to an unprecedented surge in genomic data, driven by several large-scale sequencing projects worldwide. Orthology plays a crucial role in understanding evolutionary patterns of genes and their functions. At the last Quest for Orthologs meeting (Montréal, Canada-2024), we discussed recent advances in orthology inference, with a focus on the impact of artificial intelligence (AI), protein structures, RNA splicing isoforms, and protein domain evolution together with other evolutionary considerations. A long-standing challenge in the field is the functional annotation of paralogs, for which we present novel approaches. The meeting also emphasised strategies for integrating diverse genetic features into the concept of orthology, encouraging frameworks that account for elements like alternative splicing, domain organisation, and regulatory sequences. We discuss various applications of orthology and paralogy to environmental research, agriculture, and comparative genomics. Additionally, we report recent progress in orthology inference methodologies and resources. This work represents a collaborative synthesis of insights and innovations presented at the 8th Quest for Orthologs meeting, highlighting current progress while outlining future directions for orthology research.

Keywords: Artificial intelligence; Gene function; Orthology; Paralogy; Protein domains.

PubMed Disclaimer

Conflict of interest statement

Declarations. Conflict of Interest: The authors have no competing interests to declare that are relevant to the content of this article.

References

    1. Ahdritz G, Bouatta N, Floristean C et al (2024) Openfold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization. Nat Methods 21:1514–1524 - PubMed - PMC - DOI
    1. Alcaraz AJ, Murray S, Ankley P et al (2025) Transcriptomics points-of-departure (tPODs) to support hazard assessment of Benzo[a]pyrene in early-life-stage rainbow trout. Environ Sci Technol. https://doi.org/10.1021/acs.est.4c11870 - DOI - PubMed
    1. Ali RH, Muhammad S, Khan M, Arvestad L (2013) Quantitative synteny scoring improves homology inference and partitioning of gene families. BMC Bioinformatics 14(Suppl 15):S12 - PubMed - PMC - DOI
    1. Ali RH, Muhammad SA, Arvestad L (2016) Genfamclust: an accurate, synteny-aware and reliable homology inference algorithm. BMC Evol Biol 16:120 - PubMed - PMC - DOI
    1. Altenhoff AM, Boeckmann B, Capella-Gutierrez S et al (2016) Standardized benchmarking in the quest for orthologs. Nat Methods 13:425–430 - PubMed - PMC - DOI

LinkOut - more resources