Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun;93(3):295-305.
doi: 10.1007/s00239-025-10246-8. Epub 2025 Apr 29.

Ancient Host-Virus Gene Transfer Hints at a Diverse Pre-LECA Virosphere

Affiliations

Ancient Host-Virus Gene Transfer Hints at a Diverse Pre-LECA Virosphere

Sangita Karki et al. J Mol Evol. 2025 Jun.

Abstract

The details surrounding the early evolution of eukaryotes and their viruses are largely unknown. Several key enzymes involved in DNA synthesis and transcription are shared between eukaryotes and large DNA viruses in the phylum Nucleocytoviricota, but the evolutionary relationships between these genes remain unclear. In particular, previous studies of eukaryotic DNA and RNA polymerases often show deep-branching clades of eukaryotes and viruses indicative of ancient gene exchange. Here, we performed updated phylogenetic analysis of eukaryotic and viral family B DNA polymerases, multimeric RNA polymerases, and mRNA-capping enzymes to explore their evolutionary relationships. Our results show that viral enzymes form clades that are typically adjacent to eukaryotes, suggesting that they originate prior to the emergence of the Last Eukaryotic Common Ancestor (LECA). The machinery for viral DNA replication, transcription, and mRNA capping are all key processes needed for the maintenance of virus factories, which are complex structures formed by many nucleocytoviruses during infection, indicating that viruses capable of making these structures are ancient. These findings hint at a diverse and complex pre-LECA virosphere and indicate that large DNA viruses may encode proteins that are relics of extinct proto-eukaryotic lineages.

Keywords: Early eukaryotes; Giant viruses; Mirusviruses; Nucleocytoviricota; Virus factory.

PubMed Disclaimer

Conflict of interest statement

Declarations. Conflict of interest: The authors declare no conflicts of interest.

Figures

Fig. 1
Fig. 1
Phylogenetic tree of DNA polymerase family B demonstrating nested placement of Polδ in a viral clade and Polε with Asgard archaea (957 total sequences, 1417 sites). a Maximum-likelihood analysis was performed using IQ-TREE using the complex model LG+C60+F+G. Ultrafast bootstrap support values for select deep-branching nodes are shown (black dot > = 95%, blue dot 90–94%). For clarity, support values are only provided for select internal nodes. b Rectangular representation of the region of the polB phylogenetic tree highlighting the evolutionary relationships between viral groups and eukaryotic Polδ. Values at nodes represent their ultrafast bootstrap support. Polζ = PolZeta; Polα = PolAlpha; Polδ = PolDelta and Polε = PolEpsilon in the figure (Color figure online)
Fig. 2
Fig. 2
Phylogenetic tree for RNA Polymerase (RNAP). The alignment is based on a concatenated set of Beta and Beta prime subunits from 1017 sequences (resulting in a total alignment length of 3812 sites). Maximum-likelihood analysis was performed using IQ-TREE under a complex model (LG+C60+F+G). The dots on the branches represent ultrafast bootstrap support values (black dot > = 99%). For clarity, support values are only provided for selected internal nodes. Full trees are available in the supplemental material. The tree is rooted between the bacteria and all other taxa (Color figure online)
Fig. 3
Fig. 3
Phylogenetic tree for mRNA-capping enzyme (upper clade) along with ATP-dependent DNA ligase (lower clade, labeled). Trees were made using different trimming strategies. Sites with 90% gaps removed (left) resulting in total alignment of 856 sites. Sites with 50% gaps removed, resulting in total alignment of 326 sites (right). Maximum-likelihood analysis was performed using IQ-TREE using LG+F+R10 model
Fig. 4
Fig. 4
Schematic of possible evolutionary scenarios that would lead to the nested placement of core eukaryotic genes within broader clades of viruses. In scenario 1 (S1), viruses acquire core machinery from proto-eukaryotic lineages that subsequently go extinct, In scenario 2 (S2), virus-to-eukaryotic gene transfer takes place prior to the emergence of LECA

Similar articles

Cited by

References

    1. Aylward FO, Moniruzzaman M (2022) Viral complexity. Biomolecules 12:1061 - PMC - PubMed
    1. Aylward FO, Moniruzzaman M, Ha AD, Koonin EV (2021) A phylogenomic framework for charting the diversity and evolution of giant viruses. PLoS Biol 19:e3001430 - PMC - PubMed
    1. Bell PJ (2001) Viral eukaryogenesis: was the ancestor of the nucleus a complex DNA virus? J Mol Evol 53:251–256 - PubMed
    1. Bosmon T, Abergel C, Claverie J-M (2025) 20 years of research on giant viruses. Npj Viruses 3:1–11 - PMC - PubMed
    1. Brahim Belhaouari D, Pires De Souza GA, Lamb DC et al (2022) Metabolic arsenal of giant viruses: host hijack or self-use? Elife. 10.7554/eLife.78674 - PMC - PubMed

Substances

LinkOut - more resources