Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jan 29:4:e05378.
doi: 10.7554/eLife.05378.

Unprecedented genomic diversity of RNA viruses in arthropods reveals the ancestry of negative-sense RNA viruses

Affiliations

Unprecedented genomic diversity of RNA viruses in arthropods reveals the ancestry of negative-sense RNA viruses

Ci-Xiu Li et al. Elife. .

Abstract

Although arthropods are important viral vectors, the biodiversity of arthropod viruses, as well as the role that arthropods have played in viral origins and evolution, is unclear. Through RNA sequencing of 70 arthropod species we discovered 112 novel viruses that appear to be ancestral to much of the documented genetic diversity of negative-sense RNA viruses, a number of which are also present as endogenous genomic copies. With this greatly enriched diversity we revealed that arthropods contain viruses that fall basal to major virus groups, including the vertebrate-specific arenaviruses, filoviruses, hantaviruses, influenza viruses, lyssaviruses, and paramyxoviruses. We similarly documented a remarkable diversity of genome structures in arthropod viruses, including a putative circular form, that sheds new light on the evolution of genome organization. Hence, arthropods are a major reservoir of viral genetic diversity and have likely been central to viral evolution.

Keywords: RNA virus; arthropods; evolution; infectious disease; microbiology; negative-sense; phylogeny; segmentation; viruses.

PubMed Disclaimer

Conflict of interest statement

The authors declare that no competing interests exist.

Figures

Figure 1.
Figure 1.. Host component of each pool used in the RNA-seq library construction and sequencing.
The taxonomic units in the tree correspond to the unit samples used in the RNA extraction. Species or genus information is marked to the left of the tree. DOI: http://dx.doi.org/10.7554/eLife.05378.004
Figure 2.
Figure 2.. Abundance level (transcripts per million—TPM) of the RdRp genes from the negative-sense RNA viruses detected in this study.
Abundance is calculated after the removal of ribosomal RNA reads. As a comparison, we show the abundance of the two well characterized (positive-sense) RNA viruses: Japanese encephalitis virus and Gill-associated virus found in the Mosquito-Hubei and Shrimp libraries, respectively, as well as the range of abundance of host mitochondrial COI genes in these same multi-host libraries. DOI: http://dx.doi.org/10.7554/eLife.05378.008
Figure 3.
Figure 3.. Evolutionary history of negative-sense RNA viruses based on RdRp.
This is initially displayed in an unrooted maximum likelihood (ML) tree including all major groups of negative-sense RNA viruses (A). Separate and more detailed ML phylogenies are then shown for the Orthomyxoviridae-like (B), Bunya-Arenaviridae-like (C), and Mononegavirales-like viruses (D). In all the phylogenies, the RdRp sequences described here from arthropods are either shaded purple or marked with solid gray circles. The names of previously defined genera/families are labeled to the right of the phylogenies. Based on their host types, the branches are shaded red (vertebrate-specific), yellow (vertebrate and arthropod), green (plant and arthropod), blue (non-arthropod invertebrates), or black (arthropod only). For clarity, statistical supports (i.e., approximate likelihood-ratio test (aLRT) with Shimodaira–Hasegawa-like procedure/posterior probabilities) are shown for key internal nodes only. DOI: http://dx.doi.org/10.7554/eLife.05378.009
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. A fully labeled ML phylogeny for Orthomyxoviridae-like viruses.
The phylogeny is reconstructed using RdRp alignments. Statistical support from the approximate likelihood-ratio test (aLRT) is shown on each node of the tree. The names of the viruses discovered in this study are shown in red. The names of reference sequences, which contain both the GenBank accession number and the virus species name, are shown in black. The names of previously defined genera/families are shown to the right of the phylogenies. DOI: http://dx.doi.org/10.7554/eLife.05378.010
Figure 3—figure supplement 2.
Figure 3—figure supplement 2.. A fully labeled ML phylogeny for Bunya-Arenaviridae-like viruses.
The phylogeny is reconstructed using RdRp alignments. Statistical support from the aLRT is shown on each node of the tree. The names of the viruses discovered in this study are shown in red. The names of reference sequences, which contain both the GenBank accession number and the virus species name, are shown in black. The names of previously defined genera/families are shown to the right of the phylogenies. DOI: http://dx.doi.org/10.7554/eLife.05378.011
Figure 3—figure supplement 3.
Figure 3—figure supplement 3.. A fully labeled ML phylogeny for Mononegavirales-like viruses.
The phylogeny is reconstructed using RdRp alignments. Statistical support from the aLRT is shown on each node of the tree. The names of the viruses discovered in this study are shown in red. The names of reference sequences, which contain both the GenBank accession number and the virus species name, are shown in black. The names of previously defined genera/families are shown to the right of the phylogenies. DOI: http://dx.doi.org/10.7554/eLife.05378.012
Figure 4.
Figure 4.. The unrooted ML phylogeny based on RdRp showing the topological position of segmented viruses within the genetic diversity of negative-sense RNA viruses.
The segmented viruses are labeled with segment numbers and shaded red. The unsegmented viruses are shaded green. The Chuviridae, which exhibit a wide variety of genome organizations, are shaded cyan. Three major types of putative chuvirus genomes (circular, circular and segmented, and linear) are shown in the right panel and are annotated with predicted ORFs: putative RdRp genes are shaded blue, putative glycoprotein genes are shaded orange, and the remaining ORFs are shaded gray. DOI: http://dx.doi.org/10.7554/eLife.05378.013
Figure 5.
Figure 5.. The unrooted ML phylogeny of negative-sense RNA viruses (RdRp) with the common names of the principle arthropod hosts analyzed in this study indicated.
Vertebrate-specific viruses are shaded red, those infecting both vertebrates and arthropods (or with unknown vectors) are shaded yellow, those infecting both plants and arthropods are shaded green, those infecting non-arthropod invertebrates are shaded blue, and the remainder (arthropod only) are shaded black. DOI: http://dx.doi.org/10.7554/eLife.05378.014
Figure 6.
Figure 6.. Phylogenetic congruence between viruses (M segments) and hosts.
The comparisons include (A) Wuhan Horsefly Virus, (B) Wuhan Fly Virus 1, (C) Wuhan Mosquito Virus 2, and (D) Wuhan Mosquito Virus 1. Different host species/genera are distinguished with different colors, which are then mapped onto virus phylogeny to assess the phylogenetic congruence. ML phylogenetic trees were inferred in all cases. DOI: http://dx.doi.org/10.7554/eLife.05378.015
Figure 7.
Figure 7.. The differing genome organizations in the Chuviridae.
(A) ML trees of three main putative proteins conserved among the chuviruses. Viruses with circular genomes (Type I) are shaded blue, while those with segmented genomes (Type II) are shaded red. (B) Structures of all complete chuvirus genomes. Circular genomes are indicated with the arrow (blue) situated at the 3′ end, and the genome is drawn in a linear form for ease of comparison only, being broken at the region of variable sequence (refer to the ‘Materials and methods’). (C) An example showing mapping of sequencing reads to the circular chuvirus genome. The template for mapping contains two genomes connected head-to-tail. The two boxes magnify the genomic region containing abundant sequence variation. DOI: http://dx.doi.org/10.7554/eLife.05378.016
Figure 8.
Figure 8.. Genome structures of segmented negative-sense RNA viruses.
Predicted viral proteins homologous to known viral proteins are shown and colored according to their putative functions. The numbers below each ORF box give the predicted molecular mass. DOI: http://dx.doi.org/10.7554/eLife.05378.017
Figure 9.
Figure 9.. Genome structures of unsegmented negative-sense RNA viruses.
Predicted ORFs encoding viral proteins with >10 kDa molecular mass are shown and colored according to their putative functions. The numbers below each ORF box give the predicted molecular mass. DOI: http://dx.doi.org/10.7554/eLife.05378.018
Figure 10.
Figure 10.. Comparison of the genome structure of a potential tenui-like virus from horsefly with a prototype tenuivirus (Rice grassy stunt virus) genome.
DOI: http://dx.doi.org/10.7554/eLife.05378.019
Figure 11.
Figure 11.. ML phylogeny of EVEs.
The phylogeny is based on the glycoprotein of chuviruses in the context of exogenous members of this family (A), with subtrees magnified for (B) the Culicidae clade and (C) the Hymenoptera clade. The EVEs used in the phylogeny covered the complete or near complete length of the glycoprotein gene and are shown in red and labeled according to host taxonomy in the overall tree. For clarity, monophyletic groups are collapsed based on the host taxonomy. Only bootstrap values >70% are shown. DOI: http://dx.doi.org/10.7554/eLife.05378.021
Figure 12.
Figure 12.. Transmission of negative-sense RNA viruses in arthropods and non-arthropods.
Three types of transmission cycle are shown: (i) those between arthropods and plants are shaded green; (ii) those between arthropods and vertebrates are shaded yellow; and (iii) those that are vertebrate-only are shaded red. Viruses associated with each transmission type are also indicated. DOI: http://dx.doi.org/10.7554/eLife.05378.022

References

    1. Ballinger MJ, Bruenn JA, Hay J, Czechowski D, Taylor DJ. Discovery and evolution of bunyavirids in arctic phantom midges and ancient bunyavirid-like sequences in insect genomes. Journal of Virology. 2014;88:8783–8794. doi: 10.1128/JVI.00531-14. - DOI - PMC - PubMed
    1. Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. - DOI - PMC - PubMed
    1. Cook S, Chung BY, Bass D, Moureau G, Tang S, McAlister E, Culverwell CL, Glucksman E, Wang H, Brown TD, Gould EA, Harbach RE, de Lamballerie X, Firth AE. Novel virus discovery and genome reconstruction from field RNA samples reveals highly divergent viruses in dipteran hosts. PLOS ONE. 2013;8:e80720. doi: 10.1371/journal.pone.0080720. - DOI - PMC - PubMed
    1. Cui J, Holmes EC. Endogenous RNA viruses of plants in insect genomes. Virology. 2012;427:77–79. doi: 10.1016/j.virol.2012.02.014. - DOI - PMC - PubMed
    1. Falk BW, Tsai JH. Biology and molecular biology of viruses in the genus Tenuivirus. Annual Review of Phytopathology. 1998;36:139–163. doi: 10.1146/annurev.phyto.36.1.139. - DOI - PubMed

Publication types