Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 13;12(18):3257.
doi: 10.3390/plants12183257.

Investigating Variability in Viral Presence and Abundance across Soybean Seed Development Stages Using Transcriptome Analysis

Affiliations

Investigating Variability in Viral Presence and Abundance across Soybean Seed Development Stages Using Transcriptome Analysis

Hoseong Choi et al. Plants (Basel). .

Abstract

Plant transcriptomes offer a valuable resource for studying viral communities (viromes). In this study, we explore how plant transcriptome data can be applied to virome research. We analyzed 40 soybean transcriptomes across different growth stages and identified six viruses: broad bean wilt virus 2 (BBWV2), brassica yellow virus (BrYV), beet western yellow virus (BWYV), cucumber mosaic virus (CMV), milk vetch dwarf virus (MDV), and soybean mosaic virus (SMV). SMV was the predominant virus in both Glycine max (GM) and Glycine soja (GS) cultivars. Our analysis confirmed its abundance in both, while BBWV2 and CMV were more prevalent in GS than GM. The viral proportions varied across developmental stages, peaking in open flowers. Comparing viral abundance measured by viral reads and fragments per kilobase of transcript per million (FPKM) values revealed insights. SMV showed similar FPKM values in GM and GS, but BBWV2 and CMV displayed higher FPKM proportions in GS. Notably, the differences in viral abundance between GM and GS were generally insignificant based on the FPKM values across developmental stages, except for the apical bud stage in four GM cultivars. We also detected MDV, a multi-segmented virus, in two GM samples, with variable proportions of its segments. In conclusion, our study demonstrates the potential of plant transcriptomes for virome research, highlighting their strengths and limitations.

Keywords: soybean seed development; transcriptome analysis; viral load shifts; viromes; virus.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Images of soybean tissues at five developmental stages. All images were captured by Won Kyong Cho in Gadam-ri, Hoengseong-gun, Republic of Korea. (A) Apical bud, (B) flower bud, (C) open flower, (D) developing pod at five days post-fertilization, (E) developing pod at fifteen days post-fertilization. Red-colored arrows are used to highlight each specific tissue.
Figure 2
Figure 2
The identification of viruses from 40 transcriptomes’ data derived from Glycine max (GM) and Glycine soja (GS). (A) The number of viral contigs for the identified viruses from GM and GS are labeled in blue and orange, respectively. (B) The proportion of identified viruses is shown by combining 40 soybean transcriptomes based on the number of contigs assigned to each virus. The number of viral contigs in each sample derived from GM (C) and GS (D). Bean plots display the number of viral contigs in each developmental stage for GM (E) and GS (F). In the bean plot, the black solid lines represent the median values, while the white lines depict individual data points.
Figure 3
Figure 3
The number of viral reads assigned to each virus and the proportion of viral reads in each sample. (A) The number of viral reads for the identified viruses is shown by combining 40 soybean transcriptomes, with blue and orange colors indicating GM and GS, respectively. (B) The proportion of viruses is shown according to the viral reads assigned to each virus. The proportion of viral reads in each sample derived from GM (C) and GS (D) is displayed. The proportion of viral reads in each developmental stage derived from GM (E) and GS (F) is shown. In the bean plot, the black solid lines represent the median values, while the white lines depict individual data points.
Figure 4
Figure 4
The proportion of identified viruses in each sample is based on the FPKM (fragments per kilobase of transcript per million) value. (A) The FPKM value representing viral transcripts for each virus is shown, with blue and orange colors indicating GM and GS, respectively. (B) The proportion of identified viruses is shown by combining 40 soybean transcriptomes based on the FPKM values. The proportion of viral transcripts in each sample based on the FPKM values is displayed for GM (C) and GS (D).
Figure 5
Figure 5
The proportion of viral transcripts in each developmental stage based on the FPKM values. Violin plots display the distribution of the total amount of viral transcripts derived from four samples for each soybean developmental stage for GM (A) and GS (B). The distribution of viral transcripts in each soybean developmental stage for SMV is shown for both GM (C) and GS (D). The distribution of viral transcripts in each soybean developmental stage for CMV is shown for both GM (E) and GS (F). The distribution of viral transcripts in each soybean developmental stage for BBWV2 is shown for both GM (G) and GS (H).
Figure 6
Figure 6
The comparison of viral transcripts for viruses composed of multiple genome segments. (A) A comparison of viral transcripts among DNA segments of MDV in two different samples. (B) A comparison of viral transcripts among RNA segments of CMV in different samples. (C) A comparison of viral transcripts between two RNA segments of BBWV2 in different samples from GM. (D) A comparison of viral transcripts between two RNA segments of BBWV2 in different samples from GS.
Figure 7
Figure 7
Size distribution of assembled viral contigs from GM and GS. (A) Viral contigs assembled using the Trinity assembler. (B) Viral contigs assembled using the Trinity assembler followed by the CAP3 assembler.
Figure 8
Figure 8
Mapping results of SMV-associated reads from the GS3-AB sample to the SMV reference genome. (A) Alignment outcomes of raw data sourced from the GS3-AB sample (SRR7538229) were computed against the SMV reference genome utilizing BWA software and subsequently visualized using the IGV viewer. Within the visualization, inverted triangles colored in red and blue delineate gap regions where the reads failed to align. A focused view on the mapping results in proximity to nucleotide positions 720 (B) and 1300 (C) within the SMV reference genome is presented. (D) Zoomed-in depiction of the mapped reads, represented by bars of distinct colors. Notably, gray signifies nucleotides not found within the reference genome, indicating insertion or deletion events (indels) relative to the reference sequence. Adenine (Green), thymine (Red), cytosine (Blue), and guanine (Orange) are color-coded representations of nucleotide bases.
Figure 9
Figure 9
Visualization of contigs associated with SMV. (A) Contigs assembled from GM samples using the Trinity assembler aligned to the SMV reference genome. (B) Contigs assembled from GM samples using the Trinity assembler, followed by the CAP3 assembler, aligned to the SMV reference genome. (C) Contigs assembled from GS samples using the Trinity assembler aligned to the SMV reference genome. (D) Contigs assembled from GS samples using the Trinity assembler, followed by the CAP3 assembler, aligned to the SMV reference genome. The obtained viral contigs do not map to the nucleotide regions indicated by the red-colored bars on the SMV reference genome.

References

    1. Toomer O.T., Oviedo E.O., Ali M., Patino D., Joseph M., Frinsko M., Vu T., Maharjan P., Fallen B., Mian R. Current agronomic practices, harvest & post-harvest processing of soybeans (Glycine max)—A review. Agronomy. 2023;13:427.
    1. Elmore M.G., Groves C.L., Hajimorad M., Stewart T.P., Gaskill M.A., Wise K.A., Sikora E., Kleczewski N.M., Smith D.L., Mueller D.S. Detection and discovery of plant viruses in soybean by metagenomic sequencing. Virol. J. 2022;19:149. doi: 10.1186/s12985-022-01872-5. - DOI - PMC - PubMed
    1. Jo Y., Yoon Y.N., Jang Y.-W., Choi H., Lee Y.-H., Kim S.-M., Choi S.Y., Lee B.C., Cho W.K. Soybean Viromes in the Republic of Korea Revealed by RT-PCR and Next-Generation Sequencing. Microorganisms. 2020;8:1777. doi: 10.3390/microorganisms8111777. - DOI - PMC - PubMed
    1. Hill J.H., Whitham S.A. Advances in Virus Research. Volume 90. Elsevier; Amsterdam, The Netherlands: 2014. Control of virus diseases in soybeans; pp. 355–390. - PubMed
    1. Choi H., Jo Y., Choi S.Y., Kim S.-M., Choi Y.M., Hong J.-S., Lee B.C., Cho W.K. Evolution and Phylogeny of Soybean Mosaic Virus Based on 143 Complete Genomes. Int. J. Mol. Sci. 2022;24:22. doi: 10.3390/ijms24010022. - DOI - PMC - PubMed

LinkOut - more resources