Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Dec;42(12):5472-6.
doi: 10.1128/JCM.42.12.5472-5476.2004.

Sequencing needs for viral diagnostics

Affiliations

Sequencing needs for viral diagnostics

Shea N Gardner et al. J Clin Microbiol. 2004 Dec.

Abstract

We built a system to guide decisions regarding the amount of genomic sequencing required to develop diagnostic DNA signatures, which are short sequences that are sufficient to uniquely identify a viral species. We used our existing DNA diagnostic signature prediction pipeline, which selects regions of a target species genome that are conserved among strains of the target (for reliability, to prevent false negatives) and unique relative to other species (for specificity, to avoid false positives). We performed simulations, based on existing sequence data, to assess the number of genome sequences of a target species and of close phylogenetic relatives (near neighbors) that are required to predict diagnostic signature regions that are conserved among strains of the target species and unique relative to other bacterial and viral species. For DNA viruses such as variola (smallpox), three target genomes provide sufficient guidance for selecting species-wide signatures. Three near-neighbor genomes are critical for species specificity. In contrast, most RNA viruses require four target genomes and no near-neighbor genomes, since lack of conservation among strains is more limiting than uniqueness. Severe acute respiratory syndrome and Ebola Zaire are exceptional, as additional target genomes currently do not improve predictions, but near-neighbor sequences are urgently needed. Our results also indicate that double-stranded DNA viruses are more conserved among strains than are RNA viruses, since in most cases there was at least one conserved signature candidate for the DNA viruses and zero conserved signature candidates for the RNA viruses.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Range plot for plum pox virus, displaying results typical of many viruses. To discriminate samples in which zero NNs were used, the range is drawn as a horizontal grey line, and when n is >0, the range is drawn as a black line. The best estimate of the true value is the quality measure determined using the entire target and NN pools and is represented by a vertical black line. This best estimate plus a constant c of 20 is at the location of the vertical dashed line and was selected to indicate a reasonable distance from the true answer. The 75% quantile for each range is shown with a black, vertical tick mark.
FIG. 2.
FIG. 2.
Range plot for vaccinia, illustrating that NN sequence information is critical to eliminate signature candidates that are not species specific. Three or four each of target and NN sequences appear to be adequate for prediction of a short list of signature candidates that are suitable for laboratory screening.
FIG. 3.
FIG. 3.
Range plot for SARS, similar to that for Ebola Zaire (data not shown), illustrates that additional target sequence data do not narrow the list of signature candidates. No NN sequences available at the time of these analyses were similar enough to winnow the list of signature candidates.

References

    1. Benson, D. A., I. Karsch-Mizrachi, D. J. Lipman, J. Ostell, B. A. Rapp, and D. L. Wheeler. 2000. GenBank. Nucleic Acids Res. 28:15-18. - PMC - PubMed
    1. Fitch, J. P., B. A. Chromy, C. E. Forde, E. Garcia, S. N. Gardner, P. Gu, T. A. Kuczmarksi, C. Melius, S. L. McCutchen-Maloney, F. M. Milanovich, V. L. Motin, L. L. Ott, A. Quong, J. Quong, J. M. Rocco, T. R. Slezak, B. A. Sokhansanj, E. A. Vitalis, A. T. Zemla, and P. M. McCready. 2002. Presented at the IEEE Workshop on Genomic Signal Processing and Statistics (GENSIPS), Oct. 12-13, 2002, Raleigh, N.C.
    1. Fitch, J. P., S. N. Gardner, T. A. Kuczmarski, S. Kurtz, R. Myers, L. L. Ott, T. R. Slezak, E. A. Vitalis, A. T. Zemla, and P. M. McCready. 2002. Rapid development of nucleic acid diagnostics. Proc. IEEE 90:1708-1721.
    1. Gardner, S. N., T. A. Kuczmarski, E. A. Vitalis, and T. R. Slezak. 2003. Limitations of TaqMan PCR for detecting divergent viral pathogens illustrated by hepatitis A, B, C, and E viruses and human immunodeficiency virus. J. Clin. Microbiol. 41:2417-2427. - PMC - PubMed
    1. Giegerich, R., S. Kurtz, and J. Stoye. 2003. Efficient implementation of lazy suffix trees. Softw. Pract. Exp. 33:1035-1049.

Publication types