Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 1;22(3):457-470.
doi: 10.1287/ijoc.1090.0360.

Least-Squares Support Vector Machine Approach to Viral Replication Origin Prediction

Affiliations

Least-Squares Support Vector Machine Approach to Viral Replication Origin Prediction

Raul Cruz-Cano et al. INFORMS J Comput. .

Abstract

Replication of their DNA genomes is a central step in the reproduction of many viruses. Procedures to find replication origins, which are initiation sites of the DNA replication process, are therefore of great importance for controlling the growth and spread of such viruses. Existing computational methods for viral replication origin prediction have mostly been tested within the family of herpesviruses. This paper proposes a new approach by least-squares support vector machines (LS-SVMs) and tests its performance not only on the herpes family but also on a collection of caudoviruses coming from three viral families under the order of caudovirales. The LS-SVM approach provides sensitivities and positive predictive values superior or comparable to those given by the previous methods. When suitably combined with previous methods, the LS-SVM approach further improves the prediction accuracy for the herpesvirus replication origins. Furthermore, by recursive feature elimination, the LS-SVM has also helped find the most significant features of the data sets. The results suggest that the LS-SVMs will be a highly useful addition to the set of computational tools for viral replication origin prediction and illustrate the value of optimization-based computing techniques in biomedical applications.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Replication Origins of Herpesviruses (left) and Caudoviruses (right). The numbers on top of the Caudovirus DNA are the last 4 digits of their Accession Numbers.
Figure 2
Figure 2
The Sensitivity (top) and PPV (bottom) for the herpesviruses (left) and caudoviruses (right) are compared for the BWS1 method, the ANN method, LS-SVM with 23 variables, LS-SVM using the dinucleotides, and LS-SVM with 23 variables for the artificial genomes.

Similar articles

Cited by

References

    1. Abe S. Support Vector Machines for Pattern Classification (Advances in Pattern Recognition) Springer-Verlag New York, Inc.; Secaucus, NJ, USA: 2005.
    1. Abe S. ANNPR ’08: Proceedings of the 3rd IAPR workshop on Artificial Neural Networks in Pattern Recognition. Springer-Verlag, Berlin, Heidelberg; 2008. Sparse least squares support vector machines by forward selection based on linear discriminant analysis; pp. 54–65.
    1. Aizenberg I, Moraga C. Multilayer feedforward neural network based on multi-valued neurons (mlmvn) and a backpropagation learning algorithm. Soft Comput. 2006;11:169–183.
    1. Babuska R. Fuzzy Systems in Medicine, Studies in Fuzziness and Soft Computing. vol. 41. Physica-Verlag; 2000. Fuzzy clustering algorithms with applications to rule extraction; pp. 139–173.
    1. Borin A, Ferro MF, Mello C, Cordi L, Pataca LCM, Durn N, Poppi RJ. Quantification of lactobacillus in fermented milk by multivariate image analysis with least-squares support-vector machines. Anal Bioanal Chem. 2007;387:1105–1112. - PubMed

LinkOut - more resources