Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2013;24(7):565-80.
doi: 10.1080/1062936X.2012.762425. Epub 2013 Jan 25.

Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models

Affiliations
Comparative Study

Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models

K Tanabe et al. SAR QSAR Environ Res. 2013.

Abstract

A new sensitivity analysis (SA) method for variable selection in support vector machine (SVM) was proposed to improve the performance level of the QSAR model to predict carcinogenicity based on the correlation coefficient (CC) method used in our preceding study. The performances of both methods were also compared with that of the F-score (FS) method proposed by Chang and Lin. The 911 non-congeneric chemicals were classified into 20 mutually overlapping groups according to contained substructures, and a specific SVM model created on chemicals belonging to each group was optimized by searching the best set of SVM parameters while successively omitting descriptors of lower absolute values of sensitivity, CC or FS until the maximum predictive performance was obtained. The SA method improves the overall accuracy from 80% of CC and FS to 84%, which is considerably higher than those of existing models for predicting the carcinogenicity of non-congeneric chemicals. It selects the optimum sets of effective descriptors fewer than the CC and FS methods, and is not time-consuming and can be applied to a large set of initial descriptors. It is concluded that SA is superior as a variable selection method in SVM models.

PubMed Disclaimer

Similar articles

Cited by

MeSH terms

LinkOut - more resources