Investigation of expert rule bases, logistic regression, and non-linear machine learning techniques for predicting response to antiretroviral treatment
- PMID: 19474477
Investigation of expert rule bases, logistic regression, and non-linear machine learning techniques for predicting response to antiretroviral treatment
Abstract
Background: The extreme flexibility of the HIV type-1 (HIV-1) genome makes it challenging to build the ideal antiretroviral treatment regimen. Interpretation of HIV-1 genotypic drug resistance is evolving from rule-based systems guided by expert opinion to data-driven engines developed through machine learning methods.
Methods: The aim of the study was to investigate linear and non-linear statistical learning models for classifying short-term virological outcome of antiretroviral treatment. To optimize the model, different feature selection methods were considered. Robust extra-sample error estimation and different loss functions were used to assess model performance. The results were compared with widely used rule-based genotypic interpretation systems (Stanford HIVdb, Rega and ANRS).
Results: A set of 3,143 treatment change episodes were extracted from the EuResist database. The dataset included patient demographics, treatment history and viral genotypes. A logistic regression model using high order interaction variables performed better than rule-based genotypic interpretation systems (accuracy 75.63% versus 71.74-73.89%, area under the receiver operating characteristic curve [AUC] 0.76 versus 0.68-0.70) and was equivalent to a random forest model (accuracy 76.16%, AUC 0.77). However, when rule-based genotypic interpretation systems were coupled with additional patient attributes, and the combination was provided as input to the logistic regression model, the performance increased significantly, becoming comparable to the fully data-driven methods.
Conclusions: Patient-derived supplementary features significantly improved the accuracy of the prediction of response to treatment, both with rule-based and data-driven interpretation systems. Fully data-driven models derived from large-scale data sources show promise as antiretroviral treatment decision support tools.
Similar articles
-
Rules-based HIV-1 genotypic resistance interpretation systems predict 8 week and 24 week virological antiretroviral treatment outcome and benefit from drug potency weighting.J Antimicrob Chemother. 2009 Sep;64(3):616-24. doi: 10.1093/jac/dkp252. Epub 2009 Jul 19. J Antimicrob Chemother. 2009. PMID: 19620134
-
A comparison of three computational modelling methods for the prediction of virological response to combination HIV therapy.Artif Intell Med. 2009 Sep;47(1):63-74. doi: 10.1016/j.artmed.2009.05.002. Epub 2009 Jun 12. Artif Intell Med. 2009. PMID: 19524413
-
Predicting the response to combination antiretroviral therapy: retrospective validation of geno2pheno-THEO on a large clinical database.J Infect Dis. 2009 Apr 1;199(7):999-1006. doi: 10.1086/597305. J Infect Dis. 2009. PMID: 19239365
-
Predicting response to antiretroviral treatment by machine learning: the EuResist project.Intervirology. 2012;55(2):123-7. doi: 10.1159/000332008. Epub 2012 Jan 24. Intervirology. 2012. PMID: 22286881 Review.
-
Computational models for prediction of response to antiretroviral therapies.AIDS Rev. 2012 Apr-Jun;14(2):145-53. AIDS Rev. 2012. PMID: 22627610 Review.
Cited by
-
Clinical evaluation of Rega 8: an updated genotypic interpretation system that significantly predicts HIV-therapy response.PLoS One. 2013 Apr 17;8(4):e61436. doi: 10.1371/journal.pone.0061436. Print 2013. PLoS One. 2013. PMID: 23613852 Free PMC article.
-
A Rough Set-Based Model of HIV-1 Reverse Transcriptase Resistome.Bioinform Biol Insights. 2009 Oct 5;3:109-27. doi: 10.4137/bbi.s3382. Bioinform Biol Insights. 2009. PMID: 20140064 Free PMC article.
-
Computational analysis of neonatal ventilator waveforms and loops.Pediatr Res. 2021 May;89(6):1432-1441. doi: 10.1038/s41390-020-01301-9. Epub 2020 Dec 7. Pediatr Res. 2021. PMID: 33288876 Free PMC article.
-
The individualized genetic barrier predicts treatment response in a large cohort of HIV-1 infected patients.PLoS Comput Biol. 2013;9(8):e1003203. doi: 10.1371/journal.pcbi.1003203. Epub 2013 Aug 29. PLoS Comput Biol. 2013. PMID: 24009493 Free PMC article.
-
Machine Learning Techniques for Classifying the Mutagenic Origins of Point Mutations.Genetics. 2020 May;215(1):25-40. doi: 10.1534/genetics.120.303093. Epub 2020 Mar 19. Genetics. 2020. PMID: 32193188 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Other Literature Sources
Medical