Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy

Affiliations

¹ Global Medical Affairs, Pfizer Inc, New York, NY 10017, USA.
² Health Services Consulting Corporation, Boxborough, MA 01719, USA.
³ Fair Dynamics Consulting, SRL, Milan, Italy.
⁴ Global Statistics, Pfizer Inc, New York, NY 10017, USA.
⁵ Global Medical Affairs, Pfizer Inc, Groton, CT 06340, USA.
⁶ Global Medical Product Evaluation, Pfizer Inc, New York, NY 10017, USA.

^# Contributed equally.

PMID: 31802967
PMCID: PMC6827520
DOI: 10.2147/POR.S214412

Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy

Joe Alexander Jr et al. Pragmat Obs Res. 2019.

. 2019 Oct 31:10:67-76.

doi: 10.2147/POR.S214412. eCollection 2019.

Authors

Affiliations

¹ Global Medical Affairs, Pfizer Inc, New York, NY 10017, USA.
² Health Services Consulting Corporation, Boxborough, MA 01719, USA.
³ Fair Dynamics Consulting, SRL, Milan, Italy.
⁴ Global Statistics, Pfizer Inc, New York, NY 10017, USA.
⁵ Global Medical Affairs, Pfizer Inc, Groton, CT 06340, USA.
⁶ Global Medical Product Evaluation, Pfizer Inc, New York, NY 10017, USA.

^# Contributed equally.

PMID: 31802967
PMCID: PMC6827520
DOI: 10.2147/POR.S214412

Abstract

Purpose: Variability in patient treatment responses can be a barrier to effective care. Utilization of available patient databases may improve the prediction of treatment responses. We evaluated machine learning methods to predict novel, individual patient responses to pregabalin for painful diabetic peripheral neuropathy, utilizing an agent-based modeling and simulation platform that integrates real-world observational study (OS) data and randomized clinical trial (RCT) data.

Patients and methods: The best supervised machine learning methods were selected (through literature review) and combined in a novel way for aligning patients with relevant subgroups that best enable prediction of pregabalin responses. Data were derived from a German OS of pregabalin (N=2642) and nine international RCTs (N=1320). Coarsened exact matching of OS and RCT patients was used and a hierarchical cluster analysis was implemented. We tested which machine learning methods would best align candidate patients with specific clusters that predict their pain scores over time. Cluster alignments would trigger assignments of cluster-specific time-series regressions with lagged variables as inputs in order to simulate "virtual" patients and generate 1000 trajectory variations for given novel patients.

Results: Instance-based machine learning methods (k-nearest neighbor, supervised fuzzy c-means) were selected for quantitative analyses. Each method alone correctly classified 56.7% and 39.1% of patients, respectively. An "ensemble method" (combining both methods) correctly classified 98.4% and 95.9% of patients in the training and testing datasets, respectively.

Conclusion: An ensemble combination of two instance-based machine learning techniques best accommodated different data types (dichotomous, categorical, continuous) and performed better than either technique alone in assigning novel patients to subgroups for predicting treatment outcomes using microsimulation. Assignment of novel patients to a cluster of similar patients has the potential to improve prediction of patient outcomes for chronic conditions in which initial treatment response can be incorporated using microsimulation.

Clinical trial registries: www.clinicaltrials.gov: NCT00156078, NCT00159679, NCT00143156, NCT00553475.

Keywords: agent-based modeling and simulation; coarsened exact matching; hierarchical cluster analysis; machine learning; time series regressions.

PubMed Disclaimer

Conflict of interest statement

Birol Emir, Bruce Parsons, Stephen Watt, and Ed Whalen are employees of Pfizer. Joe Alexander Jr and Marina Brodsky were employed by Pfizer at the time the study was conducted. Roger A Edwards is an employee of Health Services Consulting Corporation who was a paid consultant by Pfizer in connection with this study and development of this manuscript. Luigi Manca, Roberto Grugni, and Gianluca Bonfanti are employees of Fair Dynamics Consulting, who were paid subcontractors to Health Services Consulting Corporation in connection with this study and the development of this manuscript. The authors report no other conflicts of interest in this work.

Figures

**Figure 1**
Simulation steps. Reproduced from Alexander J, Edwards RA, Brodsky M, et al Using time-series analysis approaches for improved prediction of pain outcomes in subgroups of patients with painful diabetic peripheral neuropathy. *PLoS One*. 2018;13(12):e0207120. Creative commons license and disclaimer available from http://creativecommons.org/licenses/by/4.0/legalcode. **Abbreviations:** OS, observational study; PDF, probability density function; RCT, randomized controlled trial.

**Figure 2**
Accuracy results for the kNN method only, SFCM method only, and the ensemble method in (A) training dataset by cluster, (B) testing dataset by cluster, and (C) overall testing and training datasets. **Abbreviations:** kNN, k-nearest neighbors; SFCM, supervised fuzzy c-means.

See this image and copyright information in PMC

References

1. Collins FS, Varmus H. A new initiative on precision medicine. N Engl J Med. 2015;372(9):793–795. doi:10.1056/NEJMp1500523 - DOI - PMC - PubMed
1. Sim I. Two ways of knowing: big data and evidence-based medicine. Ann Intern Med. 2016;164(8):562–563. doi:10.7326/M15-2970 - DOI - PubMed
1. Berwick DM, Nolan TW, Whittington J. The triple aim: care, health, and cost. Health Aff (Millwood). 2008;27(3):759–769. doi:10.1377/hlthaff.27.3.759 - DOI - PubMed
1. Amarasingham R, Patzer RE, Huesch M, Nguyen NQ, Xie B. Implementing electronic health care predictive analytics: considerations and challenges. Health Aff (Millwood). 2014;33(7):1148–1154. doi:10.1377/hlthaff.2014.0352 - DOI - PubMed
1. Hannan EL. Randomized clinical trials and observational studies: guidelines for assessing respective strengths and limitations. JACC Cardiovasc Interv. 2008;1(3):211–217. doi:10.1016/j.jcin.2008.01.008 - DOI - PubMed

Associated data

Actions
- Search in PubMed
- Search in ClinicalTrials.gov
Actions
- Search in PubMed
- Search in ClinicalTrials.gov
Actions
- Search in PubMed
- Search in ClinicalTrials.gov
Actions
- Search in PubMed
- Search in ClinicalTrials.gov

LinkOut - more resources

Full Text Sources
Medical
- ClinicalTrials.gov

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy

Affiliations

Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Associated data

LinkOut - more resources

Full Text Sources

Medical