Transparent Quality Optimization for Machine Learning-Based Regression in Neurology
- PMID: 35743693
- PMCID: PMC9224715
- DOI: 10.3390/jpm12060908
Abstract
The clinical monitoring of walking generates enormous amounts of data that contain extremely valuable information. Machine learning (ML) has therefore rapidly entered the research arena as a means to analyze large heterogeneous datasets and make predictions from them. As such data-driven, ML-based applications become increasingly applicable across domains, their software qualities are coming into focus. This work provides a proof of concept for applying state-of-the-art ML technology to predict the distance travelled in the 2-min walk test, an important neurological measurement that indicates walking endurance. A transparent, lean approach was emphasized to optimize the results in an explainable way while meeting the specified software requirements for a generic approach. The general-purpose strategy combines a fractional-factorial design benchmark with standardized quality metrics, based on a minimal technology build and resulting in an optimized software prototype. Based on 400 training and 100 validation data points, the prediction achieved a relative error of 6.1%, distributed over multiple experiments with an optimized configuration. The Adadelta algorithm (LR=0.000814, fModelSpread=5, nModelDepth=6, nepoch=1000) performed best, with 90% of the predictions having an absolute error of <15 m. Factors such as gender, age, disease duration, or use of walking aids showed no effect on the relative error. For multiple sclerosis patients with high walking impairment (EDSS Ambulation Score ≥6), the relative difference was significant (n=30; 24.0%; p<0.050). The results show that it is possible to create a transparently working ML prototype for a given medical use case while meeting certain software qualities.
Keywords: deep learning; fractional factorial design benchmark; inertial measurement units; machine learning; multiple sclerosis; software quality.
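The two quality figures reported in the abstract, the relative error and the share of predictions with an absolute error below 15 m, can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so this assumes the relative error is the mean of |predicted − measured| / measured. The function names and the sample distances are hypothetical and are not study data.

```python
from statistics import mean


def mean_relative_error(predicted, measured):
    """Mean of |predicted - measured| / measured, returned as a fraction.

    Assumed definition; the abstract does not state the formula used.
    """
    return mean(abs(p - m) / m for p, m in zip(predicted, measured))


def share_within(predicted, measured, tolerance_m=15.0):
    """Fraction of predictions whose absolute error is below tolerance_m metres."""
    errors = [abs(p - m) for p, m in zip(predicted, measured)]
    return sum(e < tolerance_m for e in errors) / len(errors)


# Made-up 2-min walk distances in metres, for illustration only.
measured = [100.0, 150.0, 200.0, 120.0]
predicted = [110.0, 147.0, 180.0, 126.0]

print(f"relative error: {mean_relative_error(predicted, measured):.2%}")
print(f"share with error < 15 m: {share_within(predicted, measured):.0%}")
```

Reporting both values together is useful because a low mean relative error can hide a few large misses, which the thresholded share makes visible.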
Conflict of interest statement
The authors declare no conflict of interest.
