Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 21:1-42.
doi: 10.1007/s10479-021-04006-2. Online ahead of print.

Handling of uncertainty in medical data using machine learning and probability theory techniques: a review of 30 years (1991-2020)

Affiliations

Handling of uncertainty in medical data using machine learning and probability theory techniques: a review of 30 years (1991-2020)

Roohallah Alizadehsani et al. Ann Oper Res. .

Abstract

Understanding the data and reaching accurate conclusions are of paramount importance in the present era of big data. Machine learning and probability theory methods have been widely used for this purpose in various fields. One critically important yet less explored aspect is capturing and analyzing uncertainties in the data and model. Proper quantification of uncertainty helps to provide valuable information to obtain accurate diagnosis. This paper reviewed related studies conducted in the last 30 years (from 1991 to 2020) in handling uncertainties in medical data using probability theory and machine learning techniques. Medical data is more prone to uncertainty due to the presence of noise in the data. So, it is very important to have clean medical data without any noise to get accurate diagnosis. The sources of noise in the medical data need to be known to address this issue. Based on the medical data obtained by the physician, diagnosis of disease, and treatment plan are prescribed. Hence, the uncertainty is growing in healthcare and there is limited knowledge to address these problems. Our findings indicate that there are few challenges to be addressed in handling the uncertainty in medical raw data and new models. In this work, we have summarized various methods employed to overcome this problem. Nowadays, various novel deep learning techniques have been proposed to deal with such uncertainties and improve the performance in decision making.

Keywords: Bayesian inference; Classification; Fuzzy systems; Machine learning; Monte Carlo simulation; Uncertainty.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interestThe authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
Having uncertainty makes the decision-making process difficult
Fig. 2
Fig. 2
Paper selection mechanism
Fig. 3
Fig. 3
Number of papers published in handling uncentainity in medical data between 1991 and 2020
Fig. 4
Fig. 4
Most common algorithms used to handle uncertainties
Fig. 5
Fig. 5
Classical backpropagation sets fixed values as weights (left) while in Bayesian neural network a distribution is assigned to each weight (right)
Fig. 6
Fig. 6
Type-3 ANFIS structure
Fig. 7
Fig. 7
Three disjoint regions of a rough set
Fig. 8
Fig. 8
Percentage of uncertainty handling algorithms used in medical field researches
Fig. 9
Fig. 9
The trade-off between data and model uncertainty

References

    1. Acharya UR, Fujita H, Oh SL, Hagiwara Y, Tan JH, Adam M. Application of deep convolutional neural network for automated detection of myocardial infarction using ECG signals. Information Sciences. 2017;415–416:190–198. doi: 10.1016/j.ins.2017.06.027. - DOI
    1. Akkoyun E, Kwon ST, Acar AC, Lee W, Baek S. Predicting abdominal aortic aneurysm growth using patient-oriented growth models with two-step Bayesian inference. Computers in Biology and Medicine. 2020;117:103620. doi: 10.1016/j.compbiomed.2020.103620. - DOI - PMC - PubMed
    1. Alizadehsani R, et al. Machine learning-based coronary artery disease diagnosis: A comprehensive review. Computers in Biology and Medicine. 2019;111:103346. doi: 10.1016/j.compbiomed.2019.103346. - DOI - PubMed
    1. Alizadehsani R, et al. A database for using machine learning and data mining techniques for coronary artery disease diagnosis. Scientific Data. 2019;6:227. doi: 10.1038/s41597-019-0206-3. - DOI - PMC - PubMed
    1. Alizadehsani, R., et al. (2019c). Model uncertainty quantification for diagnosis of each main coronary artery stenosis. Soft Computing:1–12

LinkOut - more resources