Transl Vis Sci Technol. 2020 Feb 27;9(2):14.

doi: 10.1167/tvst.9.2.14.

Introduction to Machine Learning, Neural Networks, and Deep Learning

Rene Y Choi¹, Aaron S Coyner², Jayashree Kalpathy-Cramer³, Michael F Chiang¹,², J Peter Campbell¹

Affiliations

¹ Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University (OHSU), Portland, Oregon, United States.
² Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, United States.
³ Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Charlestown, Massachusetts, United States.

PMID: 32704420
PMCID: PMC7347027
DOI: 10.1167/tvst.9.2.14

Rene Y Choi et al. Transl Vis Sci Technol. 2020.

Abstract

Purpose: To present an overview of current machine learning methods and their use in medical research, focusing on select machine learning techniques, best practices, and deep learning.

Methods: A systematic literature search in PubMed was performed for articles pertinent to the topic of artificial intelligence methods used in medicine with an emphasis on ophthalmology.

Results: A review of machine learning and deep learning methodology for an audience without an extensive technical or computer programming background.

Conclusions: Artificial intelligence has a promising future in medicine; however, many challenges remain.

Translational relevance: The aim of this review article is to provide nontechnical readers with a layman's explanation of the machine learning methods being used in medicine today. The goal is to give the reader a better understanding of the potential and challenges of artificial intelligence within the field of medicine.

Keywords: artificial intelligence; deep learning; machine learning.

Conflict of interest statement

Disclosure: R.Y. Choi, None; A.S. Coyner, None; J. Kalpathy-Cramer, None; M.F. Chiang, None; J.P. Campbell, None

Figures

**Figure 1.**
Umbrella of select data science techniques. Artificial intelligence (AI) falls within the realm of data science, and includes classical programming and machine learning (ML). ML contains many models and methods, including deep learning (DL) and artificial neural networks (ANN).

**Figure 2.**
Classical programming versus machine learning paradigm. (A) In classical programming, a computer is supplied with a dataset and an algorithm. The algorithm informs the computer how to operate upon the dataset to create outputs. (B) In machine learning, a computer is supplied with a dataset and associated outputs. The computer learns and generates an algorithm that describes the relationship between the two. This algorithm can be used for inference on future datasets.
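
The contrast in this caption can be sketched in a few lines of Python. This is a minimal illustration, not taken from the article: the rule `y = 2x + 1`, the function names, and the use of ordinary least squares as the learning method are all assumptions chosen for simplicity.

```python
# Classical programming: the rule is written by hand.
# (Hypothetical rule, y = 2x + 1, chosen only for illustration.)
def classical_rule(x):
    return 2 * x + 1

# Machine learning: the rule is estimated from inputs and their known outputs.
def fit_line(xs, ys):
    """Ordinary least squares for one input variable; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

xs = [0, 1, 2, 3, 4]
ys = [classical_rule(x) for x in xs]   # outputs associated with the dataset
slope, intercept = fit_line(xs, ys)    # the "learned algorithm"

def learned_rule(x):
    # Inference: apply the learned relationship to a new input.
    return slope * x + intercept

print(slope, intercept, learned_rule(10))
```

Here the learned slope and intercept recover the hand-written rule because the training outputs were generated from it without noise; with real data the fit would only approximate the underlying relationship.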

**Figure 3.**
Sensitivity, specificity, positive predictive value, and negative predictive value. A population (dataset) is represented as circles colored *blue* if positive or *orange* if negative. The dataset is input to an algorithm that predicts each instance's class association. If an instance is correctly predicted as positive or negative, it is a true positive (TP) or true negative (TN), respectively. If an instance is incorrectly labeled positive or negative, it is a false positive (FP) or false negative (FN), respectively. (A) A model with perfect sensitivity ($\frac{TP}{TP + FN}$) and specificity ($\frac{TN}{TN + FP}$), where TP, TN, FP, and FN denote the total counts of each outcome. (B) A model with perfect sensitivity (ability to correctly classify all positive cases) but poor specificity (ability to correctly classify all negative cases), and (C) a model with perfect specificity but poor sensitivity. Although a model might have perfect sensitivity (B), it can have many false positives. Similarly, a model with perfect specificity (C) might have many false negatives. Therefore, it is also useful to evaluate the positive predictive value (PPV; $\frac{TP}{TP + FP}$) and the negative predictive value (NPV; $\frac{TN}{TN + FN}$). Unlike sensitivity and specificity, PPV and NPV also depend on the prevalence of disease in the population.
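
The four quantities in this caption can be computed directly from the counts of true and false positives and negatives. The sketch below is illustrative only: the function names and the ten-sample dataset are assumptions, and the labels are constructed to resemble panel B (perfect sensitivity, poor specificity).

```python
def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = positive, 0 = negative)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn

def diagnostic_metrics(tp, tn, fp, fn):
    # Assumes every denominator is nonzero; a real implementation should guard this.
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }

# Hypothetical data: 3 positives, 7 negatives; the model over-calls positives.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 1, 1, 1, 1, 0, 0, 0]

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
metrics = diagnostic_metrics(tp, tn, fp, fn)
print(metrics)
```

In this example every positive case is detected (sensitivity of 1), yet fewer than half of the positive predictions are correct, which is why PPV is reported alongside sensitivity.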

**Figure 4.**
Example receiver operating characteristics and precision-recall curves. *Red line*: a model that performs no better than chance has an area under the curve (AUC) of the receiver operating characteristics curve (AUROC) of 0.50 or area under the precision-recall curve (AUPR) at the class ratio (*red shaded area*). *Blue line*: a model that performs better than chance, but not perfectly, will have an AUC between 0.50 and 1.00 (*blue + red shaded areas*). *Green line*: a model that performs perfectly has an AUC of 1.00 (*red + blue + green shaded areas*).
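
One way to make the AUROC values in this caption concrete is the rank interpretation: the area under the ROC curve equals the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The sketch below uses that equivalence rather than tracing the curve; the function name and the toy scores are assumptions made for illustration.

```python
def auroc(y_true, scores):
    """AUROC via pairwise comparison of positive and negative scores."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0      # positive ranked above negative
            elif p == n:
                wins += 0.5      # tie counts as half
    return wins / (len(pos) * len(neg))

y_true = [0, 0, 1, 1]

perfect = auroc(y_true, [0.1, 0.2, 0.8, 0.9])   # every positive outranks every negative
partial = auroc(y_true, [0.1, 0.8, 0.4, 0.9])   # one positive is outranked by a negative
chance = auroc(y_true, [0.5, 0.5, 0.5, 0.5])    # uninformative, constant scores

print(perfect, partial, chance)
```

The three results correspond to the green, blue, and red lines described in the caption. The pairwise loop is quadratic in the number of samples, so it is suited to small examples only.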

**Figure 5.**
Example class probability prediction using linear and logistic regression. Presented are linear (*blue line*) and logistic (*red line*) regression models for predicting the probability of various samples (*gray circles*) as belonging to a particular class using a single variable, variable X, which ranges from -10 to 10. With logistic regression, variable X is transformed into class probabilities that are bounded between 0 and 1 using the sigmoid function. Simple linear regression attempts to estimate class probabilities, but is not bounded between 0 and 1; thus, it breaks a fundamental law of probability that does not allow for negative probabilities or those greater than 1.
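
The bounding behavior described in this caption can be checked numerically. In the sketch below the coefficients are hypothetical values picked so that the linear model leaves the interval from 0 to 1 at the ends of the range of variable X; they are not fitted to any data from the article.

```python
import math

def sigmoid(z):
    """Map any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def linear_prediction(x, slope=0.1, intercept=0.5):
    # Hypothetical coefficients; the output is not bounded.
    return slope * x + intercept

def logistic_prediction(x, slope=1.0, intercept=0.0):
    # The same linear combination, passed through the sigmoid.
    return sigmoid(slope * x + intercept)

for x in (-10, 0, 10):
    print(x, linear_prediction(x), logistic_prediction(x))
```

At X = -10 and X = 10 the linear model returns values below 0 and above 1, which cannot be probabilities, while the logistic model stays strictly between 0 and 1 across the whole range.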

**Figure 6.**
Structure of a decision tree. Splitting of the dataset begins at the root node. Each split connects to either another decision node, which results in further splitting of the data, or a terminal node that predicts the class of the data.
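
The root, decision, and terminal nodes described here map naturally onto a nested data structure. The following is a hand-built sketch, not a trained model: the feature names `x1` and `x2`, the thresholds, and the class labels are hypothetical.

```python
# Hypothetical tree; in practice the splits would be learned from data.
tree = {
    "feature": "x1", "threshold": 5.0,       # root node: first split
    "left": {"label": "negative"},           # terminal node
    "right": {                               # decision node: splits again
        "feature": "x2", "threshold": 2.0,
        "left": {"label": "negative"},       # terminal node
        "right": {"label": "positive"},      # terminal node
    },
}

def predict(node, sample):
    """Follow splits from the root until a terminal node is reached."""
    while "label" not in node:
        go_left = sample[node["feature"]] <= node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["label"]

print(predict(tree, {"x1": 7.0, "x2": 3.0}))
print(predict(tree, {"x1": 3.0, "x2": 9.0}))
```

The second sample never reaches the `x2` split, because the root node already routes it to a terminal node.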

**Figure 7.**
Components of a neural network. (A) The basis of an artificial neural network, the perceptron. This algorithm uses the sigmoid function to scale and transform multiple inputs into a single output ranging from 0 to 1. (B) An artificial neural network connects multiple perceptron units, so that the output of one unit is used as input to another. Additionally, these units are not limited to using the sigmoid activation function. (C) Examples of four different activation functions: sigmoid, hyperbolic tangent, identity, and rectified linear unit. The sigmoid scales inputs between 0 and 1 using an S-shaped curve. Similarly, the hyperbolic tangent function uses an S-shaped curve, but scales inputs between -1 and 1. The identity function can multiply its input by any number to produce a linear output. The rectified linear unit is similar to the identity function; however, all inputs < 0 are given an output value of 0. There are other activation functions outside of these, but these are arguably.
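
The unit in panel A and the four activation functions in panel C can be written out as a short sketch. The weights, bias, and inputs below are hypothetical, and the identity function here returns its input unchanged, with any scaling assumed to come from the weights.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))   # output between 0 and 1

def tanh(z):
    return math.tanh(z)                  # output between -1 and 1

def identity(z):
    return z                             # linear output

def relu(z):
    return max(0.0, z)                   # inputs below 0 become 0

def unit(inputs, weights, bias, activation=sigmoid):
    """Weighted sum of the inputs plus a bias, passed through an activation."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(z)

# Panel A: a single unit with the sigmoid activation (hypothetical weights).
single = unit([1.0, 2.0], [0.5, -0.25], 0.0)

# Panel B: the output of one unit becomes the input of another.
hidden = unit([1.0, 2.0], [1.0, 1.0], -1.0, activation=relu)
output = unit([hidden], [0.5], -1.0, activation=sigmoid)

print(single, hidden, output)
```

In the two-unit example the hidden unit uses the rectified linear unit and the output unit uses the sigmoid, which reflects the caption's point that units are not restricted to one activation function.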

**Figure 8.**
Example of a digital image convolved with a filter. The image (*left*) is transformed into the feature map (*right*) via a convolutional filter (*center*). The convolutional filter is designed to locate diagonal lines running from top left to bottom right of the image. The filter passes over the image in a specified manner and each element in the image (*red*) is multiplied by the corresponding element in the convolutional filter (*blue*). The summation of these elements (*orange*) is output into a new matrix that reports the presence of a diagonal line. The feature map indicates 2 when the specified diagonal line is found, 1 if a portion of it is found, and 0 if none of it is found.
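
The multiply-and-sum operation in this caption can be reproduced on a small example. The sketch assumes a binary image, a 2 x 2 filter with ones on the top-left to bottom-right diagonal, a stride of 1, and no padding; these sizes are assumptions chosen to match the feature map values of 2, 1, and 0 mentioned in the caption, not the exact matrices in the figure.

```python
def convolve2d(image, kernel):
    """Slide the kernel over the image; multiply element-wise and sum.

    Stride 1, no padding, and no kernel flipping, as is common in
    deep learning usage of the term "convolution".
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    feature_map = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            total = sum(
                image[i + a][j + b] * kernel[a][b]
                for a in range(kh)
                for b in range(kw)
            )
            row.append(total)
        feature_map.append(row)
    return feature_map

# Hypothetical 3 x 3 image containing one short diagonal line.
image = [
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 0],
]

# Filter that responds to top-left to bottom-right diagonals.
kernel = [
    [1, 0],
    [0, 1],
]

feature_map = convolve2d(image, kernel)
print(feature_map)
```

The top-left position of the feature map contains the full diagonal, the bottom-right position overlaps only one of its pixels, and the remaining positions contain none of it.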
