. 2021 Jun 3;11(1):11730.

doi: 10.1038/s41598-021-90000-4.

Robust diagnostic classification via Q-learning

Victor Ardulov¹, Victor R Martinez², Krishna Somandepalli², Shuting Zheng³, Emma Salzman³, Catherine Lord⁴, Somer Bishop³, Shrikanth Narayanan²

Affiliations

¹ University of Southern California, Los Angeles, USA. ardulov@usc.edu.
² University of Southern California, Los Angeles, USA.
³ University of California San Francisco, San Francisco, USA.
⁴ University of California Los Angeles, Los Angeles, USA.

PMID: 34083579
PMCID: PMC8175431
DOI: 10.1038/s41598-021-90000-4

Robust diagnostic classification via Q-learning

Victor Ardulov et al. Sci Rep. 2021.

. 2021 Jun 3;11(1):11730.

doi: 10.1038/s41598-021-90000-4.

Authors

Victor Ardulov¹, Victor R Martinez², Krishna Somandepalli², Shuting Zheng³, Emma Salzman³, Catherine Lord⁴, Somer Bishop³, Shrikanth Narayanan²

Affiliations

¹ University of Southern California, Los Angeles, USA. ardulov@usc.edu.
² University of Southern California, Los Angeles, USA.
³ University of California San Francisco, San Francisco, USA.
⁴ University of California Los Angeles, Los Angeles, USA.

PMID: 34083579
PMCID: PMC8175431
DOI: 10.1038/s41598-021-90000-4

Abstract

Machine learning (ML) models have demonstrated the power of utilizing clinical instruments to provide tools for domain experts in gaining additional insights toward complex clinical diagnoses. In this context these tools desire two additional properties: interpretability, being able to audit and understand the decision function, and robustness, being able to assign the correct label in spite of missing or noisy inputs. This work formulates diagnostic classification as a decision-making process and utilizes Q-learning to build classifiers that meet the aforementioned desired criteria. As an exemplary task, we simulate the process of differentiating Autism Spectrum Disorder from Attention Deficit-Hyperactivity Disorder in verbal school aged children. This application highlights how reinforcement learning frameworks can be utilized to train more robust classifiers by jointly learning to maximize diagnostic accuracy while minimizing the amount of information required.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
ADI-R administration process. The parent is interviewed by a clinician. The clinicians asks open-ended questions that are tied to an item and listens to the responses from a parent. Typically the clinician is listening and asking about specific examples of the child’s behavior in relation to the item at hand. The clinician records a rating based on the presented information and can leave notes to themselves. After the interview is complete the clinician uses their recorded ratings to complete the ADI-R algorithm computing whether the child meets the instrument’s cut-off thresholds for ASD.

**Figure 2**
Distribution of demographic information: age, FSIQ and VIQ across different diagnostic conditions.

**Figure 3**
Process demonstrates how a single example is converted into masked examples. The 0s represent values that are unavailable to the classifier *a priori* and will be potentially imputed. The notation $C (\begin{matrix} m \\ n \end{matrix})$ (m choose n) represents the number of examples generated by masking n items.

**Figure 4**
F1-Score degradation as more features are masked from the inputs.

**Figure 5**
An example of how a policy updates with all possible responses from an inquiry. The top row captures the initial “empty” state of the policy, while the branch represent all of the possible state update that could occur depending on the observation made following the action taken. The column vector represents the state of the policy, or the items that the policy has information about so far. The horizontal bar chart captures the relative Q-value of each action (actions are equivalent to querying an item or making a prediction). As ADI_45 has the highest Q-value, it is the first item that is queried by the policy. The arrows capture possible responses, or observations, that the policy can have, which in turn are used to update the state. The verticle bar chart captures the current state’s predicted probabilities of ADHD and ASD respectively (*Belief*).

**Figure 6**
Importance of different items relative to each other according to different model types.

See this image and copyright information in PMC

References

1. Kononenko I. Machine learning for medical diagnosis: History, state of the art and perspective. Artif. Intell. Med. 2001;23:89–109. doi: 10.1016/S0933-3657(01)00077-X. - DOI - PubMed
1. Bellazzi R, Zupan B. Predictive data mining in clinical medicine: Current issues and guidelines. Int. J. Med. Inform. 2008;77:81–97. doi: 10.1016/j.ijmedinf.2006.11.006. - DOI - PubMed
1. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. - DOI - PMC - PubMed
1. Papernot, N., Abadi, M., Erlingsson, U., Goodfellow, I. & Talwar, K. Semi-supervised knowledge transfer for deep learning from private training data. arXiv:1610.05755 (2016).
1. Sinzig J, Walter D, Doepfner M. Attention deficit/hyperactivity disorder in children and adolescents with autism spectrum disorder: Symptom or syndrome? J. Atten. Disord. 2009;13:117–126. doi: 10.1177/1087054708326261. - DOI - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Robust diagnostic classification via Q-learning

Affiliations

Robust diagnostic classification via Q-learning

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials