Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov:75:168-185.
doi: 10.1016/j.inffus.2021.05.015. Epub 2021 Jun 1.

Pay attention to doctor-patient dialogues: Multi-modal knowledge graph attention image-text embedding for COVID-19 diagnosis

Affiliations

Pay attention to doctor-patient dialogues: Multi-modal knowledge graph attention image-text embedding for COVID-19 diagnosis

Wenbo Zheng et al. Inf Fusion. 2021 Nov.

Abstract

The sudden increase in coronavirus disease 2019 (COVID-19) cases puts high pressure on healthcare services worldwide. At this stage, fast, accurate, and early clinical assessment of the disease severity is vital. In general, there are two issues to overcome: (1) Current deep learning-based works suffer from multimodal data adequacy issues; (2) In this scenario, multimodal (e.g., text, image) information should be taken into account together to make accurate inferences. To address these challenges, we propose a multi-modal knowledge graph attention embedding for COVID-19 diagnosis. Our method not only learns the relational embedding from nodes in a constituted knowledge graph but also has access to medical knowledge, aiming at improving the performance of the classifier through the mechanism of medical knowledge attention. The experimental results show that our approach significantly improves classification performance compared to other state-of-the-art techniques and possesses robustness for each modality from multi-modal data. Moreover, we construct a new COVID-19 multi-modal dataset based on text mining, consisting of 1393 doctor-patient dialogues and their 3706 images (347 X-ray + 2598 CT + 761 ultrasound) about COVID-19 patients and 607 non-COVID-19 patient dialogues and their 10754 images (9658 X-ray + 494 CT + 761 ultrasound), and the fine-grained labels of all. We hope this work can provide insights to the researchers working in this area to shift the attention from only medical images to the doctor-patient dialogue and its corresponding medical images.

Keywords: COVID-19 diagnose; Knowledge attention mechanism; Knowledge embedding; Knowledge-based representation learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Comparison about COVID-19 and non-COVID-19 dialogues.
Fig. 2
Fig. 2
An illustrative example of a multi-modal knowledge graph. From left to right, these are the four types of nodes (i.e., X-ray, CT, ultrasound, and text description of diagnose), six meta-paths involved in the given knowledge graph, a multi-modal knowledge graph.
Fig. 3
Fig. 3
Architecture of multi-modal knowledge graph model .
Fig. 4
Fig. 4
The proposed multi-modal knowledge graph attention embedding model. Given multimodal knowledge graph G, we propose the multimodal attention mechanisms including three parts: ➀ the single-level modality attention and its results denoted as {fknowledgeΦi,i=0,1,,C}; ➁ the multiple-level modality attention and its embedding denoted as fmultipleknowledge; ➂ cross-level modality attention mechanism that fuse the information of single-level modality and multiple-level modality attentions, and its the embedding matrix denoted as fknowledge. Meanwhile, we propose the Temporal Convolutional Self-Attention Network (TCSAN) to handle the inputted multimodal data and get the multimodal sentence vectors fnetwork. Then, we get the knowledge-based attention feature vector f. Finally, we use the classifier (in this paper, we use the ResNet-34 [70]) to gain the labels, i.e., yˆp.
Fig. 5
Fig. 5
The illustration of TCN with dilated causal convolutions.
Fig. 6
Fig. 6
The result of parameters experiments.
Fig. 7
Fig. 7
Scalability. The training time decreases as the number of training samples increases. Ours takes less training time to converge compared with others.
Fig. 8
Fig. 8
The result of robustness analysis. Performance modification using the altered data input over our model .
Fig. 9
Fig. 9
The confusion matrix of classifying the asymptomatic infection (COVID-19) cases and non-COVID-19 cases.
Fig. 10
Fig. 10
Error analysis.
None

Similar articles

Cited by

References

    1. van de Haar J., Hoes L.R., Coles C.E., Seamon K., Fröhling S., Jäger D., Valenza F., de Braud F., De Petris L., Bergh J., Ernberg I., Besse B., Barlesi F., Garralda E., Piris-Giménez A., Baumann M., Apolone G., Soria J.C., Tabernero J., Caldas C., Voest E.E. Caring for patients with cancer in the COVID-19 era. Nature Med. 2020;26:665–671. doi: 10.1038/s41591-020-0874-8. - DOI - PubMed
    1. Schett G., Manger B., Simon D., Caporali R. COVID-19 revisiting inflammatory pathways of arthritis. Nat. Rev. Rheumatol. 2020 doi: 10.1038/s41584-020-0451-z. doi: 10.1038/s41584-020-0451-z. URL: - DOI - DOI - PMC - PubMed
    1. Palayew A., Norgaard O., Safreed-Harmon K., Andersen T.H., Rasmussen L.N., Lazarus J.V. Pandemic publishing poses a new COVID-19 challenge. Nat. Hum. Behav. 2020 doi: 10.1038/s41562-020-0911-0. doi: 10.1038/s41562-020-0911-0. URL: - DOI - DOI - PubMed
    1. Robilotti E.V., Babady N.E., Mead P.A., Rolling T., Perez-Johnston R., Bernardes M., Bogler Y., Caldararo M., Figueroa C.J., Glickman M.S., Joanow A., Kaltsas A., Lee Y.J., Lucca A., Mariano A., Morjaria S., Nawar T., Papanicolaou G.A., Predmore J., Redelman-Sidi G., Schmidt E., Seo S.K., Sepkowitz K., Shah M.K., Wolchok J.D., Hohl T.M., Taur Y., Kamboj M. Determinants of COVID-19 disease severity in patients with cancer. Nature Med. 2020 doi: 10.1038/s41591-020-0979-0. doi: 10.1038/s41591-020-0979-0. URL: - DOI - DOI - PMC - PubMed
    1. Li Z., Zhao W., Shi F., Qi L., Xie X., Wei Y., Ding Z., Gao Y., Wu S., Shi Y., Shen D., Liu J. A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning. Med. Image Anal. 2021;69 doi: 10.1016/j.media.2021.101978. URL: https://www.sciencedirect.com/science/article/pii/S1361841521000244. - DOI - PMC - PubMed

LinkOut - more resources