Is artificial intelligence capable of generating hospital discharge summaries from inpatient records?

doi:10.1371/journal.pdig.0000158

. 2022 Dec 12;1(12):e0000158.

doi: 10.1371/journal.pdig.0000158. eCollection 2022 Dec.

Is artificial intelligence capable of generating hospital discharge summaries from inpatient records?

Kenichiro Ando^{1

2

3}, Takashi Okumura⁴, Mamoru Komachi¹, Hiromasa Horiguchi³, Yuji Matsumoto²

Affiliations

¹ Graduate School of Systems Design, Tokyo Metropolitan University, Tokyo, Japan.
² Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan.
³ National Hospital Organization, Tokyo, Japan.
⁴ School of Regional Innovation and Social Design Engineering, Kitami Institute of Technology, Hokkaido, Japan.

PMID: 36812600
PMCID: PMC9931331
DOI: 10.1371/journal.pdig.0000158

Is artificial intelligence capable of generating hospital discharge summaries from inpatient records?

Kenichiro Ando et al. PLOS Digit Health. 2022.

. 2022 Dec 12;1(12):e0000158.

doi: 10.1371/journal.pdig.0000158. eCollection 2022 Dec.

Authors

Kenichiro Ando^{1

2

3}, Takashi Okumura⁴, Mamoru Komachi¹, Hiromasa Horiguchi³, Yuji Matsumoto²

Affiliations

¹ Graduate School of Systems Design, Tokyo Metropolitan University, Tokyo, Japan.
² Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan.
³ National Hospital Organization, Tokyo, Japan.
⁴ School of Regional Innovation and Social Design Engineering, Kitami Institute of Technology, Hokkaido, Japan.

PMID: 36812600
PMCID: PMC9931331
DOI: 10.1371/journal.pdig.0000158

Abstract

Medical professionals have been burdened by clerical work, and artificial intelligence may efficiently support physicians by generating clinical summaries. However, whether hospital discharge summaries can be generated automatically from inpatient records stored in electronic health records remains unclear. Therefore, this study investigated the sources of information in discharge summaries. First, the discharge summaries were automatically split into fine-grained segments, such as those representing medical expressions, using a machine learning model from a previous study. Second, these segments in the discharge summaries that did not originate from inpatient records were filtered out. This was performed by calculating the n-gram overlap between inpatient records and discharge summaries. The final source origin decision was made manually. Finally, to reveal the specific sources (e.g., referral documents, prescriptions, and physician's memory) from which the segments originated, they were manually classified by consulting medical professionals. For further and deeper analysis, this study designed and annotated clinical role labels that represent the subjectivity of the expressions and builds a machine learning model to assign them automatically. The analysis results revealed the following: First, 39% of the information in the discharge summary originated from external sources other than inpatient records. Second, patient's past clinical records constituted 43%, and patient referral documents constituted 18% of the expressions derived from external sources. Third, 11% of the missing information was not derived from any documents. These are possibly derived from physicians' memories or reasoning. According to these results, end-to-end summarization using machine learning is considered infeasible. Machine summarization with an assisted post-editing process is the best fit for this problem domain.

Copyright: © 2022 Ando et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest associated with this manuscript.

Figures

**Fig 1. Proposed framework of our study.**
The colored blocks in the dummy record represent the clinical segment developed in previous study, where the sentence is split by medical sense [20].

**Fig 2. Overview of the classification model for subjectivity, clinical role, and probable label.**
Each of the three labels is defined as three tasks. Input segments are fed to UTH-BERT, and then the outputs to the specific layers. Finally, the loss scores of three tasks are calculated and combined to obtain the overall loss score.

**Fig 3. Our annotation flowchart of the source origin.**
The source origin is manually determined in two steps using pre-filtering.

**Fig 4. Origin rate of segments in discharge summaries against the inpatient records.**
Distribution of origin rates using bi-grams from the randomly sampled data. Red, blue, and gray dots are sourced, unsourced, and filtered out segments, respectively. Note that symbols and segments categorized as middle subjectivity are excluded. The y-axis values were randomly generated from a uniform distribution of visibility.

**Fig 5. Origin rate of segments in discharge summaries against the inpatient records.**
Proportion of unsourced segments appearing in manually annotated data. The y-axis is the value averaged every 0.1 steps for segments with origin rates less than 0.5, as shown in Fig 4.

**Fig 6. Breakdown of the information source in discharge summaries.**

See this image and copyright information in PMC

Cited by

Using ChatGPT for writing hospital inpatient discharge summaries - perspectives from an inpatient infectious diseases service.
Koh MCY, Ngiam JN, Oon JEL, Lum LH, Smitasin N, Archuleta S. Koh MCY, et al. BMC Health Serv Res. 2025 Feb 10;25(1):221. doi: 10.1186/s12913-025-12373-w. BMC Health Serv Res. 2025. PMID: 39924512 Free PMC article.
Automated generation of discharge summaries: leveraging large language models with clinical data.
Ganzinger M, Kunz N, Fuchs P, Lyu CK, Loos M, Dugas M, Pausch TM. Ganzinger M, et al. Sci Rep. 2025 May 12;15(1):16466. doi: 10.1038/s41598-025-01618-7. Sci Rep. 2025. PMID: 40355506 Free PMC article.
ChatGPT in Plastic and Reconstructive Surgery.
Sharma SC, Ramchandani JP, Thakker A, Lahiri A. Sharma SC, et al. Indian J Plast Surg. 2023 Aug 2;56(4):320-325. doi: 10.1055/s-0043-1771514. eCollection 2023 Aug. Indian J Plast Surg. 2023. PMID: 37705820 Free PMC article. Review.
Improving specialist palliative care discharges from hospitals and hospices to community settings: a qualitative interview study of the communication experiences of patients, carers, and primary care professionals.
Weetman K, Grimley C, Bailey C, Bernstein CJ, Dale J, Fitch A, Mitchell S, Tiwana R, MacArtney JI. Weetman K, et al. BMC Palliat Care. 2025 Jul 26;24(1):214. doi: 10.1186/s12904-025-01851-x. BMC Palliat Care. 2025. PMID: 40713648 Free PMC article.
Optimizing Discharge Summaries: A Survey of Inpatient Clinician Perspectives and the Path to Standardization.
Silver AM, Goodman LA, Burton M, Rangan P, Chadha R, Thomas AK, O'Donnell C. Silver AM, et al. J Gen Intern Med. 2025 Jul 25. doi: 10.1007/s11606-025-09740-y. Online ahead of print. J Gen Intern Med. 2025. PMID: 40711633

See all "Cited by" articles

References

1. Tomašev N, Glorot X, Rae JW, Zielinski M, Askham H, Saraiva A, et al.. A Clinically Applicable Approach to Continuous Prediction of Future Acute Kidney Injury. Nature. 2019;572(7767):116–119. doi: 10.1038/s41586-019-1390-1 - DOI - PMC - PubMed
1. Ouyang D, He B, Ghorbani A, Yuan N, Ebinger J, Langlotz CP, et al.. Video-based AI for Beat-to-beat Assessment of Cardiac Function. Nature. 2020;580(7802):252–256. doi: 10.1038/s41586-020-2145-8 - DOI - PMC - PubMed
1. Lu MY, Chen TY, Williamson DF, Zhao M, Shady M, Lipkova J, et al.. AI-based Pathology Predicts Origins for Cancers of Unknown Primary. Nature. 2021;594(7861):106–110. doi: 10.1038/s41586-021-03512-4 - DOI - PubMed
1. Frazer J, Notin P, Dias M, Gomez A, Min JK, Brock K, et al.. Disease Variant Prediction with Deep Generative Models of Evolutionary Data. Nature. 2021;599(7883):91–95. doi: 10.1038/s41586-021-04043-8 - DOI - PubMed
1. Bastani H, Drakopoulos K, Gupta V, Vlachogiannis I, Hadjicristodoulou C, Lagiou P, et al.. Efficient and Targeted COVID-19 Border Testing via Reinforcement Learning. Nature. 2021;599(7883):108–113. doi: 10.1038/s41586-021-04014-z - DOI - PubMed

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Miscellaneous
- NCI CPTAC Assay Portal

[1] Tomašev N, Glorot X, Rae JW, Zielinski M, Askham H, Saraiva A, et al.. A Clinically Applicable Approach to Continuous Prediction of Future Acute Kidney Injury. Nature. 2019;572(7767):116–119. doi: 10.1038/s41586-019-1390-1 - DOI - PMC - PubMed

[2] Tomašev N, Glorot X, Rae JW, Zielinski M, Askham H, Saraiva A, et al.. A Clinically Applicable Approach to Continuous Prediction of Future Acute Kidney Injury. Nature. 2019;572(7767):116–119. doi: 10.1038/s41586-019-1390-1 - DOI - PMC - PubMed

[3] Ouyang D, He B, Ghorbani A, Yuan N, Ebinger J, Langlotz CP, et al.. Video-based AI for Beat-to-beat Assessment of Cardiac Function. Nature. 2020;580(7802):252–256. doi: 10.1038/s41586-020-2145-8 - DOI - PMC - PubMed

[4] Ouyang D, He B, Ghorbani A, Yuan N, Ebinger J, Langlotz CP, et al.. Video-based AI for Beat-to-beat Assessment of Cardiac Function. Nature. 2020;580(7802):252–256. doi: 10.1038/s41586-020-2145-8 - DOI - PMC - PubMed

[5] Lu MY, Chen TY, Williamson DF, Zhao M, Shady M, Lipkova J, et al.. AI-based Pathology Predicts Origins for Cancers of Unknown Primary. Nature. 2021;594(7861):106–110. doi: 10.1038/s41586-021-03512-4 - DOI - PubMed

[6] Lu MY, Chen TY, Williamson DF, Zhao M, Shady M, Lipkova J, et al.. AI-based Pathology Predicts Origins for Cancers of Unknown Primary. Nature. 2021;594(7861):106–110. doi: 10.1038/s41586-021-03512-4 - DOI - PubMed

[7] Frazer J, Notin P, Dias M, Gomez A, Min JK, Brock K, et al.. Disease Variant Prediction with Deep Generative Models of Evolutionary Data. Nature. 2021;599(7883):91–95. doi: 10.1038/s41586-021-04043-8 - DOI - PubMed

[8] Frazer J, Notin P, Dias M, Gomez A, Min JK, Brock K, et al.. Disease Variant Prediction with Deep Generative Models of Evolutionary Data. Nature. 2021;599(7883):91–95. doi: 10.1038/s41586-021-04043-8 - DOI - PubMed

[9] Bastani H, Drakopoulos K, Gupta V, Vlachogiannis I, Hadjicristodoulou C, Lagiou P, et al.. Efficient and Targeted COVID-19 Border Testing via Reinforcement Learning. Nature. 2021;599(7883):108–113. doi: 10.1038/s41586-021-04014-z - DOI - PubMed

[10] Bastani H, Drakopoulos K, Gupta V, Vlachogiannis I, Hadjicristodoulou C, Lagiou P, et al.. Efficient and Targeted COVID-19 Border Testing via Reinforcement Learning. Nature. 2021;599(7883):108–113. doi: 10.1038/s41586-021-04014-z - DOI - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Is artificial intelligence capable of generating hospital discharge summaries from inpatient records?

Affiliations

Is artificial intelligence capable of generating hospital discharge summaries from inpatient records?

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

LinkOut - more resources

Full Text Sources

Miscellaneous

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Related information

LinkOut - more resources

Full Text Sources

Miscellaneous