Generation and evaluation of artificial mental health records for Natural Language Processing

Julia Ive¹, Natalia Viani², Joyce Kam^#², Lucia Yin^#², Somain Verma^#², Stephen Puntis³, Rudolf N Cardinal^{4

5}, Angus Roberts², Robert Stewart^{2

6}, Sumithra Velupillai²

Affiliations

¹ 1Department of Computing, Imperial College London, London, SW7 2AZ UK.
² 2IoPPN, King's College London, SE5 8AF London, UK.
³ 3Department of Psychiatry, University of Oxford, Warneford Hospital, OX3 7JX Oxford, UK.
⁴ 4Department of Psychiatry, University of Cambridge, Downing Street, Cambridge, CB2 3EB UK.
⁵ 5Cambridge Biomedical Campus, Cambridgeshire and Peterborough NHS Foundation Trust, Box 190, Cambridge, CB2 0QQ UK.
⁶ 6South London and Maudsley NHS Foundation Trust, SE5 8AZ London, UK.

^# Contributed equally.

PMID: 32435697
PMCID: PMC7224173
DOI: 10.1038/s41746-020-0267-x

Generation and evaluation of artificial mental health records for Natural Language Processing

Julia Ive et al. NPJ Digit Med. 2020.

. 2020 May 14:3:69.

doi: 10.1038/s41746-020-0267-x. eCollection 2020.

Authors

Julia Ive¹, Natalia Viani², Joyce Kam^#², Lucia Yin^#², Somain Verma^#², Stephen Puntis³, Rudolf N Cardinal^{4

5}, Angus Roberts², Robert Stewart^{2

6}, Sumithra Velupillai²

Affiliations

¹ 1Department of Computing, Imperial College London, London, SW7 2AZ UK.
² 2IoPPN, King's College London, SE5 8AF London, UK.
³ 3Department of Psychiatry, University of Oxford, Warneford Hospital, OX3 7JX Oxford, UK.
⁴ 4Department of Psychiatry, University of Cambridge, Downing Street, Cambridge, CB2 3EB UK.
⁵ 5Cambridge Biomedical Campus, Cambridgeshire and Peterborough NHS Foundation Trust, Box 190, Cambridge, CB2 0QQ UK.
⁶ 6South London and Maudsley NHS Foundation Trust, SE5 8AZ London, UK.

^# Contributed equally.

PMID: 32435697
PMCID: PMC7224173
DOI: 10.1038/s41746-020-0267-x

Abstract

A serious obstacle to the development of Natural Language Processing (NLP) methods in the clinical domain is the accessibility of textual data. The mental health domain is particularly challenging, partly because clinical documentation relies heavily on free text that is difficult to de-identify completely. This problem could be tackled by using artificial medical data. In this work, we present an approach to generate artificial clinical documents. We apply this approach to discharge summaries from a large mental healthcare provider and discharge summaries from an intensive care unit. We perform an extensive intrinsic evaluation where we (1) apply several measures of text preservation; (2) measure how much the model memorises training data; and (3) estimate clinical validity of the generated text based on a human evaluation task. Furthermore, we perform an extrinsic evaluation by studying the impact of using artificial text in a downstream NLP text classification task. We found that using this artificial data as training data can lead to classification results that are comparable to the original results. Additionally, using only a small amount of information from the original data to condition the generation of the artificial data is successful, which holds promise for reducing the risk of these artificial data retaining rare information from the original data. This is an important finding for our long-term goal of being able to generate artificial clinical data that can be released to the wider research community and accelerate advances in developing computational methods that use healthcare data.

Keywords: Medical research; Scientific community.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe authors declare no competing interests.

Figures

**Fig. 1. Overview of the text generation procedure.**
Key phrases are extracted from paragraphs in the original data (genuine paragraph), and combined with clinical information (ICD-10 diagnosis code, gender and age). This is used in our text generation model, producing an artificial paragraph.

**Fig. 2. Overview of the extrinsic evaluation procedure.**
An NLP model is built on 1) genuine data and 2) artificial data. Both models are tested on real (genuine) test data. Comparing these results gives an indication of the usefulness of using artificial data for NLP model development.

**Fig. 3. Cumulative distributions (CDFs) of the TER bins for the *key*, *all*, *top+meta*, and *one+meta* sentences for *test-gen-mhr*.**
X-axis plots TER bins. Y-axis—respective cumulative frequencies of the *test-gen-mhr* sentences.

**Fig. 4. Matrix of inter-rater annotation agreement for 1K *top+meta* sentences.**
For each document, we defined A1 as the first annotator and A2 as the second annotator. Each cell in the matrix represents the number of sentences marked by an annotator with a certain category (as defined in Table 4).

See this image and copyright information in PMC

References

1. Chapman WW, et al. Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions. J. Am. Med. Info. Assoc. 2011;18:540–543. doi: 10.1136/amiajnl-2011-000465. - DOI - PMC - PubMed
1. Bachman P. An architecture for deep, hierarchical generative models. Adv. Neural Inf. Process. Syst. 2016;29:4826–4834.
1. Gulrajani, I. et al. PixelVAE: A latent variable model for natural images. In Proceedings of International Conference on Learning Representations (ICLR) (2016).
1. Jing, B., Xie, P. & Xing, E. On the automatic generation of medical imaging reports. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2577–2586 (2018).
1. Liu, P. J. Learning to write notes in electronic health records. Preprint at CoRR https://arxiv.org/abs/1808.02622 (2018).

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Generation and evaluation of artificial mental health records for Natural Language Processing

Affiliations

Generation and evaluation of artificial mental health records for Natural Language Processing

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources