Deep Transfer Learning Across Cancer Registries for Information Extraction from Pathology Reports
- PMID: 36081613
- PMCID: PMC9450101
- DOI: 10.1109/bhi.2019.8834586
Abstract
Automated text information extraction from cancer pathology reports is an active area of research to support national cancer surveillance. A well-known challenge is how to develop information extraction tools with robust performance across cancer registries. In this study, we investigated whether transfer learning (TL) with a convolutional neural network (CNN) can facilitate cross-registry knowledge sharing. Specifically, we performed a series of experiments to determine whether a CNN trained with single-registry data is capable of transferring knowledge to another registry or whether developing a cross-registry knowledge database produces a more effective and generalizable model. Using data from two cancer registries, with primary tumor site and topography as the information extraction task of interest, we found that TL improved the classification macro F-score by 6.90% and 17.22% over the baseline single-registry models. Detailed analysis showed that the observed improvement is evident in the low-prevalence classes.
Keywords: NLP; Transfer learning; convolutional neural network; information extraction; pathology reports.
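The abstract reports gains in macro F-score and attributes them to low-prevalence classes. The sketch below illustrates why the two go together, assuming the common definition of macro F-score as the unweighted mean of per-class F1 (the abstract does not spell out the formula). The function names and the toy labels (`C50`, `C34`, used here as stand-in topography codes) are illustrative assumptions, not data or code from the study.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} for single-label multiclass predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


def micro_f1(y_true, y_pred):
    """For single-label multiclass data, micro F1 equals accuracy."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Toy example (hypothetical labels): a frequent class and a rare class.
y_true = ["C50"] * 8 + ["C34"] * 2
y_pred = ["C50"] * 10  # the model never predicts the rare class

print(per_class_f1(y_true, y_pred))
print(macro_f1(y_true, y_pred), micro_f1(y_true, y_pred))
```

In this toy case the rare class scores an F1 of zero, which pulls the macro average well below the micro average. A model that recovers even a few rare-class cases therefore moves the macro F-score substantially, which is consistent with the improvement described in the abstract.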