A Masked Language Model for Multi-Source EHR Trajectories Contextual Representation Learning
- PMID: 37203760
- DOI: 10.3233/SHTI230217
Abstract
Using electronic health records (EHR) data and machine learning to guide future decisions requires addressing several challenges, including 1) long- and short-term dependencies and 2) interactions between diseases and interventions. Bidirectional transformers have effectively addressed the first challenge. Here we tackle the second by masking one source (e.g., ICD10 codes) and training the transformer to predict it from the other sources (e.g., ATC codes).
Keywords: Masked language model; deep learning; disease prediction; electronic health records; patient trajectories; representation learning.
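The source-masking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `mask_source`, the `[MASK]` symbol, the flat list layout, and the choice to mask every token of the target source (rather than a sampled fraction) are all assumptions made for illustration. The example codes are ordinary ICD10 and ATC codes chosen only as sample data.

```python
from typing import List, Optional, Tuple

MASK_TOKEN = "[MASK]"  # hypothetical mask symbol, not taken from the paper


def mask_source(
    codes: List[str],
    sources: List[str],
    target_source: str,
    mask_token: str = MASK_TOKEN,
) -> Tuple[List[str], List[Optional[str]]]:
    """Hide every code from one source and keep it as a prediction label.

    codes[i] is a medical code and sources[i] names the source it came from
    (e.g. "ICD10" or "ATC"). Positions belonging to target_source are replaced
    by mask_token in the model input; their original codes become the labels.
    All other positions keep their code and get a label of None (ignored).
    """
    if len(codes) != len(sources):
        raise ValueError("codes and sources must have the same length")

    inputs: List[str] = []
    labels: List[Optional[str]] = []
    for code, source in zip(codes, sources):
        if source == target_source:
            inputs.append(mask_token)  # the model must reconstruct this code
            labels.append(code)
        else:
            inputs.append(code)        # visible context from the other source
            labels.append(None)
    return inputs, labels


# Illustrative trajectory: diagnoses (ICD10) interleaved with drugs (ATC).
codes = ["I10", "C09AA02", "E11", "A10BA02"]
sources = ["ICD10", "ATC", "ICD10", "ATC"]

inputs, labels = mask_source(codes, sources, target_source="ICD10")
print(inputs)
print(labels)
```

In this sketch a transformer would receive `inputs` and be trained to recover the non-`None` entries of `labels`, so the diagnosis codes must be inferred from the medication codes alone.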