Temporal Supervised Contrastive Learning for Modeling Patient Risk Progression
- PMID: 38550276
- PMCID: PMC10976929
Temporal Supervised Contrastive Learning for Modeling Patient Risk Progression
Abstract
We consider the problem of predicting how the likelihood of an outcome of interest for a patient changes over time as we observe more of the patient's data. To solve this problem, we propose a supervised contrastive learning framework that learns an embedding representation for each time step of a patient time series. Our framework learns the embedding space to have the following properties: (1) nearby points in the embedding space have similar predicted class probabilities, (2) adjacent time steps of the same time series map to nearby points in the embedding space, and (3) time steps with very different raw feature vectors map to far apart regions of the embedding space. To achieve property (3), we employ a nearest neighbor pairing mechanism in the raw feature space. This mechanism also serves as an alternative to "data augmentation", a key ingredient of contrastive learning, which lacks a standard procedure that is adequately realistic for clinical tabular data, to our knowledge. We demonstrate that our approach outperforms state-of-the-art baselines in predicting mortality of septic patients (MIMIC-III dataset) and tracking progression of cognitive impairment (ADNI dataset). Our method also consistently recovers the correct synthetic dataset embedding structure across experiments, a feat not achieved by baselines. Our ablation experiments show the pivotal role of our nearest neighbor pairing.
Keywords: contrastive learning; nearest neighbors; time series analysis.
Figures




























References
-
- Aguiar Henrique, Santos Mauro, Watkinson Peter, and Zhu Tingting. Learning of cluster-based feature importance for electronic health record time-series. In International Conference on Machine Learning, 2022.
-
- Bardes Adrien, Ponce Jean, and LeCun Yann. Vicreg: Variance-invariance-covariance regularization for self-supervised learning. arXiv preprint arXiv:2105.04906, 2021.
-
- Carr Oliver, Javer Avelino, Rockenschaub Patrick, Parsons Owen, and Durichen Robert. Longitudinal patient stratification of electronic health records with flexible adjustment for clinical outcomes. In Machine Learning for Health, 2021.
-
- Chen Ting, Kornblith Simon, Norouzi Mohammad, and Hinton Geoffrey. A simple framework for contrastive learning of visual representations. In International Conference on Machine Learning, 2020.
-
- Choi Edward, Bahadori Mohammad Taha, Sun Jimeng, Kulas Joshua, Schuetz Andy, and Stewart Walter. RETAIN: An interpretable predictive model for healthcare using reverse time attention mechanism. In Advances in Neural Information Processing Systems, 2016.
Grants and funding
LinkOut - more resources
Full Text Sources