Multimodal automatic assessment of acute pain through facial videos and heart rate signals utilizing transformer-based architectures

Stefanos Gkikas et al. Front Pain Res (Lausanne). 2024 Mar 27;5:1372814. doi: 10.3389/fpain.2024.1372814. eCollection 2024.

Abstract

Accurate and objective pain evaluation is crucial in developing effective pain management protocols, aiming to alleviate distress and prevent patients from experiencing decreased functionality. This study introduces a multimodal automatic assessment framework for acute pain utilizing video and heart rate signals. The proposed framework comprises four pivotal modules: the Spatial Module, responsible for extracting embeddings from videos; the Heart Rate Encoder, tasked with mapping heart rate signals into a higher-dimensional space; the AugmNet, designed to create learning-based augmentations in the latent space; and the Temporal Module, which utilizes the extracted video and heart rate embeddings for the final assessment. The Spatial Module undergoes a two-stage pre-training strategy: first with a face recognition objective to learn universal facial features, and second with an emotion recognition objective in a multitask learning approach, enabling the extraction of high-quality embeddings for automatic pain assessment. Experiments with the facial videos and the heart rate extracted from electrocardiograms of the BioVid database, along with a direct comparison to 29 studies, demonstrate state-of-the-art performance in both unimodal and multimodal settings while maintaining high efficiency. In the multimodal setting, accuracies of 82.74% and 39.77% were achieved for the binary and multi-level pain classification tasks, respectively, with 9.62 million parameters for the entire framework.
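As a rough illustration of how the four modules named in the abstract could fit together, the PyTorch-style sketch below wires a Spatial Module, Heart Rate Encoder, AugmNet, and Temporal Module into one classifier. All internals, dimensions, and the additive fusion strategy are assumptions for illustration; they are not the authors' implementation.

```python
# Hypothetical sketch of the four-module framework; module names follow the paper,
# but every architectural detail below is an illustrative guess.
import torch
import torch.nn as nn


class SpatialModule(nn.Module):
    """Per-frame embedding extractor (stand-in for the pre-trained vision transformer)."""
    def __init__(self, embed_dim=256):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, embed_dim),
        )

    def forward(self, frames):                    # frames: (B, T, 3, H, W)
        b, t = frames.shape[:2]
        x = self.backbone(frames.flatten(0, 1))   # (B*T, D)
        return x.view(b, t, -1)                   # (B, T, D)


class HeartRateEncoder(nn.Module):
    """Maps a heart-rate sequence into the same embedding space as the video frames."""
    def __init__(self, embed_dim=256):
        super().__init__()
        self.proj = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, embed_dim))

    def forward(self, hr):                        # hr: (B, T, 1)
        return self.proj(hr)                      # (B, T, D)


class AugmNet(nn.Module):
    """Learning-based augmentation applied directly in the latent space."""
    def __init__(self, embed_dim=256):
        super().__init__()
        self.perturb = nn.Linear(embed_dim, embed_dim)

    def forward(self, z):
        return z + 0.1 * torch.tanh(self.perturb(z))  # small learned perturbation


class TemporalModule(nn.Module):
    """Transformer over the (fused) embedding sequence, ending in a classifier head."""
    def __init__(self, embed_dim=256, num_classes=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=embed_dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, z):                         # z: (B, T, D)
        return self.head(self.encoder(z).mean(dim=1))


class PainAssessmentFramework(nn.Module):
    def __init__(self, embed_dim=256, num_classes=2):
        super().__init__()
        self.spatial = SpatialModule(embed_dim)
        self.hr_encoder = HeartRateEncoder(embed_dim)
        self.augm = AugmNet(embed_dim)
        self.temporal = TemporalModule(embed_dim, num_classes)

    def forward(self, frames, hr):
        z = self.spatial(frames) + self.hr_encoder(hr)  # simple additive fusion (assumed)
        return self.temporal(self.augm(z))


if __name__ == "__main__":
    model = PainAssessmentFramework()
    logits = model(torch.randn(2, 8, 3, 112, 112), torch.randn(2, 8, 1))
    print(logits.shape)  # torch.Size([2, 2])
```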

Keywords: ECG; data fusion; deep learning; pain recognition; vision transformer.


Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
ECG signal preprocessing stages (43). (1st row) Raw ECG signal. (2nd row, left) Signal after band-pass filtering (BPF) to isolate the frequency range of interest. (2nd row, right) Signal after derivative filtering to highlight the QRS complex. (3rd row, left) Squared signal to accentuate dominant peaks. (3rd row, right) Moving-window average applied to the squared signal, illustrating the final signal with the identified R peaks, noise level, signal level, and adaptive threshold.
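The chain in Figure 1 (band-pass filtering, derivative filtering, squaring, and moving-window integration with an adaptive threshold) corresponds to a Pan-Tompkins-style QRS detection pipeline. A minimal NumPy/SciPy sketch of those stages is shown below; the cut-off frequencies, window length, and thresholding rule are assumptions, not the settings used in the paper.

```python
# Illustrative Pan-Tompkins-style ECG preprocessing; all filter settings are assumptions.
import numpy as np
from scipy.signal import butter, filtfilt, find_peaks


def preprocess_ecg(ecg, fs=250.0):
    """Return the integrated signal and detected R-peak indices."""
    # 1) Band-pass filter to isolate the QRS frequency band (~5-15 Hz assumed).
    b, a = butter(2, [5.0 / (fs / 2), 15.0 / (fs / 2)], btype="band")
    filtered = filtfilt(b, a, ecg)

    # 2) Derivative filter to emphasise the steep QRS slopes.
    derivative = np.diff(filtered, prepend=filtered[0])

    # 3) Squaring to make all samples positive and accentuate dominant peaks.
    squared = derivative ** 2

    # 4) Moving-window integration (window of ~150 ms assumed).
    window = int(0.150 * fs)
    integrated = np.convolve(squared, np.ones(window) / window, mode="same")

    # 5) Simple adaptive threshold from the running signal/noise estimate.
    threshold = 0.5 * integrated.mean() + 0.5 * integrated.std()
    r_peaks, _ = find_peaks(integrated, height=threshold, distance=int(0.25 * fs))
    return integrated, r_peaks
```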
Figure 2
Overview of the proposed framework for automatic pain assessment. (A) Video analysis pipeline. (B) ECG analysis pipeline. (C) Fusion analysis pipeline.
Figure 3
Comparison of average accuracy and inference time for unimodal and multimodal methodologies across NP vs. P4 and MC tasks. Note: The plot employs a dual-y-axis format (left for accuracy, right for time) to illustrate the relation between performance and efficiency, with methodologies listed on the x-axis.
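A dual-y-axis comparison of this kind can be reproduced with matplotlib's twinx(); the sketch below is only a plotting illustration, and all numbers in it are placeholders rather than the results reported in the paper.

```python
# Sketch of a dual-y-axis accuracy-vs-inference-time plot; values are placeholders.
import matplotlib.pyplot as plt

methods = ["Unimodal video", "Unimodal HR", "Multimodal"]
accuracy = [78.0, 70.0, 82.0]        # % (placeholder numbers)
inference_ms = [12.0, 3.0, 15.0]     # ms (placeholder numbers)

fig, ax_acc = plt.subplots()
ax_time = ax_acc.twinx()             # second y-axis sharing the same x-axis

ax_acc.bar(methods, accuracy, color="steelblue", alpha=0.7)
ax_time.plot(methods, inference_ms, color="darkred", marker="o")

ax_acc.set_xlabel("Methodology")
ax_acc.set_ylabel("Accuracy (%)")
ax_time.set_ylabel("Inference time (ms)")
plt.tight_layout()
plt.show()
```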
Figure 4
(A) Attention maps from the Spatial-Module. (B) Attention maps from the Temporal-Module. Yellow and red colors indicate regions receiving high attention. (A) (1st row) Original frame sequence. (2nd row) Computed from the Spatial-Module following the first-stage pretraining. (3rd row) Computed from the Spatial-Module following the second-stage pretraining. (4th row) Computed from the Spatial-Module trained on BioVid. (B) (1st row) Computed from the Temporal-Module with video embedding. (2nd row) Computed from the Temporal-Module with heart rate embedding. (3rd row) Computed from the Temporal-Module with fused (video & heart rate) embedding.
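Attention maps like those in Figure 4 are typically obtained by averaging one transformer layer's attention weights over heads and projecting the patch-token attention back onto the image grid. The generic sketch below illustrates that idea for a ViT-style encoder with a CLS token; the token layout and shapes are assumptions, and this is not the authors' code.

```python
# Generic sketch: turning ViT attention weights into a spatial attention heatmap.
# The CLS-token convention and 14x14 patch grid are assumptions about a typical ViT.
import torch
import torch.nn.functional as F


def attention_map(attn_weights, grid_size, image_size):
    """attn_weights: (num_heads, tokens, tokens) from one transformer layer,
    where token 0 is the CLS token and the remaining tokens are image patches."""
    cls_to_patches = attn_weights.mean(dim=0)[0, 1:]           # average heads, CLS row
    grid = cls_to_patches.reshape(1, 1, grid_size, grid_size)  # back to the patch grid
    heat = F.interpolate(grid, size=image_size, mode="bilinear", align_corners=False)
    heat = heat.squeeze()
    return (heat - heat.min()) / (heat.max() - heat.min() + 1e-8)  # normalise to [0, 1]


# Example with random weights: 4 heads, 1 CLS + 14x14 patch tokens, 224x224 frame.
dummy = torch.softmax(torch.randn(4, 197, 197), dim=-1)
print(attention_map(dummy, grid_size=14, image_size=(224, 224)).shape)  # (224, 224)
```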
Figure A1
Attention maps from the Spatial-Module. Yellow and red colors indicate regions receiving high attention. (1st row) Original frame sequence. (2nd row) Computed from the Spatial-Module following the first-stage pretraining. (3rd row) Computed from the Spatial-Module following the second-stage pretraining. (4th row) Computed from the Spatial-Module trained on BioVid.
Figure A2
Attention maps from the Spatial-Module. Yellow and red colors indicate regions receiving high attention. (1st row) Original frame sequence. (2nd row) Computed from the Spatial-Module following the first-stage pretraining. (3rd row) Computed from the Spatial-Module following the second-stage pretraining. (4th row) Computed from the Spatial-Module trained on BioVid.


References

    1. Williams ACDC, Craig KD. Updating the definition of pain. Pain. 2016;157(11):2420–3. doi: 10.1097/j.pain.0000000000000613.
    2. Khalid S, Tubbs RS. Neuroanatomy, neuropsychology of pain. Cureus. 2017;9(10). doi: 10.7759/CUREUS.1754.
    3. Turk DC, Melzack R. The measurement of pain and the assessment of people experiencing pain. In: Handbook of Pain Assessment. The Guilford Press; 2011. p. 3–16.
    4. Sinatra R. Causes and consequences of inadequate management of acute pain. Pain Med. 2010;11(12):1859–71. doi: 10.1111/j.1526-4637.2010.00983.x.
    5. De Ruddere L, Tait R. Facing Others in Pain: Why Context Matters. Cham: Springer International Publishing; 2018. p. 241–69.
