Unsupervised brain imaging 3D anomaly detection and segmentation with transformers

doi:10.1016/j.media.2022.102475

. 2022 Jul:79:102475.

doi: 10.1016/j.media.2022.102475. Epub 2022 May 4.

Unsupervised brain imaging 3D anomaly detection and segmentation with transformers

Walter H L Pinaya¹, Petru-Daniel Tudosiu², Robert Gray³, Geraint Rees⁴, Parashkev Nachev³, Sebastien Ourselin², M Jorge Cardoso²

Affiliations

¹ Department of Biomedical Engineering, School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK. Electronic address: walter.diaz_sanz@kcl.ac.uk.
² Department of Biomedical Engineering, School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK.
³ UCL Queen Square Institute of Neurology, University College London, London, UK.
⁴ UCL Faculty of Life Sciences, University College London, London, UK.

PMID: 35598520
PMCID: PMC10108352
DOI: 10.1016/j.media.2022.102475

Unsupervised brain imaging 3D anomaly detection and segmentation with transformers

Walter H L Pinaya et al. Med Image Anal. 2022 Jul.

. 2022 Jul:79:102475.

doi: 10.1016/j.media.2022.102475. Epub 2022 May 4.

Authors

Walter H L Pinaya¹, Petru-Daniel Tudosiu², Robert Gray³, Geraint Rees⁴, Parashkev Nachev³, Sebastien Ourselin², M Jorge Cardoso²

Affiliations

¹ Department of Biomedical Engineering, School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK. Electronic address: walter.diaz_sanz@kcl.ac.uk.
² Department of Biomedical Engineering, School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK.
³ UCL Queen Square Institute of Neurology, University College London, London, UK.
⁴ UCL Faculty of Life Sciences, University College London, London, UK.

PMID: 35598520
PMCID: PMC10108352
DOI: 10.1016/j.media.2022.102475

Abstract

Pathological brain appearances may be so heterogeneous as to be intelligible only as anomalies, defined by their deviation from normality rather than any specific set of pathological features. Amongst the hardest tasks in medical imaging, detecting such anomalies requires models of the normal brain that combine compactness with the expressivity of the complex, long-range interactions that characterise its structural organisation. These are requirements transformers have arguably greater potential to satisfy than other current candidate architectures, but their application has been inhibited by their demands on data and computational resources. Here we combine the latent representation of vector quantised variational autoencoders with an ensemble of autoregressive transformers to enable unsupervised anomaly detection and segmentation defined by deviation from healthy brain imaging data, achievable at low computational cost, within relative modest data regimes. We compare our method to current state-of-the-art approaches across a series of experiments with 2D and 3D data involving synthetic and real pathological lesions. On real lesions, we train our models on 15,000 radiologically normal participants from UK Biobank and evaluate performance on four different brain MR datasets with small vessel disease, demyelinating lesions, and tumours. We demonstrate superior anomaly detection performance both image-wise and pixel/voxel-wise, achievable without post-processing. These results draw attention to the potential of transformers in this most challenging of imaging tasks.

Keywords: Anomaly detection; Transformer; Unsupervised anomaly segmentation; Vector quantized variational autoencoder.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image, graphical abstract — **Graphical abstract**

**Fig. 1**
Our method uses a VQ-VAE to learn the latent discrete representation of brain data. This latent representation is transformed into a 1D sequence that is learned by the autoregressive transformer.

**Fig. 2**
Anomaly segmentation method. A) the sequence obtained from the VQ-VAE is fed to the transformer with an “begin of sentence” token prepended. For each position of the sequence, the transformer will predict the value of the next element. Using the output probability of each real value, we apply a threshold (in this example, we use a threshold of 0.05) to identify which one is anomalous. A binary mask (the “resampling mask”) is created to indicate which value is below the threshold and should be corrected. B) For each position indicated in the resampling mask, we use the transformer to obtain values that have a higher probability of occurrence and we create a healed sequence. C) The healed 1-dimensional sequence is reshaped and processed by the VQ-VAE decoder to create a reconstruction without anomalies.

**Fig. 3**
Using the spatial information from the resampling mask to improve segmentation. First, we reshape the resampling mask back to the format of the VQ-VAE latent space. Then, we upsample it to have the input image shape and we smooth it with a Gaussian filter. Finally, we use this mask to filter the residuals maps obtained from the difference between the inputted image and its healed reconstruction.

**Fig. 4**
To predict the probability of the value in the red square, the transformer using the ordering of the left image (raster ordering, left → right, top → bottom) mostly uses the information of the image background as context (blue squares). If the transformer uses the ordering of the right image (raster ordering, right → left, bottom → top), it will have a richer context, with more information about the brain, that could help make a more accurate prediction about the value in the red square.

**Fig. 5**
Residual maps on the synthetic examples from the variational autoencoder and different steps of our approach.

**Fig. 6**
Different orderings used to transform the 2D latent representation into a 1D sequence.

**Fig. 7**
Performance with synthetic anomalies with different intensity values. We also performed the analysis including an additive Gaussian noise into the anomalies. The performance is measure by the best achievable DICE-score.

**Fig. 8**
Log-likelihood distribution of the classes of examples evaluated by our ensemble of models, in-distribution, near out-of-distribution (near OOD), and far out-of-distribution (far OOD). The model assigned higher log-likelihoods for examples similar to the training set, intermediary values for examples with small synthetic lesions and lower values for examples of different classes.

**Fig. 9**
Residual maps on the real lesions from the variational autoencoder, the f-AnoGAN, and our transformer-based method.

**Fig. 10**
Anomaly detection image-wise on 3D data. In this experiment, we use the log-likelihood obtained from the transformers and the lesion size from the binary mask predicted by our models to train a one-class SVM and classify subjects with multiple sclerosis diagnosis in their records as out of distribution.

See this image and copyright information in PMC

Cited by

Evaluating the use of synthetic T1-w images in new T2 lesion detection in multiple sclerosis.
Valencia L, Clèrigues A, Valverde S, Salem M, Oliver A, Rovira À, Lladó X. Valencia L, et al. Front Neurosci. 2022 Sep 29;16:954662. doi: 10.3389/fnins.2022.954662. eCollection 2022. Front Neurosci. 2022. PMID: 36248650 Free PMC article.
Geometry-invariant abnormality detection.
Patel A, Tudosiu PD, Pinaya WHL, Adeleke O, Cook G, Goh V, Ourselin S, Cardoso MJ. Patel A, et al. Med Image Comput Comput Assist Interv. 2023 Jan 10;2023:300-309. doi: 10.1007/978-3-031-43907-0_29. Med Image Comput Comput Assist Interv. 2023. PMID: 39206415 Free PMC article.
Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection.
Patel A, Tudosiu PD, Pinaya WHL, Cook G, Goh V, Ourselin S, Cardoso MJ. Patel A, et al. Deep Gener Model (2022). 2022;13609:14-23. doi: 10.1007/978-3-031-18576-2_2. Epub 2022 Oct 8. Deep Gener Model (2022). 2022. PMID: 39404690 Free PMC article.
Self-Supervised Anomaly Detection from Anomalous Training Data via Iterative Latent Token Masking.
Patel A, Tudosiu PD, Pinaya WHL, Graham MS, Adeleke O, Cook G, Goh V, Ourselin S, Cardoso MJ. Patel A, et al. IEEE Int Conf Comput Vis Workshops. 2023 Dec 25;2023:2394-2402. doi: 10.1109/ICCVW60793.2023.00254. IEEE Int Conf Comput Vis Workshops. 2023. PMID: 39205863 Free PMC article.
Semi-supervised Label Generation for 3D Multi-modal MRI Bone Tumor Segmentation.
Curto-Vilalta A, Schlossmacher B, Valle C, Gersing A, Neumann J, von Eisenhart-Rothe R, Rueckert D, Hinterwimmer F. Curto-Vilalta A, et al. J Imaging Inform Med. 2025 Feb 20. doi: 10.1007/s10278-025-01448-z. Online ahead of print. J Imaging Inform Med. 2025. PMID: 39979760

See all "Cited by" articles

References

1. Alfaro-Almagro F., Jenkinson M., Bangerter N.K., Andersson J.L.R., Griffanti L., Douaud G., Sotiropoulos S.N., Jbabdi S., Hernandez-Fernandez M., Vallee E. Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage. 2018;166:400–424. - PMC - PubMed
1. Avants B.B., Tustison N.J., Song G., Cook P.A., Klein A., Gee J.C. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage. 2011;54:2033–2044. - PMC - PubMed
1. Bakas S., Akbari H., Sotiras A., Bilello M., Rozycki M., Kirby J.S., Freymann J.B., Farahani K., Davatzikos C. Advancing the cancer genome atlas glioma MRI collections with expert segmentation labels and radiomic features. Sci. Data. 2017;4 - PMC - PubMed
1. Bakas, S., Reyes, M., Jakab, A., Bauer, S., Rempfler, M., Crimi, A., Shinohara, R.T., Berger, C., Ha, S.M., Rozycki, M., 2018. Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge. arXiv Prepr. arXiv:1811.02629.
1. Baur, C., Denner, S., Wiestler, B., Albarqouni, S., Navab, N., 2020a. Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study. arXiv Prepr. arXiv:2004.03271. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

[1] Alfaro-Almagro F., Jenkinson M., Bangerter N.K., Andersson J.L.R., Griffanti L., Douaud G., Sotiropoulos S.N., Jbabdi S., Hernandez-Fernandez M., Vallee E. Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage. 2018;166:400–424. - PMC - PubMed

[2] Alfaro-Almagro F., Jenkinson M., Bangerter N.K., Andersson J.L.R., Griffanti L., Douaud G., Sotiropoulos S.N., Jbabdi S., Hernandez-Fernandez M., Vallee E. Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage. 2018;166:400–424. - PMC - PubMed

[3] Avants B.B., Tustison N.J., Song G., Cook P.A., Klein A., Gee J.C. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage. 2011;54:2033–2044. - PMC - PubMed

[4] Avants B.B., Tustison N.J., Song G., Cook P.A., Klein A., Gee J.C. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage. 2011;54:2033–2044. - PMC - PubMed

[5] Bakas S., Akbari H., Sotiras A., Bilello M., Rozycki M., Kirby J.S., Freymann J.B., Farahani K., Davatzikos C. Advancing the cancer genome atlas glioma MRI collections with expert segmentation labels and radiomic features. Sci. Data. 2017;4 - PMC - PubMed

[6] Bakas S., Akbari H., Sotiras A., Bilello M., Rozycki M., Kirby J.S., Freymann J.B., Farahani K., Davatzikos C. Advancing the cancer genome atlas glioma MRI collections with expert segmentation labels and radiomic features. Sci. Data. 2017;4 - PMC - PubMed

[7] Bakas, S., Reyes, M., Jakab, A., Bauer, S., Rempfler, M., Crimi, A., Shinohara, R.T., Berger, C., Ha, S.M., Rozycki, M., 2018. Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge. arXiv Prepr. arXiv:1811.02629.

[8] Bakas, S., Reyes, M., Jakab, A., Bauer, S., Rempfler, M., Crimi, A., Shinohara, R.T., Berger, C., Ha, S.M., Rozycki, M., 2018. Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge. arXiv Prepr. arXiv:1811.02629.

[9] Baur, C., Denner, S., Wiestler, B., Albarqouni, S., Navab, N., 2020a. Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study. arXiv Prepr. arXiv:2004.03271. - PubMed

[10] Baur, C., Denner, S., Wiestler, B., Albarqouni, S., Navab, N., 2020a. Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study. arXiv Prepr. arXiv:2004.03271. - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Unsupervised brain imaging 3D anomaly detection and segmentation with transformers

Affiliations

Unsupervised brain imaging 3D anomaly detection and segmentation with transformers

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical