Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Feb;310(2):e231319.
doi: 10.1148/radiol.231319.

The Image Biomarker Standardization Initiative: Standardized Convolutional Filters for Reproducible Radiomics and Enhanced Clinical Insights

Affiliations
Review

The Image Biomarker Standardization Initiative: Standardized Convolutional Filters for Reproducible Radiomics and Enhanced Clinical Insights

Philip Whybra et al. Radiology. 2024 Feb.

Abstract

Filters are commonly used to enhance specific structures and patterns in images, such as vessels or peritumoral regions, to enable clinical insights beyond the visible image using radiomics. However, their lack of standardization restricts reproducibility and clinical translation of radiomics decision support tools. In this special report, teams of researchers who developed radiomics software participated in a three-phase study (September 2020 to December 2022) to establish a standardized set of filters. The first two phases focused on finding reference filtered images and reference feature values for commonly used convolutional filters: mean, Laplacian of Gaussian, Laws and Gabor kernels, separable and nonseparable wavelets (including decomposed forms), and Riesz transformations. In the first phase, 15 teams used digital phantoms to establish 33 reference filtered images of 36 filter configurations. In phase 2, 11 teams used a chest CT image to derive reference values for 323 of 396 features computed from filtered images using 22 filter and image processing configurations. Reference filtered images and feature values for Riesz transformations were not established. Reproducibility of standardized convolutional filters was validated on a public data set of multimodal imaging (CT, fluorodeoxyglucose PET, and T1-weighted MRI) in 51 patients with soft-tissue sarcoma. At validation, reproducibility of 486 features computed from filtered images using nine configurations × three imaging modalities was assessed using the lower bounds of 95% CIs of intraclass correlation coefficients. Out of 486 features, 458 were found to be reproducible across nine teams with lower bounds of 95% CIs of intraclass correlation coefficients greater than 0.75. In conclusion, eight filter types were standardized with reference filtered images and reference feature values for verifying and calibrating radiomics software packages. A web-based tool is available for compliance checking.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: P.W. Supported by Engineering and Physical Sciences Research Council grant EP/N509449/1 supported by Cardiff University Innovation for All scheme, that was backed by Research Wales Innovation Funding from HEFCW (Higher Education Funding Council for Wales); royalties from collaboration with Hero Imaging. A.Z. No relevant relationships. V.A. No relevant relationships. R.S. No relevant relationships. A.P.A. Grant support from Memorial Sloan Kettering Cancer Center (P30 CA008748). A.A. No relevant relationships. B.B. No relevant relationships. S.B. No relevant relationships. A.B. No relevant relationships. R.B. No relevant relationships. L.B. No relevant relationships. I.B. Research grants from Dosisoft; GE HealthCare; Siemens Healthcare; leadership role in Society of Nuclear Medicine AI Task Force. G.J.R.C. Consulting fees from Amgen, GE HealthCare, Blue Earth Diagnostics, Serac Healthcare. F.D. No relevant relationships. N.D. No relevant relationships. H.S.G. No relevant relationships. V.G. Member of the Radiology editorial board; grant to institution from Siemens Healthcare; royalties from Oxford University Press; honoraria or payment for lectures from the European School of Radiology; support for meetings and/or travel from the European Society of Radiology; leadership in Royal College of Radiologists. M.G. Grants from Varian, AstraZeneca, ViewRay; participation on a data safety monitoring board or advisory board from AstraZeneca. M. Hatt No relevant relationships. M. Hosseinzadeh No relevant relationships. A.I. No relevant relationships. J.L. No relevant relationships. M.A.L.L. No relevant relationships. S.L. No relevant relationships. F.M. No relevant relationships. O.M. No relevant relationships. C.N. No relevant relationships. F.O. No relevant relationships. S.P. No relevant relationships. A.R. Founder and board member of Ascinta Technologies. S.M.R. No relevant relationships. C.G.R. No relevant relationships. M.R.S. No relevant relationships. A.S. No relevant relationships. I.S. No relevant relationships. E.S. No relevant relationships. S.T.L. Institutional grants from Varian (Siemens Healthcare), Viewray; consulting fees from Varian (Siemens Healthcare), Viewray; board member, SASRO. F.T. No relevant relationships. T.U. No relevant relationships. V.V. No relevant relationships. J.J.M.v.G. No relevant relationships. F.Y. No relevant relationships. H.Z. No relevant relationships. H.M. Grant from Roche. M.V. No relevant relationships. A.D. No relevant relationships.

Figures

None
Graphical abstract
Overview of convolutional filters. An image is filtered using convolution
to create a filtered image (top). Each image consists of values. At convolution,
a filter with three weights (1.0, −2.0, 1.0) is slid across the image,
and adjacent image values are multiplied with the corresponding filter values
and summed to create a response value for each position in the image.
Convolutional filtering is positioned after resampling in the overall radiomics
image processing scheme (center). This workflow starts with an image obtained
from a repository or archiving system in a digital format. The image is
optionally converted (eg, from PET activity to standardized uptake values) and
undergoes postprocessing (eg, MRI bias-field correction). Segmentation masks are
either loaded in a digital format or automatically created. Both image and
segmentation masks are optionally resampled. Filtered images are created by
filtering the image. Filtered images and segmentation masks are then used to
compute radiomic features. This study attempts to standardize several types of
convolution filters (bottom). The original CT image is shown for
reference.
Figure 1:
Overview of convolutional filters. An image is filtered using convolution to create a filtered image (top). Each image consists of values. At convolution, a filter with three weights (1.0, −2.0, 1.0) is slid across the image, and adjacent image values are multiplied with the corresponding filter values and summed to create a response value for each position in the image. Convolutional filtering is positioned after resampling in the overall radiomics image processing scheme (center). This workflow starts with an image obtained from a repository or archiving system in a digital format. The image is optionally converted (eg, from PET activity to standardized uptake values) and undergoes postprocessing (eg, MRI bias-field correction). Segmentation masks are either loaded in a digital format or automatically created. Both image and segmentation masks are optionally resampled. Filtered images are created by filtering the image. Filtered images and segmentation masks are then used to compute radiomic features. This study attempts to standardize several types of convolution filters (bottom). The original CT image is shown for reference.
Three filters are used to quantify different characteristics of the
peritumoral region in a chest CT, with an out-of-plane tumor. For each filter,
mean and maximum intensity are computed within the segmentation masks in three
filtered images. The standardized filtered image was created by applying a
standardized filter to the original image. The other two filtered images
resulted from filter implementations that were not standardized. The Laplacian
of Gaussian filter is used to quantify the presence of edges and highlight fine
details. The scale of the filter is 2.0 mm, and it is truncated at 8.0 mm. The
nonstandardized filters use 2.0 voxels and truncate at one filter scale (2.0
mm). Separable wavelets are designed to quantify image contents for different
frequency bands, though in many radiomics analyses they are used to quantify
edges. A pair of low-pass and high-pass wavelet kernels is used to filter the
image, highlighting edges in the lateral direction. The nonstandardized filters
either use an incorrect orientation (ie, low-pass and high-pass kernels were
swapped) or are faulty because the first kernel is used for all directions (ie,
a pair of low-pass-low-pass wavelet kernels). Gabor filters are used to quantify
directional structures (eg, fibrosis and bronchi). The standardized filter used
scale and wavelength parameters of 2.0 mm and was oriented under 30°. The
nonstandardized filters use an incorrect orientation or express parameters in
2.0 voxels.
Figure 2:
Three filters are used to quantify different characteristics of the peritumoral region in a chest CT, with an out-of-plane tumor. For each filter, mean and maximum intensity are computed within the segmentation masks in three filtered images. The standardized filtered image was created by applying a standardized filter to the original image. The other two filtered images resulted from filter implementations that were not standardized. The Laplacian of Gaussian filter is used to quantify the presence of edges and highlight fine details. The scale of the filter is 2.0 mm, and it is truncated at 8.0 mm. The nonstandardized filters use 2.0 voxels and truncate at one filter scale (2.0 mm). Separable wavelets are designed to quantify image contents for different frequency bands, though in many radiomics analyses they are used to quantify edges. A pair of low-pass and high-pass wavelet kernels is used to filter the image, highlighting edges in the lateral direction. The nonstandardized filters either use an incorrect orientation (ie, low-pass and high-pass kernels were swapped) or are faulty because the first kernel is used for all directions (ie, a pair of low-pass-low-pass wavelet kernels). Gabor filters are used to quantify directional structures (eg, fibrosis and bronchi). The standardized filter used scale and wavelength parameters of 2.0 mm and was oriented under 30°. The nonstandardized filters use an incorrect orientation or express parameters in 2.0 voxels.
Study overview. Several figure elements adapted, under a CC BY 4.0
license, from reference 10.
Figure 3:
Study overview. Several figure elements adapted, under a CC BY 4.0 license, from reference .
Results overview. In phase 1, participating teams computed 36 filtered
images of convolutional filters according to predefined configurations.
These filtered images were compared, and consensus was measured. Teams
updated their implementations iteratively, which led to an improvement of
consensus over time (arbitrary [arb.] unit; 27 months). Consensus strength
was based on matching the voxel-wise difference between filtered images and
the tentative reference filtered image within a tolerance. The number of
participating teams at each point is shown. In phase 2, participating teams
computed 396 features from filtered images of convolutional filters
according to predefined filter and image processing configurations. As in
phase 1, teams updated their implementations iteratively. Unlike phase 1,
improvement in consensus was mostly because of more teams enrolling over
time (arbitrary unit; 15 months). Consensus strength was based on the number
of teams matching the tentative reference feature value within a tolerance
and was assigned according to the same categories as in phase 1. In phase 3,
reproducibility of features computed from filtered images was validated.
Teams computed 486 features from a public data set of 51 patients with
soft-tissue sarcoma that were scanned using CT, fluorine 18
fluorodeoxyglucose (FDG) PET, and T1-weighted (T1w) MRI. Reproducibility was
assessed using the lower bound of the 95% CI of the intraclass correlation
coefficient: poor, lower bound less than 0.50; moderate, between 0.50 and
0.75; good, between 0.75 and 0.90; excellent, greater than 0.90; and
unknown, computed by fewer than two teams.
Figure 4:
Results overview. In phase 1, participating teams computed 36 filtered images of convolutional filters according to predefined configurations. These filtered images were compared, and consensus was measured. Teams updated their implementations iteratively, which led to an improvement of consensus over time (arbitrary [arb.] unit; 27 months). Consensus strength was based on matching the voxel-wise difference between filtered images and the tentative reference filtered image within a tolerance. The number of participating teams at each point is shown. In phase 2, participating teams computed 396 features from filtered images of convolutional filters according to predefined filter and image processing configurations. As in phase 1, teams updated their implementations iteratively. Unlike phase 1, improvement in consensus was mostly because of more teams enrolling over time (arbitrary unit; 15 months). Consensus strength was based on the number of teams matching the tentative reference feature value within a tolerance and was assigned according to the same categories as in phase 1. In phase 3, reproducibility of features computed from filtered images was validated. Teams computed 486 features from a public data set of 51 patients with soft-tissue sarcoma that were scanned using CT, fluorine 18 fluorodeoxyglucose (FDG) PET, and T1-weighted (T1w) MRI. Reproducibility was assessed using the lower bound of the 95% CI of the intraclass correlation coefficient: poor, lower bound less than 0.50; moderate, between 0.50 and 0.75; good, between 0.75 and 0.90; excellent, greater than 0.90; and unknown, computed by fewer than two teams.

Similar articles

Cited by

References

    1. Gillies RJ , Kinahan PE , Hricak H . Radiomics: Images Are More than Pictures, They Are Data . Radiology 2016. ; 278 ( 2 ): 563 – 577 . - PMC - PubMed
    1. Tomaszewski MR , Gillies RJ . The Biological Meaning of Radiomic Features . Radiology 2021. ; 298 ( 3 ): 505 – 516 . - PMC - PubMed
    1. Huang EP , O’Connor JPB , McShane LM , et al. . Criteria for the translation of radiomics into clinically useful tests . Nat Rev Clin Oncol 2022. ; 20 ( 2 ): 69 – 82 . - PMC - PubMed
    1. Zwanenburg A , Vallières M , Abdalah MA , et al. . The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping . Radiology 2020. ; 295 ( 2 ): 328 – 338 . - PMC - PubMed
    1. Fornacon-Wood I , Mistry H , Ackermann CJ , et al. . Reliability and prognostic value of radiomic features are highly dependent on choice of feature extraction platform . Eur Radiol 2020. ; 30 ( 11 ): 6241 – 6250 . - PMC - PubMed