Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 15;81(16):4188-4193.
doi: 10.1158/0008-5472.CAN-21-0950. Epub 2021 Jun 15.

NCI Imaging Data Commons

Affiliations

NCI Imaging Data Commons

Andrey Fedorov et al. Cancer Res. .

Abstract

The National Cancer Institute (NCI) Cancer Research Data Commons (CRDC) aims to establish a national cloud-based data science infrastructure. Imaging Data Commons (IDC) is a new component of CRDC supported by the Cancer Moonshot. The goal of IDC is to enable a broad spectrum of cancer researchers, with and without imaging expertise, to easily access and explore the value of deidentified imaging data and to support integrated analyses with nonimaging data. We achieve this goal by colocating versatile imaging collections with cloud-based computing resources and data exploration, visualization, and analysis tools. The IDC pilot was released in October 2020 and is being continuously populated with radiology and histopathology collections. IDC provides access to curated imaging collections, accompanied by documentation, a user forum, and a growing number of analysis use cases that aim to demonstrate the value of a data commons framework applied to cancer imaging research. SIGNIFICANCE: This study introduces NCI Imaging Data Commons, a new repository of the NCI Cancer Research Data Commons, which will support cancer imaging research on the cloud.

PubMed Disclaimer

Figures

Figure 1. High-level diagram of relevant components of the Imaging Data Commons and related entities, and their relation to the steps of the envisioned CRDC user flow with the emphasis on imaging applications. Green boxes correspond to the envisioned user flow. IDC Extract Transform Load (ETL) process maintains the content of the data collected by external entities (e.g., TCIA) colocated with the various cloud-based tools, such as those maintained by Cloud Resources or by the Google Cloud Platform. The data can be accessed using both the interactive components (e.g., IDC Portal and Viewer) and programmatic APIs.
Figure 1.
High-level diagram of relevant components of the Imaging Data Commons and related entities, and their relation to the steps of the envisioned CRDC user flow with the emphasis on imaging applications. Green boxes correspond to the envisioned user flow. IDC Extract Transform Load (ETL) process maintains the content of the data collected by external entities (e.g., TCIA) colocated with the various cloud-based tools, such as those maintained by Cloud Resources or by the Google Cloud Platform. The data can be accessed using both the interactive components (e.g., IDC Portal and Viewer) and programmatic APIs.
Figure 2. Elements of IDC Portal user interface. Left, front page of the IDC Portal for the pilot (preproduction) release of the platform, available at https://imaging.datacommons.cancer.gov. Right, example of filters available for defining cohort based on the attributes describing segmentation results available in IDC.
Figure 2.
Elements of IDC Portal user interface. Left, front page of the IDC Portal for the pilot (preproduction) release of the platform, available at https://imaging.datacommons.cancer.gov. Right, example of filters available for defining cohort based on the attributes describing segmentation results available in IDC.

References

    1. Jaffee EM, Dang CV, Agus DB, Alexander BM, Anderson KC, Ashworth A, et al. Future cancer research priorities in the USA: a lancet oncology commission. Lancet Oncol 2017;18:e653–706. - PMC - PubMed
    1. Grossman RL, Heath A, Murphy M, Patterson M, Wells W. A case for data commons: toward data science as a service. Comput Sci Eng 2016;18:10–20. - PMC - PubMed
    1. Hinkson IV, Davidsen TM, Klemm JD, Kerlavage AR, Kibbe WA. A comprehensive infrastructure for big data in cancer research: accelerating cancer research and precision medicine. Front Cell Dev Biol 2017;5:83. - PMC - PubMed
    1. Jensen MA, Ferretti V, Grossman RL, Staudt LM. The NCI genomic data commons as an engine for precision medicine. Blood 2017;130:453–9. - PMC - PubMed
    1. Reynolds SM, Miller M, Lee P, Leinonen K, Paquette SM, Rodebaugh Z, et al. The ISB cancer genomics cloud: a flexible cloud-based platform for cancer genomics research. Cancer Res 2017;77:e7–10. - PMC - PubMed

Publication types

MeSH terms