Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 2;84(9):1384-1387.
doi: 10.1158/0008-5472.CAN-23-2655.

NCI Cancer Research Data Commons: Core Standards and Services

Affiliations

NCI Cancer Research Data Commons: Core Standards and Services

Arthur Brady et al. Cancer Res. .

Abstract

The NCI Cancer Research Data Commons (CRDC) is a collection of data commons, analysis platforms, and tools that make existing cancer data more findable and accessible by the cancer research community. In practice, the two biggest hurdles to finding and using data for discovery are the wide variety of models and ontologies used to describe data, and the dispersed storage of that data. Here, we outline core CRDC services to aggregate descriptive information from multiple studies for findability via a single interface and to provide a single access method that spans multiple data commons. See related articles by Wang et al., p. 1388, Pot et al., p. 1396, and Kim et al., p. 1404.

PubMed Disclaimer

Figures

Figure 1. NCI CRDC Core Standards and Services. Researchers submitting research results to CRDC are encouraged to use terminologies and ontologies provided by DSS. Each data commons is routinely indexed by both CDA and DCF. Researchers looking for data will query against the database of aggregated indices built by CDA, using the cda-python tool. Query results include a unique persistent identifier for each file, provided by DCF, which also manages authentication and authorization for controlled data.
Figure 1.
NCI CRDC Core Standards and Services. Researchers submitting research results to CRDC are encouraged to use terminologies and ontologies provided by DSS. Each data commons is routinely indexed by both CDA and DCF. Researchers looking for data will query against the database of aggregated indices built by CDA, using the cda-python tool. Query results include a unique persistent identifier for each file, provided by DCF, which also manages authentication and authorization for controlled data.

References

    1. Grossman RL. Ten lessons for data sharing with a data commons. Sci Data 2023;10:120. - PMC - PubMed
    1. Charbonneau AL, Brady A, Czajkowski K, Aluvathingal J, Canchi S, Carter R, et al. . Making common fund data more findable: catalyzing a data ecosystem. Gigascience 2022;11:giac105. - PMC - PubMed
    1. Harrow J, Drysdale R, Smith A, Repo S, Lanfear J, Blomberg N. ELIXIR: providing a sustainable infrastructure for life science data at European scale. Bioinformatics 2021;37:2506–11. - PMC - PubMed
    1. Budroni P, Claude-Burgelman J, Schouppe M. Architectures of knowledge: the European open science cloud. ABI-Tech 2019;39:130–41.
    1. Barnes C, Bajracharya B, Cannalte M, Gowani Z, Haley W, Kass-Hout T, et al. . The biomedical research hub: a federated platform for patient research data. J Am Med Inform Assoc 2022;29:619–25. - PMC - PubMed

Publication types

LinkOut - more resources