Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan;90(1069):20160689.
doi: 10.1259/bjr.20160689. Epub 2016 Dec 8.

Big Data in radiation therapy: challenges and opportunities

Affiliations

Big Data in radiation therapy: challenges and opportunities

Tim Lustberg et al. Br J Radiol. 2017 Jan.

Abstract

Data collected and generated by radiation oncology can be classified by the Volume, Variety, Velocity and Veracity (4Vs) of Big Data because they are spread across different care providers and not easily shared owing to patient privacy protection. The magnitude of the 4Vs is substantial in oncology, especially owing to imaging modalities and unclear data definitions. To create useful models ideally all data of all care providers are understood and learned from; however, this presents challenges in the guise of poor data quality, patient privacy concerns, geographical spread, interoperability and large volume. In radiation oncology, there are many efforts to collect data for research and innovation purposes. Clinical trials are the gold standard when proving any hypothesis that directly affects the patient. Collecting data in registries with strict predefined rules is also a common approach to find answers. A third approach is to develop data stores that can be used by modern machine learning techniques to provide new insights or answer hypotheses. We believe all three approaches have their strengths and weaknesses, but they should all strive to create Findable, Accessible, Interoperable, Reusable (FAIR) data. To learn from these data, we need distributed learning techniques, sending machine learning algorithms to FAIR data stores around the world, learning from trial data, registries and routine clinical data rather than trying to centralize all data. To improve and personalize medicine, rapid learning platforms must be able to process FAIR "Big Data" to evaluate current clinical practice and to guide further innovation.

PubMed Disclaimer

References

    1. Zarrouk M. Delivering excellence in patient care with ready access to clinical data. 2012 [Cited 28 July 2016]. Available from: http://www.netapp.com/us/media/wp-7169.pdf
    1. Oberije C, Nalbantov G, Dekker A, Boersma L, Borger J, Reymen B, et al. . A prospective study comparing the predictions of doctors versus models for treatment outcome of lung cancer patients: a step towards individualized care and shared decision making. Radiother Oncol 2014; 112: 37–43. doi: https://doi.org/10.1016/j.radonc.2014.04.012 - DOI - PMC - PubMed
    1. Benedict SH, Hoffman K, Martel MK, Abernethy AP, Asher AL, Capala J, et al. . Overview of the American Society for Radiation Oncology-National Institutes of Health-American Association of Physicists in Medicine Workshop 2015: exploring opportunities for radiation oncology in the era of big data. Int J Radiat Oncol Biol Phys 2016; 95: 873–9. doi: https://doi.org/10.1016/j.ijrobp.2016.03.006 - DOI - PMC - PubMed
    1. Kohn MS, Sun J, Knoop S, Shabo A, Carmeli B, Sow D, et al. . IBM's health analytics and clinical decision support. Yearb Med Inform 2014; 9: 154–62. doi: https://doi.org/10.15265/IY-2014-0002 - DOI - PMC - PubMed
    1. Bilimoria KY, Stewart AK, Winchester DP, Ko CY. The national cancer data base: a powerful initiative to improve cancer care in the United States. Ann Surg Oncol 2008; 15: 683–90. doi: https://doi.org/10.1245/s10434-007-9747-3 - DOI - PMC - PubMed