Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Jun 28;25(24):2990-3008.
doi: 10.3748/wjg.v25.i24.2990.

Application of Big Data analysis in gastrointestinal research

Affiliations
Review

Application of Big Data analysis in gastrointestinal research

Ka-Shing Cheung et al. World J Gastroenterol. .

Abstract

Big Data, which are characterized by certain unique traits like volume, velocity and value, have revolutionized the research of multiple fields including medicine. Big Data in health care are defined as large datasets that are collected routinely or automatically, and stored electronically. With the rapidly expanding volume of health data collection, it is envisioned that the Big Data approach can improve not only individual health, but also the performance of health care systems. The application of Big Data analysis in the field of gastroenterology and hepatology research has also opened new research approaches. While it retains most of the advantages and avoids some of the disadvantages of traditional observational studies (case-control and prospective cohort studies), it allows for phenomapping of disease heterogeneity, enhancement of drug safety, as well as development of precision medicine, prediction models and personalized treatment. Unlike randomized controlled trials, it reflects the real-world situation and studies patients who are often under-represented in randomized controlled trials. However, residual and/or unmeasured confounding remains a major concern, which requires meticulous study design and various statistical adjustment methods. Other potential drawbacks include data validity, missing data, incomplete data capture due to the unavailability of diagnosis codes for certain clinical situations, and individual privacy. With continuous technological advances, some of the current limitations with Big Data may be further minimized. This review will illustrate the use of Big Data research on gastrointestinal and liver diseases using recently published examples.

Keywords: Colorectal cancer; Epidemiology; Gastric cancer; Gastrointestinal bleeding; Healthcare dataset; Hepatocellular carcinoma; Inflammatory bowel disease.

PubMed Disclaimer

Conflict of interest statement

Conflict-of-interest statement: WKL has received an honorarium for attending advisory board meetings of Boehringer Ingelheim and Takeda. WKS received honorarium for attending advisory board meetings of AbbVie, Celltrion and Gilead; speaker fees from AbbVie, Astrazeneca, Eisai, Gilead and Ipsen; and research funding from Gilead.

Similar articles

Cited by

References

    1. Lohr S. The Origins of 'Big Data': An Etymological Detective Story [cited 25 January 2019]. Available from: https://bits.blogs.nytimes.com/2013/02/01/the-origins-of-big-data-an-ety....
    1. Kitchin R, McArdle G. What makes Big Data, Big Data? Exploring the ontological characteristics of 26 datasets. Big Data Soc. 2016:1–103.
    1. Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Byers AH. Big Data: The Next Frontier for Innovation, Competition, and Productivity [cited 25 January 2019). Available from: https://bigdatawg.nist.gov/pdf/MG_big_data_full_report.pdf.
    1. Nickerson DW, Rogers T. Political campaigns and big data. J Econom Perspect. 2014;28:51–74.
    1. Laney D. 3D data management: controlling data volume, velocity and variety [cited 25 January 2019]. Available from: https://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Managem....