Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Apr 13;10(4):e0122508.
doi: 10.1371/journal.pone.0122508. eCollection 2015.

Mining disease risk patterns from nationwide clinical databases for the assessment of early rheumatoid arthritis risk

Affiliations

Mining disease risk patterns from nationwide clinical databases for the assessment of early rheumatoid arthritis risk

Chu Yu Chin et al. PLoS One. .

Abstract

Rheumatoid arthritis (RA) is a chronic autoimmune rheumatic disease that can cause painful swelling in the joint lining, morning stiffness, and joint deformation/destruction. These symptoms decrease both quality of life and life expectancy. However, if RA can be diagnosed in the early stages, it can be controlled with pharmacotherapy. Although many studies have examined the possibility of early assessment and diagnosis, few have considered the relationship between significant risk factors and the early assessment of RA. In this paper, we present a novel framework for early RA assessment that utilizes data preprocessing, risk pattern mining, validation, and analysis. Under our proposed framework, two risk patterns can be discovered. Type I refers to well-known risk patterns that have been identified by existing studies, whereas Type II denotes unknown relationship risk patterns that have rarely or never been reported in the literature. These Type II patterns are very valuable in supporting novel hypotheses in clinical trials of RA, and constitute the main contribution of this work. To ensure the robustness of our experimental evaluation, we use a nationwide clinical database containing information on 1,314 RA-diagnosed patients over a 12-year follow-up period (1997-2008) and 965,279 non-RA patients. Our proposed framework is employed on this large-scale population-based dataset, and is shown to effectively discover rich RA risk patterns. These patterns may assist physicians in patient assessment, and enhance opportunities for early detection of RA. The proposed framework is broadly applicable to the mining of risk patterns for major disease assessments. This enables the identification of early risk patterns that are significantly associated with a target disease.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors would like to state that there are no conflicts of interest to declare, although one of the authors is affiliated with a commercial company [Telecommunication Laboratories, Chunghwa Telecom Co., Ltd.], this does not alter the authors' adherence to PLOS ONE policies on sharing data and materials.

Figures

Fig 1
Fig 1. Workflow of data mining process for RA disease early detection.
Fig 2
Fig 2. Timeline for data collection of RA group.
Fig 3
Fig 3. Risk Pattern Viewer.
Fig 4
Fig 4. Age distribution of cohort.
Fig 5
Fig 5. Trend of RA assessment effects under different support thresholds.
Fig 6
Fig 6. Distribution of RA patients under different numbers of diagnosis records.
Fig 7
Fig 7. Trend of RA assessment effects under different diagnosis record numbers.
Fig 8
Fig 8. Trend in literature for single disease risk patterns in PubMed.
Fig 9
Fig 9. Trend in literature for PubMed pattern related mining.

Similar articles

Cited by

References

    1. Gibofsky A. Overview of epidemiology, pathophysiology, and diagnosis of rheumatoid arthritis. The American Journal of Managed Care. 2012. Dec;18(13 Suppl):S295–302. - PubMed
    1. Kuo CF, Luo SF, See LC, Chou IJ, Chang HC, Yu KH. Rheumatoid arthritis prevalence, incidence, and mortality rates: a nationwide population study in Taiwan. Rheumatol Int. 2013. Feb;33(2):355–60. 10.1007/s00296-012-2411-7 - DOI - PubMed
    1. Shang G, Richardson A, Gahan ME, Easteal S, Ohms S, Lidbury BA. Predicting the presence of hepatitis B virus surface antigen in Chinese patients by pathology data mining. Journal of Medical Virology. 2013. Aug;85(8):1334–9. 10.1002/jmv.23609 - DOI - PubMed
    1. Mathias JS, Agrawal A, Feinglass J, Cooper AJ, Baker DW, Choudhary A. Development of a 5 year life expectancy index in older adults using predictive mining of electronic health record data. Journal of the American Medical Informatics Association. 2013. Jun;20(e1):e118–e24. 10.1136/amiajnl-2012-001360 - DOI - PMC - PubMed
    1. Ng T, Chew L, Yap CW. A Clinical Decision Support Tool To Predict Survival in Cancer Patients beyond 120 Days after Palliative Chemotherapy. Journal of Palliative Medicine. 2012. Aug;15(8):863–9. 10.1089/jpm.2011.0417 - DOI - PMC - PubMed

Publication types

Substances