Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 28;12(1):18208.
doi: 10.1038/s41598-022-22695-y.

Discovering HIV related information by means of association rules and machine learning

Collaborators, Affiliations

Discovering HIV related information by means of association rules and machine learning

Lourdes Araujo et al. Sci Rep. .

Abstract

Acquired immunodeficiency syndrome (AIDS) is still one of the main health problems worldwide. It is therefore essential to keep making progress in improving the prognosis and quality of life of affected patients. One way to advance along this pathway is to uncover connections between other disorders associated with HIV/AIDS-so that they can be anticipated and possibly mitigated. We propose to achieve this by using Association Rules (ARs). They allow us to represent the dependencies between a number of diseases and other specific diseases. However, classical techniques systematically generate every AR meeting some minimal conditions on data frequency, hence generating a vast amount of uninteresting ARs, which need to be filtered out. The lack of manually annotated ARs has favored unsupervised filtering, even though they produce limited results. In this paper, we propose a semi-supervised system, able to identify relevant ARs among HIV-related diseases with a minimal amount of annotated training data. Our system has been able to extract a good number of relationships between HIV-related diseases that have been previously detected in the literature but are scattered and are often little known. Furthermore, a number of plausible new relationships have shown up which deserve further investigation by qualified medical experts.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Scheme of EXTRAE, the semi-supervised learning model for the filtering of relevant ARs among HIV-related diseases.
Figure 2
Figure 2
Flow diagram of incremental learning in EXTRAE semi-supervised algorithm for filtering relevant ARs. Rounded rectangles show the beginning and the end of the iterations, rectangles are the rule sets, ovals are processes, and the diamond represents a condition.

References

    1. Agrawal, R., Imielinski, T. & Swami, A. N. Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington, DC, USA, May 26-28, 1993., 207–216 (1993).
    1. Imamura T, et al. A technique for identifying three diagnostic findings using association analysis. Med. Biol. Eng. Comput. 2007;45:51–59. doi: 10.1007/s11517-006-0121-6. - DOI - PubMed
    1. Park SH, Jang SY, Kim H, Lee SW. An association rule mining-based framework for understanding lifestyle risk behaviors. PLoS ONE. 2014;9:e88859. doi: 10.1371/journal.pone.0088859. - DOI - PMC - PubMed
    1. Rao PS, Devi TU. Applicability of apriori based association rules on medical data. Int. J. Appl. Eng. Res. 2017;12:9451–9458.
    1. Manolitsis, I. et al. Using association rules in antimicrobial resistance in stone disease patients. Stud. Heal. Technol. Inform. 462–465 (2022). - PubMed

Publication types