Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 11;19(3):e0012924.
doi: 10.1371/journal.pntd.0012924. eCollection 2025 Mar.

Unsupervised machine learning identifies biomarkers of disease progression in post-kala-azar dermal leishmaniasis in Sudan

Affiliations

Unsupervised machine learning identifies biomarkers of disease progression in post-kala-azar dermal leishmaniasis in Sudan

Ana Torres et al. PLoS Negl Trop Dis. .

Abstract

Background: Post-kala-azar dermal leishmaniasis (PKDL) appears as a rash in some individuals who have recovered from visceral leishmaniasis caused by Leishmania donovani. Today, basic knowledge of this neglected disease and how to predict its progression remain largely unknown.

Methods and findings: This study addresses the use of several biochemical, haematological and immunological variables, independently or through unsupervised machine learning (ML), to predict PKDL progression risk. In 110 patients from Sudan, 31 such factors were assessed in relation to PKDL disease state at the time of diagnosis: progressive (worsening) versus stable. To identify key factors associated with PKDL worsening, we used both a conventional statistical approach and multivariate analysis through unsupervised ML. The independent use of these variables had limited power to predict skin lesion severity in a baseline examination. In contrast, the unsupervised ML approach identified a set of 10 non-redundant variables that was linked to a 3.1 times higher risk of developing progressive PKDL. Three of these clustering factors (low albumin level, low haematocrit and low IFN-γ production in PBMCs after Leishmania antigen stimulation) were remarkable in patients with progressive disease. Dimensionality re-establishment identified 11 further significantly modified factors that are also important to understand the worsening phenotype. Our results indicate that the combination of anaemia and a weak Th1 immunological response is likely the main physiological mechanism that leads to progressive PKDL.

Conclusions: A combination of 14 biochemical variables identified by unsupervised ML was able to detect a worsening PKDL state in Sudanese patients. This approach could prove instrumental to train future supervised algorithms based on larger patient cohorts both for a more precise diagnosis and to gain insight into fundamental aspects of this complication of visceral leishmaniasis.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Physiological data framework.
Patient metadata and feature scheme. Created in BioRender. Torres, A. (2025) https://BioRender.com/j75e448.
Fig 2
Fig 2. Heatmap indicating r values for all-against-all properties > 80% non-zero numbers.
The chart was built with the heatmap function of the seaborn Python library.
Fig 3
Fig 3. Dimensionality reduction by PCA.
(A) Dimensionality reduction using robust normalized data. Cumulative variance by PCA. (B) Bidimensional scatter plot showing patient data for “stable” (blue) and “worsening” (red) PKDL.
Fig 4
Fig 4. Clustering charts.
(A) K-means clustering inertia for k values from 2 to 20. (B) Significance of clinical phenotype enrichment in patients with stable (blue) or worsening (red) PKDL disease for clusters with k values from 2 to 20. Sphere diameter is proportional to cluster size (number of patients). Dashed grey lines indicate significant (p < 0.05) and highly significant (p < 0.01) values for enrichment in either stable or worsening patients. (C) Boxplot panel showing value distributions of features, both PCA-selected and non-selected, significantly differing in cluster cl5-k8 compared to the remaining subjects. Boxes represent the interquartile range; the line is the mean. Whiskers indicate up to 1.5 fold the interquartile range. Outliers are shown in circles. Significance: *p < 0.05, **p < 0.01, ***p < 0.001.
Fig 5
Fig 5. Spatial representation of three of fourteen features found to define the worsening PKDL phenotype.
Red dots: worsening patients in cluster cl5-k8; Orange dots: stable patients in cluster cl5-k8; Black dots: worsening patients outside cluster cl5-k8; Grey dots: stable patients outside cluster cl5-k8.

References

    1. Zijlstra EE, el-Hassan AM. Leishmaniasis in Sudan. Post kala-azar dermal leishmaniasis. Trans R Soc Trop Med Hyg. 2001;95(Suppl 1):S59–76. doi: 10.1016/s0035-9203(01)90219-6 - DOI - PubMed
    1. Ganguly S, Das N, Barbhuiya J, Chatterjee M. Post-kala-azar dermal leishmaniasis--an overview. International Journal of Dermatology. 2010;49(8):921–31. - PubMed
    1. Zijlstra EE, Musa AM, Khalil EAG, el-Hassan IM, el-Hassan AM. Post-kala-azar dermal leishmaniasis. Lancet Infect Dis. 2003;3(2):87–98. doi: 10.1016/s1473-3099(03)00517-6 - DOI - PubMed
    1. Zijlstra EE. The immunology of post-kala-azar dermal leishmaniasis (PKDL). Parasit Vectors. 2016;9(1):464. doi: 10.1186/s13071-016-1721-0 - DOI - PMC - PubMed
    1. Molina R, Ghosh D, Carrillo E, Monnerat S, Bern C, Mondal D, et al.. Infectivity of Post-Kala-azar Dermal Leishmaniasis Patients to Sand Flies: Revisiting a Proof of Concept in the Context of the Kala-azar Elimination Program in the Indian Subcontinent. Clin Infect Dis. 2017;65(1):150–3. doi: 10.1093/cid/cix245 - DOI - PMC - PubMed

LinkOut - more resources