Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 25;11(1):4541.
doi: 10.1038/s41598-021-83660-9.

Recursive ensemble feature selection provides a robust mRNA expression signature for myalgic encephalomyelitis/chronic fatigue syndrome

Affiliations

Recursive ensemble feature selection provides a robust mRNA expression signature for myalgic encephalomyelitis/chronic fatigue syndrome

Paula I Metselaar et al. Sci Rep. .

Abstract

Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a chronic disorder characterized by disabling fatigue. Several studies have sought to identify diagnostic biomarkers, with varying results. Here, we innovate this process by combining both mRNA expression and DNA methylation data. We performed recursive ensemble feature selection (REFS) on publicly available mRNA expression data in peripheral blood mononuclear cells (PBMCs) of 93 ME/CFS patients and 25 healthy controls, and found a signature of 23 genes capable of distinguishing cases and controls. REFS highly outperformed other methods, with an AUC of 0.92. We validated the results on a different platform (AUC of 0.95) and in DNA methylation data obtained from four public studies on ME/CFS (99 patients and 50 controls), identifying 48 gene-associated CpGs that predicted disease status as well (AUC of 0.97). Finally, ten of the 23 genes could be interpreted in the context of the derailed immune system of ME/CFS.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Results of the REFS algorithm run ten times on the mRNA expression data of 118 samples from the CAMDA dataset. (a) The optimal number of predictor genes to distinguish 93 cases from 25 controls was 23 (red vertical line). (b) mRNA expression levels of the 23 predictor genes for 93 cases (light blue) and 25 controls (dark blue) in box-and-whisker plots. Outliers were omitted for visualization purposes.
Figure 2
Figure 2
ROC curves for (a) REFS, (b) IWGCNA, and (c) univariate analysis applied to the same mRNA expression data of 118 samples from the CAMDA dataset. (d) ROC curve for the validation of the obtained 23-gene signature on a separate dataset (GSE14577). The 5-fold cross validation was performed with the eighteen genes available in GSE14577.
Figure 3
Figure 3
Results of the REFS algorithm run ten times on the merged DNA methylation datasets restricted to 278 probes associated with the 23 candidate genes. (a) The optimal number of predictor CpGs to distinguish 99 cases from 50 controls was 48 (red vertical line). (b) ROC curve of the 48 predictor CpGs. c) DNA methylation levels (normalized using Standard scaler) of the 48 predictor CpG sites for 99 cases (light blue) and 50 controls (dark blue).
Figure 4
Figure 4
Visualization of the functions and locations of ten proteins in a hypothetical immune cell setting. All ten mRNA transcripts were downregulated in PBMCs of ME/CFS patients compared to healthy controls. Created with BioRender.com.
Figure 5
Figure 5
Overview of the REFS algorithm using the 10-fold global accuracy > 70% as a stop parameter.
Figure 6
Figure 6
Overview of the pipeline to reduce the number of methylation data probes for REFS by selecting CpGs associated with the candidate genes.

References

    1. Fukuda K, et al. The chronic fatigue syndrome: A comprehensive approach to its definition and study. Ann. Intern. Med. 1994;121:953–959. doi: 10.7326/0003-4819-121-12-199412150-00009. - DOI - PubMed
    1. Johnston S, Brenu EW, Staines D, Marshall-Gradisnik S. The prevalence of chronic fatigue syndrome/myalgic encephalomyelitis: A meta-analysis. Clin. Epidemiol. 2013;5:105. doi: 10.2147/CLEP.S39876. - DOI - PMC - PubMed
    1. Walsh CM, Zainal NZ, Middleton SJ, Paykel ES. A family history study of chronic fatigue syndrome. Psychiatr. Genet. 2001;11:123–128. doi: 10.1097/00041444-200109000-00003. - DOI - PubMed
    1. Schlauch KA, et al. Genome-wide association analysis identifies genetic variations in subjects with myalgic encephalomyelitis/chronic fatigue syndrome. Transl. Psychiatry. 2016;6:e730–e730. doi: 10.1038/tp.2015.208. - DOI - PMC - PubMed
    1. Perez M, et al. Genetic predisposition for immune system, hormone, and metabolic dysfunction in myalgic encephalomyelitis/chronic fatigue syndrome: A pilot study. Front. Pediatr. 2019;7:206. doi: 10.3389/fped.2019.00206. - DOI - PMC - PubMed

Publication types