TY - JOUR
T1 - Recursive ensemble feature selection provides a robust mRNA expression signature for myalgic encephalomyelitis/chronic fatigue syndrome
AU - Metselaar, Paula I
AU - Mendoza-Maldonado, Lucero
AU - Li Yim, Andrew Yung Fong
AU - Abarkan, Ilias
AU - Henneman, Peter
AU - Te Velde, Anje A
AU - Schönhuth, Alexander
AU - Bosch, Jos A
AU - Kraneveld, Aletta D
AU - Lopez-Rincon, Alejandro
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/2/25
Y1 - 2021/2/25
N2 - Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a chronic disorder characterized by disabling fatigue. Several studies have sought to identify diagnostic biomarkers, with varying results. Here, we innovate this process by combining both mRNA expression and DNA methylation data. We performed recursive ensemble feature selection (REFS) on publicly available mRNA expression data in peripheral blood mononuclear cells (PBMCs) of 93 ME/CFS patients and 25 healthy controls, and found a signature of 23 genes capable of distinguishing cases and controls. REFS highly outperformed other methods, with an AUC of 0.92. We validated the results on a different platform (AUC of 0.95) and in DNA methylation data obtained from four public studies on ME/CFS (99 patients and 50 controls), identifying 48 gene-associated CpGs that predicted disease status as well (AUC of 0.97). Finally, ten of the 23 genes could be interpreted in the context of the derailed immune system of ME/CFS.
AB - Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a chronic disorder characterized by disabling fatigue. Several studies have sought to identify diagnostic biomarkers, with varying results. Here, we innovate this process by combining both mRNA expression and DNA methylation data. We performed recursive ensemble feature selection (REFS) on publicly available mRNA expression data in peripheral blood mononuclear cells (PBMCs) of 93 ME/CFS patients and 25 healthy controls, and found a signature of 23 genes capable of distinguishing cases and controls. REFS highly outperformed other methods, with an AUC of 0.92. We validated the results on a different platform (AUC of 0.95) and in DNA methylation data obtained from four public studies on ME/CFS (99 patients and 50 controls), identifying 48 gene-associated CpGs that predicted disease status as well (AUC of 0.97). Finally, ten of the 23 genes could be interpreted in the context of the derailed immune system of ME/CFS.
KW - Biomarkers
KW - Case-Control Studies
KW - Computational Biology/methods
KW - DNA Methylation
KW - Disease Susceptibility
KW - Fatigue Syndrome, Chronic/diagnosis
KW - Gene Expression Profiling
KW - Gene Expression Regulation
KW - Models, Biological
KW - RNA, Messenger
KW - ROC Curve
KW - Reproducibility of Results
KW - Transcriptome
UR - http://www.scopus.com/inward/record.url?scp=85101571486&partnerID=8YFLogxK
U2 - 10.1038/s41598-021-83660-9
DO - 10.1038/s41598-021-83660-9
M3 - Article
C2 - 33633136
SN - 2045-2322
VL - 11
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 4541
ER -