TY - JOUR
T1 - Overinterpretation of findings in machine learning prediction model studies in oncology
T2 - a systematic review
AU - Dhiman, Paula
AU - Ma, Jie
AU - Andaur Navarro, Constanza L
AU - Speich, Benjamin
AU - Bullock, Garrett
AU - Damen, Johanna AA
AU - Hooft, Lotty
AU - Kirtley, Shona
AU - Riley, Richard D
AU - Van Calster, Ben
AU - Moons, Karel Gm
AU - Collins, Gary S
N1 - Funding Information:
Funding statement: Gary Collins, Shona Kirtley and Jie Ma are supported by Cancer Research UK (programme grant: C49297 / A27294 ). Benjamin Speich is supported by an Advanced Postdoc. Mobility grant (P300PB_177933) and a return grant ( P4P4PM_194496 ) from the Swiss National Science Foundation . Gary Collins and Paula Dhiman are supported by the NIHR Biomedical Research Center , Oxford. Ben Van Calster is supported by Internal Funds KU Leuven (grant C24M/20/064 ), University Hospitals Leuven (grant COPREDICT), and Kom Op Tegen Kanker (grant KOTK TRANS-IOTA). This publication presents independent research funded by Cancer Research UK, and the National Institute for Health Research (NIHR). The views expressed are those of the author(s) and not necessarily those of the Cancer Research UK, the NHS, the NIHR or the Department of Health and Social Care.
Publisher Copyright:
© 2023 The Author(s)
PY - 2023/5
Y1 - 2023/5
N2 - Objectives: In biomedical research, spin is the overinterpretation of findings, and it is a growing concern. To date, the presence of spin has not been evaluated in prognostic model research in oncology, including studies developing and validating models for individualized risk prediction. Study Design and Setting: We conducted a systematic review, searching MEDLINE and EMBASE for oncology-related studies that developed and validated a prognostic model using machine learning published between 1st January, 2019, and 5th September, 2019. We used existing spin frameworks and described areas of highly suggestive spin practices. Results: We included 62 publications (including 152 developed models; 37 validated models). Reporting was inconsistent between methods and the results in 27% of studies due to additional analysis and selective reporting. Thirty-two studies (out of 36 applicable studies) reported comparisons between developed models in their discussion and predominantly used discrimination measures to support their claims (78%). Thirty-five studies (56%) used an overly strong or leading word in their title, abstract, results, discussion, or conclusion. Conclusion: The potential for spin needs to be considered when reading, interpreting, and using studies that developed and validated prognostic models in oncology. Researchers should carefully report their prognostic model research using words that reflect their actual results and strength of evidence.
AB - Objectives: In biomedical research, spin is the overinterpretation of findings, and it is a growing concern. To date, the presence of spin has not been evaluated in prognostic model research in oncology, including studies developing and validating models for individualized risk prediction. Study Design and Setting: We conducted a systematic review, searching MEDLINE and EMBASE for oncology-related studies that developed and validated a prognostic model using machine learning published between 1st January, 2019, and 5th September, 2019. We used existing spin frameworks and described areas of highly suggestive spin practices. Results: We included 62 publications (including 152 developed models; 37 validated models). Reporting was inconsistent between methods and the results in 27% of studies due to additional analysis and selective reporting. Thirty-two studies (out of 36 applicable studies) reported comparisons between developed models in their discussion and predominantly used discrimination measures to support their claims (78%). Thirty-five studies (56%) used an overly strong or leading word in their title, abstract, results, discussion, or conclusion. Conclusion: The potential for spin needs to be considered when reading, interpreting, and using studies that developed and validated prognostic models in oncology. Researchers should carefully report their prognostic model research using words that reflect their actual results and strength of evidence.
KW - Artificial intelligence
KW - Machine learning
KW - Oncology
KW - Prediction model
KW - Prognosis
KW - Spin
KW - Statistical learning
UR - http://www.scopus.com/inward/record.url?scp=85151783991&partnerID=8YFLogxK
U2 - 10.1016/j.jclinepi.2023.03.012
DO - 10.1016/j.jclinepi.2023.03.012
M3 - Article
C2 - 36935090
SN - 0895-4356
VL - 157
SP - 120
EP - 133
JO - Journal of Clinical Epidemiology
JF - Journal of Clinical Epidemiology
ER -