TY - JOUR
T1 - Sample size for binary logistic prediction models
T2 - Beyond events per variable criteria
AU - van Smeden, Maarten
AU - Moons, Karel G.M.
AU - de Groot, Joris A.H.
AU - Collins, Gary S.
AU - Altman, Douglas G.
AU - Eijkemans, Marinus J.C.
AU - Reitsma, Johannes B.
PY - 2019/8
Y1 - 2019/8
N2 - Binary logistic regression is one of the most frequently applied statistical approaches for developing clinical prediction models. Developers of such models often rely on an Events Per Variable criterion (EPV), notably EPV ≥10, to determine the minimal sample size required and the maximum number of candidate predictors that can be examined. We present an extensive simulation study in which we studied the influence of EPV, events fraction, number of candidate predictors, the correlations and distributions of candidate predictor variables, area under the ROC curve, and predictor effects on out-of-sample predictive performance of prediction models. The out-of-sample performance (calibration, discrimination and probability prediction error) of developed prediction models was studied before and after regression shrinkage and variable selection. The results indicate that EPV does not have a strong relation with metrics of predictive performance, and is not an appropriate criterion for (binary) prediction model development studies. We show that out-of-sample predictive performance can better be approximated by considering the number of predictors, the total sample size and the events fraction. We propose that the development of new sample size criteria for prediction models should be based on these three parameters, and provide suggestions for improving sample size determination.
AB - Binary logistic regression is one of the most frequently applied statistical approaches for developing clinical prediction models. Developers of such models often rely on an Events Per Variable criterion (EPV), notably EPV ≥10, to determine the minimal sample size required and the maximum number of candidate predictors that can be examined. We present an extensive simulation study in which we studied the influence of EPV, events fraction, number of candidate predictors, the correlations and distributions of candidate predictor variables, area under the ROC curve, and predictor effects on out-of-sample predictive performance of prediction models. The out-of-sample performance (calibration, discrimination and probability prediction error) of developed prediction models was studied before and after regression shrinkage and variable selection. The results indicate that EPV does not have a strong relation with metrics of predictive performance, and is not an appropriate criterion for (binary) prediction model development studies. We show that out-of-sample predictive performance can better be approximated by considering the number of predictors, the total sample size and the events fraction. We propose that the development of new sample size criteria for prediction models should be based on these three parameters, and provide suggestions for improving sample size determination.
KW - EPV
KW - Logistic regression
KW - prediction models
KW - predictive performance
KW - sample size
KW - simulations
UR - http://www.scopus.com/inward/record.url?scp=85049920258&partnerID=8YFLogxK
U2 - 10.1177/0962280218784726
DO - 10.1177/0962280218784726
M3 - Article
C2 - 29966490
AN - SCOPUS:85049920258
SN - 0962-2802
VL - 28
SP - 2455
EP - 2474
JO - Statistical Methods in Medical Research
JF - Statistical Methods in Medical Research
IS - 8
ER -