TY - JOUR
T1 - Sample size considerations and predictive performance of multinomial logistic prediction models
AU - de Jong, Valentijn M.T.
AU - Eijkemans, Marinus J.C.
AU - van Calster, Ben
AU - Timmerman, Dirk
AU - Moons, Karel G.M.
AU - Steyerberg, Ewout W.
AU - van Smeden, Maarten
N1 - Funding Information:
We thank Hajime Uno for providing code for the PDI. Karel G. M. Moons receives funding from the Netherlands Organisation for Scientific Research (project 918.10.615).
Publisher Copyright:
© 2019 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
PY - 2019/4/1
Y1 - 2019/4/1
N2 - Multinomial Logistic Regression (MLR) has been advocated for developing clinical prediction models that distinguish between three or more unordered outcomes. We present a full-factorial simulation study to examine the predictive performance of MLR models in relation to the relative size of outcome categories, number of predictors and the number of events per variable. It is shown that MLR estimated by Maximum Likelihood yields overfitted prediction models in small to medium sized data. In most cases, the calibration and overall predictive performance of the multinomial prediction model is improved by using penalized MLR. Our simulation study also highlights the importance of events per variable in the multinomial context as well as the total sample size. As expected, our study demonstrates the need for optimism correction of the predictive performance measures when developing the multinomial logistic prediction model. We recommend the use of penalized MLR when prediction models are developed in small data sets or in medium sized data sets with a small total sample size (ie, when the sizes of the outcome categories are balanced). Finally, we present a case study in which we illustrate the development and validation of penalized and unpenalized multinomial prediction models for predicting malignancy of ovarian cancer.
AB - Multinomial Logistic Regression (MLR) has been advocated for developing clinical prediction models that distinguish between three or more unordered outcomes. We present a full-factorial simulation study to examine the predictive performance of MLR models in relation to the relative size of outcome categories, number of predictors and the number of events per variable. It is shown that MLR estimated by Maximum Likelihood yields overfitted prediction models in small to medium sized data. In most cases, the calibration and overall predictive performance of the multinomial prediction model is improved by using penalized MLR. Our simulation study also highlights the importance of events per variable in the multinomial context as well as the total sample size. As expected, our study demonstrates the need for optimism correction of the predictive performance measures when developing the multinomial logistic prediction model. We recommend the use of penalized MLR when prediction models are developed in small data sets or in medium sized data sets with a small total sample size (ie, when the sizes of the outcome categories are balanced). Finally, we present a case study in which we illustrate the development and validation of penalized and unpenalized multinomial prediction models for predicting malignancy of ovarian cancer.
KW - Multinomial logistic regression
KW - overfit
KW - prediction models
KW - predictive performance
KW - shrinkage
KW - Multinomial Logistic Regression
UR - http://www.scopus.com/inward/record.url?scp=85059557306&partnerID=8YFLogxK
U2 - 10.1002/sim.8063
DO - 10.1002/sim.8063
M3 - Article
C2 - 30614028
SN - 0277-6715
VL - 38
SP - 1601
EP - 1619
JO - Statistics in Medicine
JF - Statistics in Medicine
IS - 9
ER -