TY - JOUR
T1 - How variation in predictor measurement affects the discriminative ability and transportability of a prediction model
AU - Pajouheshnia, R.
AU - van Smeden, M.
AU - Peelen, L. M.
AU - Groenwold, R. H.H.
N1 - Funding Information:
Declaration of funding and competing interests: This work was funded by the Netherlands Organisation for Scientific Research (project 9120.8004 and 918.10.615). Rolf Groenwold receives funding from the Netherlands Organisation for Scientific Research (project 917.16.430). The funding bodies had no role in the design, conduct or decision to publish this study, and there are no conflicts of interest to declare.
Publisher Copyright:
© 2018 Elsevier Inc.
PY - 2019/1
Y1 - 2019/1
N2 - Background and Objective: Diagnostic and prognostic prediction models often perform poorly when externally validated. We investigate how differences in the measurement of predictors across settings affect the discriminative power and transportability of a prediction model. Methods: Differences in predictor measurement between data sets can be described formally using a measurement error taxonomy. Using this taxonomy, we derive an expression relating variation in the measurement of a continuous predictor to the area under the receiver operating characteristic curve (AUC) of a logistic regression prediction model. This expression is used to demonstrate how variation in measurements across settings affects the out-of-sample discriminative ability of a prediction model. We illustrate these findings with a diagnostic prediction model using example data of patients suspected of having deep venous thrombosis. Results: When a predictor, such as D-dimer, is measured with more noise in one setting compared to another, which we conceptualize as a difference in “classical” measurement error, the expected value of the AUC decreases. In contrast, constant, “structural” measurement error does not impact on the AUC of a logistic regression model, provided the magnitude of the error is the same among cases and noncases. As the differences in measurement methods between settings (and in turn differences in measurement error structures) become more complex, it becomes increasingly difficult to predict how the AUC will differ between settings. Conclusion: When a prediction model is applied to a different setting to the one in which it was developed, its discriminative ability can decrease or even increase if the magnitude or structure of the errors in predictor measurements differ between the two settings. This provides an important starting point for researchers to better understand how differences in measurement methods can affect the performance of a prediction model when externally validating or implementing it in practice.
AB - Background and Objective: Diagnostic and prognostic prediction models often perform poorly when externally validated. We investigate how differences in the measurement of predictors across settings affect the discriminative power and transportability of a prediction model. Methods: Differences in predictor measurement between data sets can be described formally using a measurement error taxonomy. Using this taxonomy, we derive an expression relating variation in the measurement of a continuous predictor to the area under the receiver operating characteristic curve (AUC) of a logistic regression prediction model. This expression is used to demonstrate how variation in measurements across settings affects the out-of-sample discriminative ability of a prediction model. We illustrate these findings with a diagnostic prediction model using example data of patients suspected of having deep venous thrombosis. Results: When a predictor, such as D-dimer, is measured with more noise in one setting compared to another, which we conceptualize as a difference in “classical” measurement error, the expected value of the AUC decreases. In contrast, constant, “structural” measurement error does not impact on the AUC of a logistic regression model, provided the magnitude of the error is the same among cases and noncases. As the differences in measurement methods between settings (and in turn differences in measurement error structures) become more complex, it becomes increasingly difficult to predict how the AUC will differ between settings. Conclusion: When a prediction model is applied to a different setting to the one in which it was developed, its discriminative ability can decrease or even increase if the magnitude or structure of the errors in predictor measurements differ between the two settings. This provides an important starting point for researchers to better understand how differences in measurement methods can affect the performance of a prediction model when externally validating or implementing it in practice.
KW - Area under the curve
KW - Discrimination
KW - Measurement error
KW - Prediction models
KW - Transportability
UR - https://www.scopus.com/pages/publications/85054428385
U2 - 10.1016/j.jclinepi.2018.09.001
DO - 10.1016/j.jclinepi.2018.09.001
M3 - Article
C2 - 30223065
AN - SCOPUS:85054428385
SN - 0895-4356
VL - 105
SP - 136
EP - 141
JO - Journal of Clinical Epidemiology
JF - Journal of Clinical Epidemiology
ER -