TY - JOUR
T1 - Approaches to addressing missing values, measurement error, and confounding in epidemiologic studies
AU - van Smeden, Maarten
AU - Penning de Vries, Bas B.L.
AU - Nab, Linda
AU - Groenwold, Rolf H.H.
N1 - Funding Information:
Funding: This work was supported by grants from the Netherlands Organization for Scientific Research (ZonMW-Vidi project 917.16.430) and the Leiden University Medical Center .
Publisher Copyright:
© 2020 The Authors
PY - 2021/3
Y1 - 2021/3
N2 - Objectives: Epidemiologic studies often suffer from incomplete data, measurement error (or misclassification), and confounding. Each of these can cause bias and imprecision in estimates of exposure–outcome relations. We describe and compare statistical approaches that aim to control all three sources of bias simultaneously. Study Design and Setting: We illustrate four statistical approaches that address all three sources of bias, namely, multiple imputation for missing data and measurement error, multiple imputation combined with regression calibration, full information maximum likelihood within a structural equation modeling framework, and a Bayesian model. In a simulation study, we assess the performance of the four approaches compared with more commonly used approaches that do not account for measurement error, missing values, or confounding. Results: The results demonstrate that the four approaches consistently outperform the alternative approaches on all performance metrics (bias, mean squared error, and confidence interval coverage). Even in simulated data of 100 subjects, these approaches perform well. Conclusion: There can be a large benefit of addressing measurement error, missing values, and confounding to improve the estimation of exposure–outcome relations, even when the available sample size is relatively small.
AB - Objectives: Epidemiologic studies often suffer from incomplete data, measurement error (or misclassification), and confounding. Each of these can cause bias and imprecision in estimates of exposure–outcome relations. We describe and compare statistical approaches that aim to control all three sources of bias simultaneously. Study Design and Setting: We illustrate four statistical approaches that address all three sources of bias, namely, multiple imputation for missing data and measurement error, multiple imputation combined with regression calibration, full information maximum likelihood within a structural equation modeling framework, and a Bayesian model. In a simulation study, we assess the performance of the four approaches compared with more commonly used approaches that do not account for measurement error, missing values, or confounding. Results: The results demonstrate that the four approaches consistently outperform the alternative approaches on all performance metrics (bias, mean squared error, and confidence interval coverage). Even in simulated data of 100 subjects, these approaches perform well. Conclusion: There can be a large benefit of addressing measurement error, missing values, and confounding to improve the estimation of exposure–outcome relations, even when the available sample size is relatively small.
KW - Bayes Theorem
KW - Bias
KW - Computer Simulation
KW - Confounding Factors, Epidemiologic
KW - Data Interpretation, Statistical
KW - Epidemiologic Studies
KW - Humans
KW - Probability
KW - Data analysis
KW - Confounding
KW - Regression
KW - Missing data
KW - Simulation
KW - Imputation
KW - Regression calibration
KW - Measurement error
UR - http://www.scopus.com/inward/record.url?scp=85097731895&partnerID=8YFLogxK
U2 - 10.1016/j.jclinepi.2020.11.006
DO - 10.1016/j.jclinepi.2020.11.006
M3 - Article
C2 - 33176189
SN - 0895-4356
VL - 131
SP - 89
EP - 100
JO - Journal of Clinical Epidemiology
JF - Journal of Clinical Epidemiology
ER -