TY - JOUR
T1 - Sequential double cross-validation for assessment of added predictive ability in high-dimensional omic applications
AU - Rodríguez-Girondo, Mar
AU - Salo, Perttu
AU - Burzykowski, Tomasz
AU - Perola, Markus
AU - Houwing-Duistermaat, Jeanine
AU - Mertens, Bart
PY - 2018/9/1
Y1 - 2018/9/1
N2 - Enriching existing predictive models with new biomolecular markers is an important task in the new multi-omic era. Clinical studies increasingly include new sets of omic measurements which may prove their added value in terms of predictive performance. We introduce a two-step approach for the assessment of the added predictive ability of omic predictors, based on sequential double cross-validation and regularized regression models. We propose several performance indices to summarize the two-stage prediction procedure and a permutation test to formally assess the added predictive value of a second omic set of predictors over a primary omic source. The performance of the test is investigated through simulations. We illustrate the new method through the systematic assessment and comparison of the performance of transcriptomics and metabolomics sources in the prediction of body mass index (BMI) using longitudinal data from the Dietary, Lifestyle, and Genetic determinants of Obesity and Metabolic syndrome (DILGOM) study, a population-based cohort from Finland.
AB - Enriching existing predictive models with new biomolecular markers is an important task in the new multi-omic era. Clinical studies increasingly include new sets of omic measurements which may prove their added value in terms of predictive performance. We introduce a two-step approach for the assessment of the added predictive ability of omic predictors, based on sequential double cross-validation and regularized regression models. We propose several performance indices to summarize the two-stage prediction procedure and a permutation test to formally assess the added predictive value of a second omic set of predictors over a primary omic source. The performance of the test is investigated through simulations. We illustrate the new method through the systematic assessment and comparison of the performance of transcriptomics and metabolomics sources in the prediction of body mass index (BMI) using longitudinal data from the Dietary, Lifestyle, and Genetic determinants of Obesity and Metabolic syndrome (DILGOM) study, a population-based cohort from Finland.
KW - Added predictive ability
KW - Double cross-validation
KW - Multiple omics sets
KW - Regularized regression
UR - http://www.scopus.com/inward/record.url?scp=85053346600&partnerID=8YFLogxK
U2 - 10.1214/17-AOAS1125
DO - 10.1214/17-AOAS1125
M3 - Article
AN - SCOPUS:85053346600
SN - 1932-6157
VL - 12
SP - 1655
EP - 1678
JO - Annals of Applied Statistics
JF - Annals of Applied Statistics
IS - 3
ER -