TY - JOUR
T1 - How to deal with the early GWAS data when imputing and combining different arrays is necessary
AU - Uh, HW
AU - Deelen, Joris
AU - Beekman, Marian
AU - Helmer, Quinta
AU - Rivadeneira, Fernando
AU - Hottenga, Jouke Jan
AU - Boomsma, Dorret I.
AU - Hofman, Albert
AU - Uitterlinden, André G.
AU - Slagboom, P. E.
AU - Böhringer, Stefan
AU - Houwing-Duistermaat, Jeanine J.
PY - 2012/5/1
Y1 - 2012/5/1
N2 -
Genotype imputation has become an essential tool in the analysis of genome-wide association scans. This technique allows investigators to test association at ungenotyped genetic markers, and to combine results across studies that rely on different genotyping platforms. In addition, imputation is used within long-running studies to reuse genotypes produced across generations of platforms. Typically, genotypes of controls are reused and cases are genotyped on more novel platforms yielding a case-control study that is not matched for genotyping platforms. In this study, we scrutinize such a situation and validate GWAS results by actually retyping top-ranking SNPs with the Sequenom MassArray platform. We discuss the needed quality controls (QCs). In doing so, we report a considerable discrepancy between the results from imputed and retyped data when applying recommended QCs from the literature. These discrepancies appear to be caused by extrapolating differences between arrays by the process of imputation. To avoid false positive results, we recommend that more stringent QCs should be applied. We also advocate reporting the imputation quality measure (R
T
2
) for the post-imputation QCs in publications.
AB -
Genotype imputation has become an essential tool in the analysis of genome-wide association scans. This technique allows investigators to test association at ungenotyped genetic markers, and to combine results across studies that rely on different genotyping platforms. In addition, imputation is used within long-running studies to reuse genotypes produced across generations of platforms. Typically, genotypes of controls are reused and cases are genotyped on more novel platforms yielding a case-control study that is not matched for genotyping platforms. In this study, we scrutinize such a situation and validate GWAS results by actually retyping top-ranking SNPs with the Sequenom MassArray platform. We discuss the needed quality controls (QCs). In doing so, we report a considerable discrepancy between the results from imputed and retyped data when applying recommended QCs from the literature. These discrepancies appear to be caused by extrapolating differences between arrays by the process of imputation. To avoid false positive results, we recommend that more stringent QCs should be applied. We also advocate reporting the imputation quality measure (R
T
2
) for the post-imputation QCs in publications.
KW - GWAS
KW - imputation
KW - quality control
UR - http://www.scopus.com/inward/record.url?scp=84859919736&partnerID=8YFLogxK
U2 - 10.1038/ejhg.2011.231
DO - 10.1038/ejhg.2011.231
M3 - Article
C2 - 22189269
AN - SCOPUS:84859919736
SN - 1018-4813
VL - 20
SP - 572
EP - 576
JO - European Journal of Human Genetics
JF - European Journal of Human Genetics
IS - 5
ER -