TY - JOUR
T1 - Powerful testing via hierarchical linkage disequilibrium in haplotype association studies
AU - Balliu, Brunilda
AU - Houwing-Duistermaat, Jeanine J.
AU - Böhringer, Stefan
N1 - Funding Information:
European Union’s Seventh Framework Pro gram, Grant/Award Number: 305280; Netherlands Organization for Scientific Research, Grant/Award Number: 917.66.344; Wellcome Trust Case Control Consortium, Grant/Award Numbers: 076113, 085475, 090355
Funding Information:
Research leading to this work was supported by the Netherlands Organization for Scientific Research Grant (917.66.344), European Union's Seventh Framework Program for research under grant agreement no. 305280 (MIMOmics), and Stanford's Genome Training Program. This study makes use of data generated by the Wellcome Trust Case Control Consortium which is available at the European Genome-Phenome Archive (https://ega-archive.org). A full list of the investigators who contributed to the generation of the data is available from www.wtccc.org.uk. Funding for the project was provided by the Wellcome Trust under award 076113, 085475, and 090355.
Funding Information:
Research leading to this work was supported by the Netherlands Organization for Scientific Research Grant (917.66.344), European Union's Seventh Framework Program for research under grant agreement no. 305280 (MIMOmics), and Stanford's Genome Training Program. This study makes use of data generated by the Wellcome Trust Case Control Consortium which is available at the European Genome-Phenome Archive (https://ega-archive.org). A full list of the investigators who contributed to the generation of the data is available from www.wtccc.org.uk. Funding for the project was provided by the Wellcome Trust under award 076113, 085475, and 090355.
Publisher Copyright:
© 2019 The Authors. Biometrical Journal Published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
PY - 2019/5/1
Y1 - 2019/5/1
N2 - Marginal tests based on individual SNPs are routinely used in genetic association studies. Studies have shown that haplotype-based methods may provide more power in disease mapping than methods based on single markers when, for example, multiple disease-susceptibility variants occur within the same gene. A limitation of haplotype-based methods is that the number of parameters increases exponentially with the number of SNPs, inducing a commensurate increase in the degrees of freedom and weakening the power to detect associations. To address this limitation, we introduce a hierarchical linkage disequilibrium model for disease mapping, based on a reparametrization of the multinomial haplotype distribution, where every parameter corresponds to the cumulant of each possible subset of a set of loci. This hierarchy present in the parameters enables us to employ flexible testing strategies over a range of parameter sets: from standard single SNP analyses through the full haplotype distribution tests, reducing degrees of freedom and increasing the power to detect associations. We show via extensive simulations that our approach maintains the type I error at nominal level and has increased power under many realistic scenarios, as compared to single SNP and standard haplotype-based studies. To evaluate the performance of our proposed methodology in real data, we analyze genome-wide data from the Wellcome Trust Case-Control Consortium.
AB - Marginal tests based on individual SNPs are routinely used in genetic association studies. Studies have shown that haplotype-based methods may provide more power in disease mapping than methods based on single markers when, for example, multiple disease-susceptibility variants occur within the same gene. A limitation of haplotype-based methods is that the number of parameters increases exponentially with the number of SNPs, inducing a commensurate increase in the degrees of freedom and weakening the power to detect associations. To address this limitation, we introduce a hierarchical linkage disequilibrium model for disease mapping, based on a reparametrization of the multinomial haplotype distribution, where every parameter corresponds to the cumulant of each possible subset of a set of loci. This hierarchy present in the parameters enables us to employ flexible testing strategies over a range of parameter sets: from standard single SNP analyses through the full haplotype distribution tests, reducing degrees of freedom and increasing the power to detect associations. We show via extensive simulations that our approach maintains the type I error at nominal level and has increased power under many realistic scenarios, as compared to single SNP and standard haplotype-based studies. To evaluate the performance of our proposed methodology in real data, we analyze genome-wide data from the Wellcome Trust Case-Control Consortium.
KW - cis interactions
KW - genome-wide association study
KW - haplotype association study
KW - linkage disequilibrium
UR - http://www.scopus.com/inward/record.url?scp=85060757218&partnerID=8YFLogxK
U2 - 10.1002/bimj.201800053
DO - 10.1002/bimj.201800053
M3 - Article
C2 - 30693553
AN - SCOPUS:85060757218
SN - 0323-3847
VL - 61
SP - 747
EP - 768
JO - Biometrical Journal
JF - Biometrical Journal
IS - 3
ER -