TY - JOUR
T1 - A multimetric approach to analysis of genome-wide association by single markers and composite likelihood
AU - Gibson, Jane
AU - Tapper, William
AU - Cox, David
AU - Zhang, Weihua
AU - Pfeufer, Arne
AU - Gieger, Christian
AU - Wichmann, H. Erich
AU - Kääb, Stefan
AU - Collins, Andrew R.
AU - Meitinger, Thomas
AU - Morton, Newton
PY - 2008/2/19
Y1 - 2008/2/19
N2 - Two case/control studies with different phenotypes, marker densities, and microarrays were examined for the most significant single markers in defined regions. They show a pronounced bias toward exaggerated significance that increases with the number of observed markers and would increase further with imputed markers. This bias is eliminated by Bonferroni adjustment, thereby allowing combination by principal component analysis with a Malecot model composite likelihood evaluated by a permutation procedure to allow for multiple dependent markers. This intermediate value identifies the only demonstrated causal locus as most significant even in the preliminary analysis and clearly recognizes the strongest candidate in the other sample. Because the three metrics (most significant single marker, composite likelihood, and their principal component) are correlated, choice of the n smallest P values by each test gives <3n regions for follow-up in the next stage. In this way, methods with different response to marker selection and density are given approximately equal weight and economically compared, without expressing an untested prejudice or sacrificing the most significant results for any of them. Large numbers of cases, controls, and markers are by themselves insufficient to control type 1 and 2 errors, and so efficient use of multiple metrics with Bonferroni adjustment promises to be valuable in identifying causal variants and optimal design simultaneously.
AB - Two case/control studies with different phenotypes, marker densities, and microarrays were examined for the most significant single markers in defined regions. They show a pronounced bias toward exaggerated significance that increases with the number of observed markers and would increase further with imputed markers. This bias is eliminated by Bonferroni adjustment, thereby allowing combination by principal component analysis with a Malecot model composite likelihood evaluated by a permutation procedure to allow for multiple dependent markers. This intermediate value identifies the only demonstrated causal locus as most significant even in the preliminary analysis and clearly recognizes the strongest candidate in the other sample. Because the three metrics (most significant single marker, composite likelihood, and their principal component) are correlated, choice of the n smallest P values by each test gives <3n regions for follow-up in the next stage. In this way, methods with different response to marker selection and density are given approximately equal weight and economically compared, without expressing an untested prejudice or sacrificing the most significant results for any of them. Large numbers of cases, controls, and markers are by themselves insufficient to control type 1 and 2 errors, and so efficient use of multiple metrics with Bonferroni adjustment promises to be valuable in identifying causal variants and optimal design simultaneously.
KW - Bonferroni correction
KW - Electrocardiographic QT interval
KW - Empirical P values
KW - Principal component analysis
UR - http://www.scopus.com/inward/record.url?scp=40649122795&partnerID=8YFLogxK
U2 - 10.1073/pnas.0711903105
DO - 10.1073/pnas.0711903105
M3 - Article
C2 - 18268331
AN - SCOPUS:40649122795
SN - 0027-8424
VL - 105
SP - 2592
EP - 2597
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 7
ER -