Show simple item record

dc.identifier.urihttp://hdl.handle.net/1951/55688
dc.identifier.urihttp://hdl.handle.net/11401/72722
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractGenotype imputation provides an essential technique for genome-wide association studies (GWAS) with hundreds of thousands of SNPs. Understanding the connection between imputation inconsistencies and the power to detect association at imputed markers or the disease genes close to them is important for the optimal design of imputation-based GWAS since genotype misclassification can significantly decrease statistical power to detect association. Double sampling of genotypes is a statistical procedure in which a portion of subjects receive a second and more precise genotyping. This paper applies the likelihood ratio test allowing for errors (LRT-AE), which incorporates double sample information for genotypes on a sub-sample of cases/controls, to correct for imputation inconsistencies. Parameters used to determine the log likelihoods are determined using the Expectation-Maximization (EM) algorithm. To compare the performance of the LRT-AE with the performance of the likelihood ratio test (LRT), which makes no adjustment for imputation inconsistencies, I perform simulation studies using a factorial design with high and low settings of: disease minor allele frequency (MAF), heterozygote relative risk, mode of inheritance (MOI), disease prevalence, and proportion of double sampled subjects. The LRT-AE method maintains correct type I error rates for all null simulations and all significance level thresholds (5%, 1%). Power improvement, however, is not significant unless more than 50% of subjects are in the double sampled group. Unbiased estimates of imputation inconsistency rates are also obtained from the LRT-AE method.
dcterms.available2012-05-15T18:07:49Z
dcterms.available2015-04-24T14:53:22Z
dcterms.contributorMendell, Nancy R.en_US
dcterms.contributorStephen J. Finchen_US
dcterms.contributorWei Zhuen_US
dcterms.contributorDerek Gordon.en_US
dcterms.creatorYuan, Qilong
dcterms.dateAccepted2012-05-15T18:07:49Z
dcterms.dateAccepted2015-04-24T14:53:22Z
dcterms.dateSubmitted2012-05-15T18:07:49Z
dcterms.dateSubmitted2015-04-24T14:53:22Z
dcterms.descriptionDepartment of Applied Mathematics and Statisticsen_US
dcterms.formatMonograph
dcterms.formatApplication/PDFen_US
dcterms.identifierYuan_grad.sunysb_0771E_10347.pdfen_US
dcterms.identifierhttp://hdl.handle.net/1951/55688
dcterms.identifierhttp://hdl.handle.net/11401/72722
dcterms.issued2010-12-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2012-05-15T18:07:49Z (GMT). No. of bitstreams: 1 Yuan_grad.sunysb_0771E_10347.pdf: 872834 bytes, checksum: 9542210b100c3cb7fc33bdee2c61b028 (MD5) Previous issue date: 1en
dcterms.provenanceMade available in DSpace on 2015-04-24T14:53:22Z (GMT). No. of bitstreams: 3 Yuan_grad.sunysb_0771E_10347.pdf.jpg: 1894 bytes, checksum: a6009c46e6ec8251b348085684cba80d (MD5) Yuan_grad.sunysb_0771E_10347.pdf.txt: 96556 bytes, checksum: bc064c14710235c38ec1bb949eebe9f6 (MD5) Yuan_grad.sunysb_0771E_10347.pdf: 872834 bytes, checksum: 9542210b100c3cb7fc33bdee2c61b028 (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectDouble Sampling Method, Genome-wide Association Studies, Genotype Imputation, Likelihood Ratio Test Allowing for Errors
dcterms.subjectGenetics -- Statistics
dcterms.titleApplication of Double Sampling to Combine Measured and Imputed Genotype Data in Genetic Association Studies
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record