Show simple item record

dc.identifier.urihttp://hdl.handle.net/1951/55610
dc.identifier.urihttp://hdl.handle.net/11401/72659
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractVariation in the human genome is present in many forms, including single-nucleotide polymorphisms (SNPs) and copy number polymorphisms (CNPs). CNPs have many categories such as small insertion-deletion polymorphisms, variable number of repetitive sequences, and genomic structural alterations. A major question that researchers in the field of statistical genetics need to answer is the number of CNP categories in a given dataset. In this study, I compare five information criteria (BIC, AIC, NEC, CLC, and ICL-BIC) to find if there is a"best" measure among them in finding the correct number of components (correct number of CNP categories). I consider six design factors: equal/unequal within-component variances, high/low separations, sample size, mixture proportion, multiple random starting values, and transformation using two known number of components (3 and 6). The result indicates that under"ideal" conditions (that is, small number of components, large separation between components, constant within component variance, and no subsequent transformation of mixture data), each criterion performs well. When the data is a monotonic transformation of data from a mixture, the BIC criterion, which is the most commonly used criterion in CNP research, has a low component number accuracy rate. I then considered the application of the Box-Cox transformation whether or not it was needed. The application of the Box-Cox transformation did not reduce the component number accuracy rate of the CLC, ICL-BIC, and BIC when it was not needed. The component number accuracy rates for the BIC criterion with Box-Cox transformation applied were improved when the mixture data was transformed. The Box-Cox transformation should be used routinely with CLC, ICL-BIC, or BIC criterion to estimate the number of components in a CNP mixture analysis.
dcterms.available2012-05-15T18:06:38Z
dcterms.available2015-04-24T14:53:04Z
dcterms.contributorFinch, Stephen J.en_US
dcterms.contributorNancy R. Mendellen_US
dcterms.contributorWei Zhuen_US
dcterms.contributorDerek Gordon.en_US
dcterms.creatorSaint Fleur, Rose Edy
dcterms.dateAccepted2012-05-15T18:06:38Z
dcterms.dateAccepted2015-04-24T14:53:04Z
dcterms.dateSubmitted2012-05-15T18:06:38Z
dcterms.dateSubmitted2015-04-24T14:53:04Z
dcterms.descriptionDepartment of Applied Mathematics and Statisticsen_US
dcterms.formatMonograph
dcterms.formatApplication/PDFen_US
dcterms.identifierSaintFleur_grad.sunysb_0771E_10193.pdfen_US
dcterms.identifierhttp://hdl.handle.net/1951/55610
dcterms.identifierhttp://hdl.handle.net/11401/72659
dcterms.issued2010-08-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2012-05-15T18:06:38Z (GMT). No. of bitstreams: 1 SaintFleur_grad.sunysb_0771E_10193.pdf: 1414701 bytes, checksum: 40e070202b4cc93420f9ee2ec14bcd0d (MD5) Previous issue date: 1en
dcterms.provenanceMade available in DSpace on 2015-04-24T14:53:04Z (GMT). No. of bitstreams: 3 SaintFleur_grad.sunysb_0771E_10193.pdf.jpg: 1894 bytes, checksum: a6009c46e6ec8251b348085684cba80d (MD5) SaintFleur_grad.sunysb_0771E_10193.pdf.txt: 192579 bytes, checksum: 30d793dfef4a88cd67439b0674dec0fa (MD5) SaintFleur_grad.sunysb_0771E_10193.pdf: 1414701 bytes, checksum: 40e070202b4cc93420f9ee2ec14bcd0d (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectStatistics
dcterms.titleTesting the properties of selection criteria: an application to copy number polymorphism measurements
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record