Show simple item record

dc.identifier.urihttp://hdl.handle.net/1951/60234
dc.identifier.urihttp://hdl.handle.net/11401/71500
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractThe partial correlation is well defined for continuous data and popularly used in network analysis. Its strength is in its interpretation as the relationship between two variables after removing the effects of other variables. We follow up on a recent proposal of such a measure for categorical data, but the properties of which were not well studied. The new partial correlation is defined as the first canonical correlation of Pearson residuals from logistic regressions. This is analogous to the continuous case, where the partial correlation is obtained from correlating residuals from linear regressions. A simulation study is presented to examine the properties of the new partial correlation and compare it to other measures, such as the partial phi coefficient. In the limiting case, the new partial correlation and the partial phi coefficient converge in estimate and inference. However, the partial phi coefficient cannot be applied to multi-categorical data. Furthermore, it is not an efficient measure to control for more than one variable. The new partial correlation is well defined for the multi-categorical case and can readily control for more than one variable. Being derived as the canonical correlation, the new partial correlation can also measure the relationship between continuous and categorical variables as the multiple correlation between the Pearson residuals from the logistic regression and the usual residual from the linear regression when the response variables are categorical and continuous respectively. Now that we are fully capable of obtaining partial correlation networks for any data types, continuous, categorical or mixed, our next goal is to compare the network structure between different groups and to examine the impact of continuous, in addition to categorical covariates, on the pathway connections. This is accomplished by extending the two-level regression approach for continuous data originally developed by our research group (Pradhan, 2009) to categorical data and mixed data network analysis. By linearly regressing the first canonical variates and replacing the slope coefficient with an expression of the covariates, we can test for the effect of covariates (both categorical and continuous) on the partial correlation and the network structure. This new covariate partial correlation network analysis approach is illustrated through two studies on the links between human genotypes (single-nucleotide polymorphisms) and disease phenotypes.
dcterms.available2013-05-24T16:38:16Z
dcterms.available2015-04-24T14:47:45Z
dcterms.contributorZhu, Wei, Ahn, Hongshiken_US
dcterms.contributorWu, Songen_US
dcterms.contributorKotov, Roman.en_US
dcterms.creatorLeong, Shirley Hui Yee
dcterms.dateAccepted2013-05-24T16:38:16Z
dcterms.dateAccepted2015-04-24T14:47:45Z
dcterms.dateSubmitted2013-05-24T16:38:16Z
dcterms.dateSubmitted2015-04-24T14:47:45Z
dcterms.descriptionDepartment of Applied Mathematics and Statisticsen_US
dcterms.extent141 pg.en_US
dcterms.formatApplication/PDFen_US
dcterms.formatMonograph
dcterms.identifierhttp://hdl.handle.net/1951/60234
dcterms.identifierhttp://hdl.handle.net/11401/71500
dcterms.issued2012-05-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2013-05-24T16:38:16Z (GMT). No. of bitstreams: 1 StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf: 41286 bytes, checksum: 425a156df10bbe213bfdf4d175026e82 (MD5) Previous issue date: 1en
dcterms.provenanceMade available in DSpace on 2015-04-24T14:47:45Z (GMT). No. of bitstreams: 3 StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf.jpg: 1934 bytes, checksum: c116f0e1e7be19420106a88253e31f2e (MD5) StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf.txt: 336 bytes, checksum: 84c0f8f99f2b4ae66b3cc3ade09ad2e9 (MD5) StonyBrookUniversityETDPageEmbargo_20130517082608_116839.pdf: 41286 bytes, checksum: 425a156df10bbe213bfdf4d175026e82 (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectStatistics
dcterms.titlePartial Correlation Network Analysis for Mixed Data
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record