Show simple item record

dc.identifier.urihttp://hdl.handle.net/11401/77607
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractHigh-throughput DNA sequencing technologies have given us the power to understand genetic disease at extraordinarily detailed resolution. It is now possible to sequence a person’s whole genome and search for the genetic markers that contribute to specific disease, or even markers that contribute to the possibility of developing a new one. However, the task of understanding and sifting through billions of data-points is not a trivial one. There are diverse statistical, algorithmic and practical implementation challenges that must be met so that we can accurately and reliably analyze the vast swaths of data that come from human DNA sequences. Indeed, strategies for detecting human sequence variation in exome and whole genome sequencing data are myriad, but the reliability of these methods, even when applied to the same underlying sequencing data, is unclear. Furthermore, in the context of imperfect agreement among results stemming from these various methods, powerful strategies for assessing and recovering true, but missed, sequence variation have yet to be devised. Most research effort has focused on mitigating false detection. It is in this context that highthroughput sequencing technologies are used for both research and clinical investigations. In the medical genomics realm, our understanding of the genetic origins of human disease has been empowered by these technologies, but unreliable analyses have led to a number of false positive research findings. The community has since recognized the need for robust and comprehensive sequencing and analysis methods, particularly in cases where only a small number of samples from probands or affected families are available. In the clinical realm, most agree that there exists an enormous amount of potential for these technologies to transform clinical care, but the practicality of their use is currently understudied, particularly for individual patients among complex cohorts, such as those harboring psychiatric afflictions. In order to move the field of human genetics research forward and to contribute toward the successful implementation of genomics-guided medical care, several key advancements are needed: a characterization of the reliability of current high-throughout analysis methods, methods for recovering missed sequence variants from discordant detection sets, an understanding of current infrastructural deficiencies for implementation, general guidance on how to use diverse sets of analysis results in the context of generating robust relationships between human sequence variation and disease, and new methodological approaches for generating sequence analysis results that accurately characterize uncertainties in the underlying data, so that the reliabilities of their inferences remain robust throughout the lifetime of their use.
dcterms.abstractHigh-throughput DNA sequencing technologies have given us the power to understand genetic disease at extraordinarily detailed resolution. It is now possible to sequence a person’s whole genome and search for the genetic markers that contribute to specific disease, or even markers that contribute to the possibility of developing a new one. However, the task of understanding and sifting through billions of data-points is not a trivial one. There are diverse statistical, algorithmic and practical implementation challenges that must be met so that we can accurately and reliably analyze the vast swaths of data that come from human DNA sequences. Indeed, strategies for detecting human sequence variation in exome and whole genome sequencing data are myriad, but the reliability of these methods, even when applied to the same underlying sequencing data, is unclear. Furthermore, in the context of imperfect agreement among results stemming from these various methods, powerful strategies for assessing and recovering true, but missed, sequence variation have yet to be devised. Most research effort has focused on mitigating false detection. It is in this context that highthroughput sequencing technologies are used for both research and clinical investigations. In the medical genomics realm, our understanding of the genetic origins of human disease has been empowered by these technologies, but unreliable analyses have led to a number of false positive research findings. The community has since recognized the need for robust and comprehensive sequencing and analysis methods, particularly in cases where only a small number of samples from probands or affected families are available. In the clinical realm, most agree that there exists an enormous amount of potential for these technologies to transform clinical care, but the practicality of their use is currently understudied, particularly for individual patients among complex cohorts, such as those harboring psychiatric afflictions. In order to move the field of human genetics research forward and to contribute toward the successful implementation of genomics-guided medical care, several key advancements are needed: a characterization of the reliability of current high-throughout analysis methods, methods for recovering missed sequence variants from discordant detection sets, an understanding of current infrastructural deficiencies for implementation, general guidance on how to use diverse sets of analysis results in the context of generating robust relationships between human sequence variation and disease, and new methodological approaches for generating sequence analysis results that accurately characterize uncertainties in the underlying data, so that the reliabilities of their inferences remain robust throughout the lifetime of their use.
dcterms.available2017-09-20T16:52:59Z
dcterms.contributorPatro, Roberten_US
dcterms.contributorLyon, Gholsonen_US
dcterms.contributorMason, Christopheren_US
dcterms.contributorRest, Joshuaen_US
dcterms.contributorFerson, Scotten_US
dcterms.contributor.en_US
dcterms.creatorO'Rawe, Jason
dcterms.dateAccepted2017-09-20T16:52:59Z
dcterms.dateSubmitted2017-09-20T16:52:59Z
dcterms.descriptionDepartment of Geneticsen_US
dcterms.extent214 pg.en_US
dcterms.formatMonograph
dcterms.formatApplication/PDFen_US
dcterms.identifierhttp://hdl.handle.net/11401/77607
dcterms.issued2016-12-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2017-09-20T16:52:59Z (GMT). No. of bitstreams: 1 ORawe_grad.sunysb_0771E_12935.pdf: 15993796 bytes, checksum: cf91ab1a58a6002c9383daa4fa0ac54c (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectGenetics
dcterms.titleToward Reliable Analysis of Individual Genomes
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record