Show simple item record

dc.identifier.urihttp://hdl.handle.net/1951/55538
dc.identifier.urihttp://hdl.handle.net/11401/72596
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractThe simple read/write interface exposed by traditional disk I/O systems isinadequate for low-locality update-intensive workloads because it limits theflexibility of the disk I/O systems in scheduling disk access requests andresults in inefficient use of buffer memory and disk bandwidth. We proposed anovel disk I/O subsystem architecture called Batching mOdifications withSequential Commit (BOSC), which is optimized for workloads characterized byintensive random updates. BOSC improves the sustained disk update throughput byeffectively aggregating disk update operations and sequentially committing themto disk.We demonstrated the benefits of BOSC by adapting it to 3 different storagesystems. The first one is a continuous data protection system called Mariner.Mariner is an iSCSI-based storage system that is designed to providecomprehensive data protection on commodity hardware while offering the sameperformance as those without any such protection. With the help of BOSC inmetadata updating, the throughput of Mariner has less than 10\% degradationcompared to that without metadata updating.Flash-based storage is the second storage system we leveraged BOSC.Because of the physics underlying the flash memory technology and the coarseaddress mapping granularity used in the on-board flash translation layer (FTL),commodity flash disks exhibit poor random write performance. We designed LFSM, aLog-structured Flash Storage Manager, to eliminate the random write performanceproblem of commodity flash disks by employing data logging and BOSC in metadataupdating. LFSM is able to reduce the average write latency of a commodity flashdisk by a factor of more than 6 under standard benchmarks.As a third example, we applied BOSC to a scalable data de-duplicationsystem based on the incremental backups. Each input block is de-duplicated bycomparing its fingerprint, a collision-free hash value, with existingfingerprints. A range-based block group, called segment, is the basic unit topreserve data locality for incremental backups. We propose four novel techniquesto improve the de-duplication throughput with minimal impact on data de-duplicationratio (DDR). BOSC is employed to eliminate the performance bottleneck due tocommitting segment updates to the disk.
dcterms.available2012-05-15T18:04:58Z
dcterms.available2015-04-24T14:52:44Z
dcterms.contributorKorach, Chaden_US
dcterms.contributorJie Gaoen_US
dcterms.contributorNakamura, Toshioen_US
dcterms.contributorRobert Johnsonen_US
dcterms.contributorDavid D. Chambliss.en_US
dcterms.creatorLu, Maohua
dcterms.dateAccepted2012-05-15T18:04:58Z
dcterms.dateAccepted2015-04-24T14:52:44Z
dcterms.dateSubmitted2012-05-15T18:04:58Z
dcterms.dateSubmitted2015-04-24T14:52:44Z
dcterms.descriptionDepartment of Computer Scienceen_US
dcterms.formatApplication/PDFen_US
dcterms.formatMonograph
dcterms.identifierhttp://hdl.handle.net/1951/55538
dcterms.identifierLu_grad.sunysb_0771E_10199.pdfen_US
dcterms.identifierhttp://hdl.handle.net/11401/72596
dcterms.issued2010-08-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2012-05-15T18:04:58Z (GMT). No. of bitstreams: 1 Lu_grad.sunysb_0771E_10199.pdf: 1803667 bytes, checksum: 7527d4f71833b346bf40908625547d41 (MD5) Previous issue date: 1en
dcterms.provenanceMade available in DSpace on 2015-04-24T14:52:44Z (GMT). No. of bitstreams: 3 Lu_grad.sunysb_0771E_10199.pdf.jpg: 1894 bytes, checksum: a6009c46e6ec8251b348085684cba80d (MD5) Lu_grad.sunysb_0771E_10199.pdf.txt: 536283 bytes, checksum: 17849f4c5036068d6a4c1e3d6e4c83de (MD5) Lu_grad.sunysb_0771E_10199.pdf: 1803667 bytes, checksum: 7527d4f71833b346bf40908625547d41 (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectComputer Science
dcterms.subjectContinuous Data Protection, De-duplication, File system consistency, Flash Translation Layer, Metadata Update, versioning file system
dcterms.titleEfficient Metadata Update Techniques for Storage Systems
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record