Show simple item record

dc.identifier.urihttp://hdl.handle.net/11401/77313
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractA fundamental building block for an IaaS (Infrastructure-as-a-Service) cloud service such as Amazon's EC2 is a storage virtualization system that provides block-level storage services to individual virtual machines over the network. This dissertation addresses four major problems in such a block-level cloud storage system, in the context of an end-to-end IaaS solution called ITRI Cloud OS. First, to effectively eliminate redundancies in stored data blocks, we propose a scalable block-level deduplication engine called <bold>Sungem</bold>, which uses both sampling and prefetching to minimize the performance overhead of fingerprint accesses, and features a storage block garbage collection algorithm whose run- time overhead is proportional only to the size of the delta between consecutive backup operations. Second, to efficiently flush meta-data updates associated with large-scale block-level storage management, we developed a novel storage system architecture called <bold>BOSC (Batching mOdifications with Sequential Commit)</bold>, which uses largely sequential writes to commit updates to disk and is thus able to sustain high-throughput and low-latency metadata updates that are largely random. Third, as part of the BOSC architecture, we invented a high-throughput low-latency disk logging system called <bold>Beluga</bold>, which fashions a carefully tuned disk write pipeline and makes it possible to provide, on an array of three commodity 7200 RPM SATA disks, close to 5 million fine-grained (64-byte) disk logging operations per second, which is close to the maximum possible bandwidth on a commodity disk, while keeping the latency of each logging operation under 1 msec. Finally, we devised a set of techniques for supporting software-defined storage service on a distributed and replicated storage architecture. Specifically, we developed a distributed storage QoS guarantee system called <bold>Cheetah</bold>, which is able to provide a bandwidth guarantee to each virtual disk attached to a virtual machine, while ensuring the loads on the distributed storage nodes be balanced, and the locality of the access stream associated with each virtual disk be preserved as much as possible.
dcterms.available2017-09-20T16:52:29Z
dcterms.contributorChiueh, Tzi-ckeren_US
dcterms.contributorZadok, Erezen_US
dcterms.contributorPorter, Donalden_US
dcterms.contributorAguilera, Marcos.en_US
dcterms.creatorSimha, Dilip Nijagal
dcterms.dateAccepted2017-09-20T16:52:29Z
dcterms.dateSubmitted2017-09-20T16:52:29Z
dcterms.descriptionDepartment of Computer Science.en_US
dcterms.extent260 pg.en_US
dcterms.formatMonograph
dcterms.formatApplication/PDFen_US
dcterms.identifierhttp://hdl.handle.net/11401/77313
dcterms.issued2014-12-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2017-09-20T16:52:29Z (GMT). No. of bitstreams: 1 Simha_grad.sunysb_0771E_11831.pdf: 4888192 bytes, checksum: 643f5f314b2c211f7ee0b7ec97dce904 (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectComputer science
dcterms.subjectBeluga, BOSC, Cloud Storage, deduplication, Quality of Service, Sungem
dcterms.titleEfficient Implementation Techniques for Block-Level Cloud Storage Systems
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record