dc.identifier.uri: http://hdl.handle.net/1951/59798
dc.identifier.uri: http://hdl.handle.net/11401/71354
dc.description.sponsorship: This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree. [en_US]
dc.format: Monograph
dc.format.medium: Electronic Resource [en_US]
dc.language.iso: en_US
dc.publisher: The Graduate School, Stony Brook University: Stony Brook, NY.
dc.type: Thesis
dcterms.abstract: The context of data is important for optimal performance of data management systems such as deduplication. In typical operating systems, the block layer of the I/O stack is unaware of the context of the data it is operating on. Thanks to the simplicity and modularity of the block layer interface, it is one of the best places to implement data deduplication. We designed an interface between file systems and the block layer that allows a file system to pass the context of the data to the underlying deduplication system at the block layer. This context takes the form of a "hint" that conveys information useful to the block-layer deduplication system, so that it can optimize its operation. For example, the hint can indicate what data is worthy of deduplication, what data should not be deduplicated at all, or that an impending set of I/O operations is likely to generate a lot of duplicates. With hints, we observed a 1.5-2x reduction in I/Os and a 10% improvement in CPU utilization for metadata-intensive workloads, compared to a context-unaware deduplication system at the block layer. Our hinting system degraded the deduplication ratio by only 3-5%. To implement hints, we had to change less than 0.6% of the Linux kernel, and we changed approximately 600 LoC of file system code in two file systems (Ext3 and NILFS2). Our block-layer deduplication system is about 4,000 LoC of standalone kernel code.
dcterms.available: 2013-05-22T17:35:17Z
dcterms.available: 2015-04-24T14:47:10Z
dcterms.contributor: Johnson, Rob [en_US]
dcterms.contributor: Zadok, Erez [en_US]
dcterms.contributor: Porter, Donald. [en_US]
dcterms.creator: Mudrankit, Amar
dcterms.dateAccepted: 2013-05-22T17:35:17Z
dcterms.dateAccepted: 2015-04-24T14:47:10Z
dcterms.dateSubmitted: 2013-05-22T17:35:17Z
dcterms.dateSubmitted: 2015-04-24T14:47:10Z
dcterms.description: Department of Computer Science [en_US]
dcterms.extent: 55 pg. [en_US]
dcterms.format: Application/PDF [en_US]
dcterms.format: Monograph
dcterms.identifier: Mudrankit_grad.sunysb_0771M_10975 [en_US]
dcterms.identifier: http://hdl.handle.net/1951/59798
dcterms.identifier: http://hdl.handle.net/11401/71354
dcterms.issued: 2012-05-01
dcterms.language: en_US
dcterms.provenance: Made available in DSpace on 2013-05-22T17:35:17Z (GMT). No. of bitstreams: 1 Mudrankit_grad.sunysb_0771M_10975.pdf: 383216 bytes, checksum: c0602446148a6e3c54c1ef12ca625353 (MD5) Previous issue date: 1 [en]
dcterms.provenance: Made available in DSpace on 2015-04-24T14:47:10Z (GMT). No. of bitstreams: 3 Mudrankit_grad.sunysb_0771M_10975.pdf.jpg: 1894 bytes, checksum: a6009c46e6ec8251b348085684cba80d (MD5) Mudrankit_grad.sunysb_0771M_10975.pdf.txt: 126564 bytes, checksum: b2e912dfcac5334a1e56f69b9c205bfe (MD5) Mudrankit_grad.sunysb_0771M_10975.pdf: 383216 bytes, checksum: c0602446148a6e3c54c1ef12ca625353 (MD5) Previous issue date: 1 [en]
dcterms.publisher: The Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subject: Block Layer, Context-aware, device mapper, In-line Deduplication
dcterms.subject: Computer science
dcterms.title: A Context Aware Block Layer: The Case for Block Layer Deduplication
dcterms.type: Thesis

