A Survey and Classification of Storage Deduplication Systems

dc.contributor.author João Tiago Paulo en
dc.contributor.author José Orlando Pereira en
dc.date.accessioned 2017-12-15T12:24:43Z
dc.date.available 2017-12-15T12:24:43Z
dc.date.issued 2014 en
dc.description.abstract The automatic elimination of duplicate data in a storage system, commonly known as deduplication, is increasingly accepted as an effective technique to reduce storage costs. Thus, it has been applied to different storage types, including archives and backups, primary storage, within solid-state drives, and even to random access memory. Although the general approach to deduplication is shared by all storage types, each poses specific challenges and leads to different trade-offs and solutions. This diversity is often misunderstood, thus underestimating the relevance of new research and development. The first contribution of this article is a classification of deduplication systems according to six criteria that correspond to key design decisions: granularity, locality, timing, indexing, technique, and scope. This classification identifies and describes the different approaches used for each of them. As a second contribution, we describe which combinations of these design decisions have been proposed and found more useful for challenges in each storage type. Finally, outstanding research challenges and unexplored design points are identified and discussed. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/4154
dc.identifier.uri http://dx.doi.org/10.1145/2611778 en
dc.language eng en
dc.relation 5621 en
dc.relation 5602 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title A Survey and Classification of Storage Deduplication Systems en
dc.type article en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-009-SED.pdf
Size:
555.8 KB
Format:
Adobe Portable Document Format
Description: