Workload-aware table splitting for NoSQL

dc.contributor.author Francisco Miguel Cruz en
dc.contributor.author Francisco Almeida Maia en
dc.contributor.author Rui Carlos Oliveira en
dc.contributor.author Ricardo Pereira Vilaça en
dc.date.accessioned 2017-11-20T10:34:21Z
dc.date.available 2017-11-20T10:34:21Z
dc.date.issued 2014 en
dc.description.abstract Massive scale data stores, which exhibit highly desirable scalability and availability properties are becoming pivotal systems in nowadays infrastructures. Scalability achieved by these data stores is anchored on data independence; there is no clear relationship between data, and atomic inter-node operations are not a concern. Such assumption over data allows aggressive data partitioning. In particular, data tables are horizontally partitioned and spread across nodes for load balancing. However, in current versions of these data stores, partitioning is either a manual process or automated but simply based on table size. We argue that size based partitioning does not lead to acceptable load balancing as it ignores data access patterns, namely data hotspots. Moreover, manual data partitioning is cumbersome and typically infeasible in large scale scenarios. In this paper we propose an automated table splitting mechanism that takes into account the system workload. We evaluate such mechanism showing that it simple, non-intrusive and effective. Copyright 2014 ACM. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/3508
dc.identifier.uri http://dx.doi.org/10.1145/2554850.2555027 en
dc.language eng en
dc.relation 5594 en
dc.relation 5622 en
dc.relation 5656 en
dc.relation 5635 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Workload-aware table splitting for NoSQL en
dc.type conferenceObject en
dc.type Publication en
Files