Please use this identifier to cite or link to this item: http://repositorio.inesctec.pt/handle/123456789/3508
Title: Workload-aware table splitting for NoSQL
Authors: Francisco Miguel Cruz
Francisco Almeida Maia
Rui Carlos Oliveira
Ricardo Pereira Vilaça
Issue Date: 2014
Abstract: Massive scale data stores, which exhibit highly desirable scalability and availability properties are becoming pivotal systems in nowadays infrastructures. Scalability achieved by these data stores is anchored on data independence; there is no clear relationship between data, and atomic inter-node operations are not a concern. Such assumption over data allows aggressive data partitioning. In particular, data tables are horizontally partitioned and spread across nodes for load balancing. However, in current versions of these data stores, partitioning is either a manual process or automated but simply based on table size. We argue that size based partitioning does not lead to acceptable load balancing as it ignores data access patterns, namely data hotspots. Moreover, manual data partitioning is cumbersome and typically infeasible in large scale scenarios. In this paper we propose an automated table splitting mechanism that takes into account the system workload. We evaluate such mechanism showing that it simple, non-intrusive and effective. Copyright 2014 ACM.
URI: http://repositorio.inesctec.pt/handle/123456789/3508
http://dx.doi.org/10.1145/2554850.2555027
metadata.dc.type: conferenceObject
Publication
Appears in Collections:HASLab - Indexed Articles in Conferences

Files in This Item:
File Description SizeFormat 
P-009-RJ7.pdf457.82 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.