Distributed Adaptive Model Rules for mining big data streams

dc.contributor.author Vu,AT en
dc.contributor.author De Francisci Morales,G en
dc.contributor.author João Gama en
dc.contributor.author Bifet,A en
dc.date.accessioned 2018-01-03T10:34:08Z
dc.date.available 2018-01-03T10:34:08Z
dc.date.issued 2015 en
dc.description.abstract Decision rules are among the most expressive data mining models. We propose the first distributed streaming algorithm to learn decision rules for regression tasks. The algorithm is available in samoa (Scalable Advanced Massive Online Analysis), an open-source platform for mining big data streams. It uses a hybrid of vertical and horizontal parallelism to distribute Adaptive Model Rules (AMRules) on a cluster. The decision rules built by AMRules are comprehensible models, where the antecedent of a rule is a conjunction of conditions on the attribute values, and the consequent is a linear combination of the attributes. Our evaluation shows that this implementation is scalable in relation to CPU and memory consumption. On a small commodity Samza cluster of 9 nodes, it can handle a rate of more than 30000 instances per second, and achieve a speedup of up to 4.7x over the sequential version. © 2014 IEEE. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/5302
dc.identifier.uri http://dx.doi.org/10.1109/bigdata.2014.7004251 en
dc.language eng en
dc.relation 5120 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Distributed Adaptive Model Rules for mining big data streams en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-00A-7ZB.pdf
Size:
469.64 KB
Format:
Adobe Portable Document Format
Description: