Very fast decision rules for classification in data streams

dc.contributor.author Kosina, P en
dc.contributor.author João Gama en
dc.date.accessioned 2017-11-23T11:32:32Z
dc.date.available 2017-11-23T11:32:32Z
dc.date.issued 2015 en
dc.description.abstract Data stream mining is the process of extracting knowledge structures from continuous, rapid data records. Many decision tasks can be formulated as stream mining problems, and therefore many new algorithms for data streams are being proposed. Decision rules are one of the most interpretable and flexible models for predictive data mining. Nevertheless, few algorithms have been proposed in the literature to learn rule models for time-changing and high-speed flows of data. In this paper we present the very fast decision rules (VFDR) algorithm and discuss interesting extensions to the base version. All the proposed versions are one-pass and any-time algorithms. They work on-line and learn ordered or unordered rule sets. Algorithms designed to work with data streams should be able to detect changes and quickly adapt the decision model. To manage these situations, we also present the adaptive extension (AVFDR), which detects changes in the process generating data and adapts the decision model. Detecting local drifts takes advantage of the modularity of the rule sets. In AVFDR, each individual rule monitors the evolution of performance metrics to detect concept drift. AVFDR prunes rules whenever a drift is signaled. This explicit change detection mechanism provides useful information about the dynamics of the process generating data, enables faster adaptation to changes, and generates more compact rule sets. The experimental evaluation demonstrates that the algorithms achieve competitive results in comparison to alternative methods and that the adaptive methods are able to learn fast and compact rule sets from evolving streams. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/3794
dc.identifier.uri http://dx.doi.org/10.1007/s10618-013-0340-z en
dc.language eng en
dc.relation 5120 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Very fast decision rules for classification in data streams en
dc.type article en
dc.type Publication en
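
The abstract states that in AVFDR each rule monitors its own performance and is pruned when a drift is signaled. The sketch below only illustrates that idea and is not the authors' implementation. The names `RuleMonitor`, `observe`, and `min_samples` are hypothetical, and the specific test (flag drift once the running error rate plus its standard deviation exceeds the best level seen so far by three standard deviations) is an assumption borrowed from common error-rate drift detectors, not something this record confirms.

```python
import math
from dataclasses import dataclass


@dataclass
class RuleMonitor:
    """Running error statistics for one rule (hypothetical helper)."""
    min_samples: int = 30       # assumed warm-up before testing for drift
    n: int = 0                  # examples covered by the rule so far
    errors: int = 0             # mistakes made on those examples
    p_min: float = math.inf     # error rate at the best point seen
    s_min: float = math.inf     # its standard deviation at that point

    def update(self, mistake: bool) -> bool:
        """Record one prediction outcome; return True if drift is signaled."""
        self.n += 1
        self.errors += int(mistake)
        p = self.errors / self.n
        s = math.sqrt(p * (1.0 - p) / self.n)
        if self.n < self.min_samples:
            return False
        # Remember the lowest (error rate + deviation) observed so far.
        if p + s < self.p_min + self.s_min:
            self.p_min, self.s_min = p, s
        # Assumed threshold: three deviations above the best level.
        return p + s > self.p_min + 3.0 * self.s_min


def observe(rule_set: dict, rule_id: str, mistake: bool) -> None:
    """Update the covering rule's monitor and prune the rule on drift."""
    if rule_set[rule_id].update(mistake):
        del rule_set[rule_id]
```

Because each rule carries its own monitor, a change that affects only part of the input space removes only the rules covering that region, which is the modularity the abstract refers to.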
Files
Original bundle
Name: P-00G-1QB.pdf
Size: 606.28 KB
Format: Adobe Portable Document Format