Please use this identifier to cite or link to this item:
|Title:||WCDS: A Two-Phase Weightless Neural System for Data Stream Clustering|
|Authors:||Douglas Oliveira Cardoso|
|Abstract:||Clustering is a powerful and versatile tool for knowledge discovery, able to provide a valuable information for data analysis in various domains. To perform this task based on streaming data is quite challenging: outdated knowledge needs to be disposed while the current knowledge is obtained from fresh data; since data are continuously flowing, strict efficiency constraints have to be met. This paper presents WCDS, an approach to this problem based on the WiSARD artificial neural network model. This model already had useful characteristics as inherent incremental learning capability and patent functioning speed. These were combined with novel features as an adaptive countermeasure to cluster imbalance, a mechanism to discard expired data, and offline clustering based on a pairwise similarity measure for WiSARD discriminators. In an insightful experimental evaluation, the proposed system had an excellent performance according to multiple quality standards. This supports its applicability for the analysis of data streams.|
|Appears in Collections:||LIAAD - Articles in International Journals|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.