A survey on learning from data streams: current and future trends
A survey on learning from data streams: current and future trends
No Thumbnail Available
Date
2012
Authors
João Gama
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Abstract Nowadays, there are applications in which the
data are modeled best not as persistent tables, but rather as
transient data streams. In this article, we discuss the limitations
of current machine learning and data mining algorithms.
We discuss the fundamental issues in learning in
dynamic environments like continuously maintain learning
models that evolve over time, learning and forgetting, concept
drift and change detection. Data streams produce a huge
amount of data that introduce new constraints in the design
of learning algorithms: limited computational resources in
terms ofmemory, cpu power, and communication bandwidth.
We present some illustrative algorithms, designed to taking
these constrains into account, for decision-tree learning, hierarchical
clustering and frequent pattern mining. We identify
the main issues and current challenges that emerge in learning
from data streams that open research lines for further
developments.