Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining

dc.contributor.author Luís Moreira Matias en
dc.contributor.author João Gama en
dc.contributor.author João Mendes Moreira en
dc.date.accessioned 2018-01-03T10:36:55Z
dc.date.available 2018-01-03T10:36:55Z
dc.date.issued 2016 en
dc.description.abstract Learning from data streams is a challenge faced by data science professionals from multiple industries. Most of them struggle hardly on applying traditional Machine Learning algorithms to solve these problems. It happens so due to their high availability on ready-to-use software libraries on big data technologies (e.g. SparkML). Nevertheless, most of them cannot cope with the key characteristics of this type of data such as high arrival rate and/or non-stationary distributions. In this paper, we introduce a generic and yet simplistic framework to fill this gap denominated Concept Neurons. It leverages on a combination of continuous inspection schemas and residual-based updates over the model parameters and/or the model output. Such framework can empower the resistance of most of induction learning algorithms to concept drifts. Two distinct and hence closely related flavors are introduced to handle different drift types. Experimental results on successful distinct applications on different domains along transportation industry are presented to uncover the hidden potential of this methodology. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/5339
dc.identifier.uri http://dx.doi.org/10.1007/978-3-319-46131-1_18 en
dc.language eng en
dc.relation 5120 en
dc.relation 5450 en
dc.relation 5320 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-00K-THA.pdf
Size:
449.7 KB
Format:
Adobe Portable Document Format
Description: