Resampling strategies for regression

dc.contributor.author Luís Torgo en
dc.contributor.author Paula Oliveira Branco en
dc.contributor.author Rita Paula Ribeiro en
dc.contributor.author Pfahringer,B en
dc.date.accessioned 2017-12-21T12:24:27Z
dc.date.available 2017-12-21T12:24:27Z
dc.date.issued 2015 en
dc.description.abstract Several real world prediction problems involve forecasting rare values of a target variable. When this variable is nominal, we have a problem of class imbalance that was thoroughly studied within machine learning. For regression tasks, where the target variable is continuous, few works exist addressing this type of problem. Still, important applications involve forecasting rare extreme values of a continuous target variable. This paper describes a contribution to this type of tasks. Namely, we propose to address such tasks by resampling approaches that change the distribution of the given data set to decrease the problem of imbalance between the rare target cases and the most frequent ones. We present two modifications of well-known resampling strategies for classification tasks: the under-sampling and the synthetic minority over-sampling technique (SMOTE) methods. These modifications allow the use of these strategies on regression tasks where the goal is to forecast rare extreme values of the target variable. In an extensive set of experiments, we provide empirical evidence for the superiority of our proposals for these particular regression tasks. The proposed resampling methods can be used with any existing regression algorithm, which means that they are general tools for addressing problems of forecasting rare extreme values of a continuous target variable. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/4625
dc.identifier.uri http://dx.doi.org/10.1111/exsy.12081 en
dc.language eng en
dc.relation 4983 en
dc.relation 5934 en
dc.relation 4982 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Resampling strategies for regression en
dc.type article en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-00G-B5Y.pdf
Size:
710.27 KB
Format:
Adobe Portable Document Format
Description: