SMOTE for regression

dc.contributor.author Luís Torgo en
dc.contributor.author Rita Paula Ribeiro en
dc.contributor.author Pfahringer,B en
dc.contributor.author Paula Oliveira Branco en
dc.date.accessioned 2017-12-21T12:32:16Z
dc.date.available 2017-12-21T12:32:16Z
dc.date.issued 2013 en
dc.description.abstract Several real world prediction problems involve forecasting rare values of a target variable. When this variable is nominal we have a problem of class imbalance that was already studied thoroughly within machine learning. For regression tasks, where the target variable is continuous, few works exist addressing this type of problem. Still, important application areas involve forecasting rare extreme values of a continuous target variable. This paper describes a contribution to this type of tasks. Namely, we propose to address such tasks by sampling approaches. These approaches change the distribution of the given training data set to decrease the problem of imbalance between the rare target cases and the most frequent ones. We present a modification of the well-known Smote algorithm that allows its use on these regression tasks. In an extensive set of experiments we provide empirical evidence for the superiority of our proposals for these particular regression tasks. The proposed SmoteR method can be used with any existing regression algorithm turning it into a general tool for addressing problems of forecasting rare extreme values of a continuous target variable. © 2013 Springer-Verlag. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/4640
dc.identifier.uri http://dx.doi.org/10.1007/978-3-642-40669-0_33 en
dc.language eng en
dc.relation 5934 en
dc.relation 4983 en
dc.relation 4982 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title SMOTE for regression en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-008-EG1.pdf
Size:
291.28 KB
Format:
Adobe Portable Document Format
Description: