MC-ReliefF: An extension of ReliefF for cost-based feature selection

Thumbnail Image
Date
2014
Authors
Bolon Canedo,V
Beatriz Remeseiro López
Sanchez Marono,N
Alonso Betanzos,A
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The proliferation of high-dimensional data in the last few years has brought a necessity to use dimensionality reduction techniques, in which feature selection is arguably the favorite one. Feature selection consists of detecting relevant features and discarding the irrelevant ones. However, there are some situations where the users are not only interested in the relevance of the selected features but also in the costs that they imply, e.g. economical or computational costs. In this paper an extension of the well-known ReliefF method for feature selection is proposed, which consists of adding a new term to the function which updates the weights of the features so as to be able to reach a trade-off between the relevance of a feature and its associated cost. The behavior of the proposed method is tested on twelve heterogeneous classification datasets as well as a real application, using a support vector machine (SVM) as a classifier. The results of the experimental study show that the approach is sound, since it allows the user to reduce the cost significantly without compromising the classification error.
Description
Keywords
Citation