autoBagging: Learning to Rank Bagging Workflows with Metalearning

Pinto, F; Vítor Manuel Cerqueira; Carlos Manuel Soares; João Mendes Moreira

Files

P-00M-YFM.pdf (370.09 KB)

Date

2017

Authors

Pinto, F

Vítor Manuel Cerqueira

Carlos Manuel Soares

João Mendes Moreira

Abstract

Machine Learning (ML) has been successfully applied to a wide range of domains and applications. One of the techniques behind most of these successful applications is Ensemble Learning (EL), the field of ML that gave birth to methods such as Random Forests or Boosting. The complexity of applying these techniques, together with the scarcity of ML experts in the market, has created the need for systems that enable a fast and easy drop-in replacement for ML libraries. Automated machine learning (autoML) is the field of ML that attempts to answer these needs. We propose autoBagging, an autoML system that automatically ranks 63 bagging workflows by exploiting past performance and metalearning. Results on 140 classification datasets from the OpenML platform show that autoBagging can yield better performance than the Average Rank method and achieve results that are not statistically different from an ideal model that systematically selects the best workflow for each dataset. For the purpose of reproducibility and generalizability, autoBagging is publicly available as an R package on CRAN.
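
The Average Rank method that the abstract uses as a baseline can be illustrated with a short sketch. This is not autoBagging itself, which learns a ranking from dataset metafeatures, and it is not taken from the R package. It only shows the general idea of the baseline: rank the workflows on each past dataset, average those ranks, and recommend workflows in that fixed order. The function names, the three workflows, and the accuracy values are hypothetical and chosen purely for illustration.

```python
def rank_row(scores):
    """Rank the scores of one dataset: 1 = best (highest score).

    Tied scores share the mean of the ranks they occupy.
    """
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the group while the next score ties with the current one.
        while end + 1 < len(order) and scores[order[end + 1]] == scores[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def average_rank(perf):
    """Average the per-dataset ranks of each workflow.

    perf maps a dataset name to a list of scores, one per workflow,
    with workflows in the same order for every dataset.
    """
    rows = [rank_row(scores) for scores in perf.values()]
    return [sum(col) / len(rows) for col in zip(*rows)]


# Hypothetical past accuracies of three workflows on three datasets.
past_accuracy = {
    "dataset_a": [0.80, 0.85, 0.78],
    "dataset_b": [0.70, 0.70, 0.75],
    "dataset_c": [0.90, 0.92, 0.91],
}

avg = average_rank(past_accuracy)
# Workflow indices ordered from best (lowest average rank) to worst.
recommended = sorted(range(len(avg)), key=lambda i: avg[i])
```

Because the recommendation ignores the characteristics of the new dataset, every dataset receives the same ordering. A metalearning approach such as the one described in the abstract instead aims to adapt the ranking to each dataset.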

URI

http://repositorio.inesctec.pt/handle/123456789/7125

Collections

CESE - Indexed Articles in Conferences