Text Categorization using an Ensemble Classifier based on a Mean Co-Association Matrix

dc.contributor.author João Mendes Moreira en
dc.contributor.author Luís Moreira Matias en
dc.contributor.author João Gama en
dc.contributor.author Pavel Brazdil en
dc.date.accessioned 2017-11-16T13:46:15Z
dc.date.available 2017-11-16T13:46:15Z
dc.date.issued 2012 en
dc.description.abstract Text Categorization (TC) has attracted the attention of the research community in the last decade. Algorithms like Support Vector Machines, Naïve Bayes or k Nearest Neighbors have been used with good performance, confirmed by several comparative studies. Recently, several ensemble classifiers were also introduced in TC. However, many of those can only provide a category for a given new sample. Instead, in this paper, we propose a methodology - MECAC - to build an ensemble of classifiers that has two advantages to other ensemble methods: 1) it can be run using parallel computing, saving processing time and 2) it can extract important statistics from the obtained clusters. It uses the mean co-association matrix to solve binary TC problems. Our experiments revealed that our framework performed, on average, 2.04% better than the best individual classifier on the tested datasets. These results were statistically validated for a significance level of 0.05 using the Friedman Test. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/2511
dc.language eng en
dc.relation 5120 en
dc.relation 5320 en
dc.relation 5339 en
dc.relation 5450 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Text Categorization using an Ensemble Classifier based on a Mean Co-Association Matrix en
dc.type conferenceObject en
dc.type Publication en
Files