Text Categorization using an Ensemble Classifier based on a Mean Co-Association Matrix
Text Categorization using an Ensemble Classifier based on a Mean Co-Association Matrix
dc.contributor.author | João Mendes Moreira | en |
dc.contributor.author | Luís Moreira Matias | en |
dc.contributor.author | João Gama | en |
dc.contributor.author | Pavel Brazdil | en |
dc.date.accessioned | 2017-11-16T13:46:15Z | |
dc.date.available | 2017-11-16T13:46:15Z | |
dc.date.issued | 2012 | en |
dc.description.abstract | Text Categorization (TC) has attracted the attention of the research community in the last decade. Algorithms like Support Vector Machines, Naïve Bayes or k Nearest Neighbors have been used with good performance, confirmed by several comparative studies. Recently, several ensemble classifiers were also introduced in TC. However, many of those can only provide a category for a given new sample. Instead, in this paper, we propose a methodology - MECAC - to build an ensemble of classifiers that has two advantages to other ensemble methods: 1) it can be run using parallel computing, saving processing time and 2) it can extract important statistics from the obtained clusters. It uses the mean co-association matrix to solve binary TC problems. Our experiments revealed that our framework performed, on average, 2.04% better than the best individual classifier on the tested datasets. These results were statistically validated for a significance level of 0.05 using the Friedman Test. | en |
dc.identifier.uri | http://repositorio.inesctec.pt/handle/123456789/2511 | |
dc.language | eng | en |
dc.relation | 5120 | en |
dc.relation | 5320 | en |
dc.relation | 5339 | en |
dc.relation | 5450 | en |
dc.rights | info:eu-repo/semantics/openAccess | en |
dc.title | Text Categorization using an Ensemble Classifier based on a Mean Co-Association Matrix | en |
dc.type | conferenceObject | en |
dc.type | Publication | en |