Please use this identifier to cite or link to this item: http://repositorio.inesctec.pt/handle/123456789/2511
Title: Text Categorization using an Ensemble Classifier based on a Mean Co-Association Matrix
Authors: João Mendes Moreira
Luís Moreira Matias
João Gama
Pavel Brazdil
Issue Date: 2012
Abstract: Text Categorization (TC) has attracted the attention of the research community in the last decade. Algorithms like Support Vector Machines, Naïve Bayes or k Nearest Neighbors have been used with good performance, confirmed by several comparative studies. Recently, several ensemble classifiers were also introduced in TC. However, many of those can only provide a category for a given new sample. Instead, in this paper, we propose a methodology - MECAC - to build an ensemble of classifiers that has two advantages to other ensemble methods: 1) it can be run using parallel computing, saving processing time and 2) it can extract important statistics from the obtained clusters. It uses the mean co-association matrix to solve binary TC problems. Our experiments revealed that our framework performed, on average, 2.04% better than the best individual classifier on the tested datasets. These results were statistically validated for a significance level of 0.05 using the Friedman Test.
URI: http://repositorio.inesctec.pt/handle/123456789/2511
metadata.dc.type: conferenceObject
Publication
Appears in Collections:LIAAD - Indexed Articles in Conferences

Files in This Item:
File Description SizeFormat 
PS-07729.pdf1.29 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.