Clustering and classifying text documents a revisit to tagging integration methods
Clustering and classifying text documents a revisit to tagging integration methods
dc.contributor.author | Cunha,E | en |
dc.contributor.author | Álvaro Figueira | en |
dc.contributor.author | Mealha,O | en |
dc.date.accessioned | 2018-01-10T10:20:11Z | |
dc.date.available | 2018-01-10T10:20:11Z | |
dc.date.issued | 2013 | en |
dc.description.abstract | In this paper we analyze and discuss two methods that are based on the traditional k-means for document clustering and that feature integration of social tags in the process. The first one allows the integration of tags directly into a Vector Space Model, and the second one proposes the integration of tags in order to select the initial seeds. We created a predictive model for the impact of the tags' integration in both models, and compared the two methods using the traditional k-means++ and the novel k-C algorithm. To compare the results, we propose a new internal measure, allowing the computation of the cluster compactness. The experimental results indicate that the careful selection of seeds on the k-C algorithm present better results to those obtained with the k-means++, with and without integration of tags. | en |
dc.identifier.uri | http://repositorio.inesctec.pt/handle/123456789/5835 | |
dc.identifier.uri | http://dx.doi.org/10.5220/0004545201600168 | en |
dc.language | eng | en |
dc.relation | 5088 | en |
dc.rights | info:eu-repo/semantics/openAccess | en |
dc.title | Clustering and classifying text documents a revisit to tagging integration methods | en |
dc.type | conferenceObject | en |
dc.type | Publication | en |
Files
Original bundle
1 - 1 of 1