Please use this identifier to cite or link to this item: http://repositorio.inesctec.pt/handle/123456789/9548
Title: Merging Datasets for Hate Speech Classification in Italian
Authors: Sérgio Nunes
Fortuna,P
Bonavita,I
Issue Date: 2018
Abstract: This paper presents an approach to the shared task HaSpeeDe within Evalita 2018. We followed a standard machine learning procedure with training, validation, and testing phases. We considered word embedding as features and deep learning for classification. We tested the effect of merging two datasets in the classification of messages from Facebook and Twitter. We concluded that using data for training and testing from the same social network was a requirement to achieve a good performance. Moreover, adding data from a different social network allowed to improve the results, indicating that more generalized models can be an advantage.
URI: http://repositorio.inesctec.pt/handle/123456789/9548
metadata.dc.type: Publication
conferenceObject
Appears in Collections:CSIG - Articles in International Conferences

Files in This Item:
File Description SizeFormat 
P-00Q-0Z8.pdf285.03 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.