Merging Datasets for Hate Speech Classification in Italian

dc.contributor.author	Sérgio Nunes	en
dc.contributor.author	Fortuna,P	en
dc.contributor.author	Bonavita,I	en
dc.contributor.other	5448	en
dc.date.accessioned	2019-05-29T11:08:56Z
dc.date.available	2019-05-29T11:08:56Z
dc.date.issued	2018	en
dc.description.abstract	This paper presents an approach to the HaSpeeDe shared task at Evalita 2018. We followed a standard machine learning procedure with training, validation, and testing phases. We used word embeddings as features and deep learning for classification. We tested the effect of merging two datasets on the classification of messages from Facebook and Twitter. We concluded that training and testing on data from the same social network was required to achieve good performance. Moreover, adding data from a different social network improved the results, indicating that more generalized models can be an advantage.	en
dc.identifier.uri	http://repositorio.inesctec.pt/handle/123456789/9548
dc.language	eng	en
dc.rights	info:eu-repo/semantics/openAccess	en
dc.title	Merging Datasets for Hate Speech Classification in Italian	en
dc.type	Publication	en
dc.type	conferenceObject	en
