The Impact of Longstanding Messages In Micro-Blogging Classification

Costa,J; Silva,C; Mário João Antunes; Ribeiro,B

The Impact of Longstanding Messages In Micro-Blogging Classification

Files

P-00K-2R1.pdf (368.71 KB)

Date

2015

Authors

Costa,J

Silva,C

Mário João Antunes

Ribeiro,B

Abstract

Social networks are making part of the daily routine of millions of users. Twitter is among Facebook and Instagram one of the most used, and can be seen as a relevant source of information as users share not only daily status, but rapidly propagate news and events that occur worldwide. Considering the dynamic nature of social networks, and their potential in information spread, it is imperative to find learning strategies able to learn in these environments and cope with their dynamic nature. Time plays an important role by easily out-dating information, being crucial to understand how informative can past events be to current learning models and for how long it is relevant to store previously seen information, to avoid the computation burden associated with the amount of data produced. In this paper we study the impact of longstanding messages in micro-blogging classification by using different training time-window sizes in the learning process. Since there are few studies dealing with drift in Twitter and thus little is known about the types of drift that may occur, we simulate different types of drift in an artificial dataset to evaluate and validate our strategy. Results shed light on the relevance of previously seen examples according to different types of drift.

URI

http://repositorio.inesctec.pt/handle/123456789/5246
http://dx.doi.org/10.1109/IJCNN.2015.7280731

Collections

CRACS - Indexed Articles in Conferences

Full item page