An Exploratory Study on the impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents

Alípio Jorge; Ricardo Campos; Gaël Dias

An Exploratory Study on the impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents

Date

2011

Authors

Alípio Jorge

Ricardo Campos

Gaël Dias

Abstract

In the last few years, a huge amount of temporal written information has become widely available on the Internet with the advent of forums, blogs and social networks. This gave rise to a new challenging problem called future retrieval, which consists of extracting future temporal information, that is known in advance, from web sources in order to answer queries that combine text of a future temporal nature. This paper aims to confirm whether web snippets can be used to form an intelligent web that can detect future expected events when their dates are already known. Moreover, the objective is to identify the nature of future texts and understand how these temporal features affect the classification and clustering of the different types of future-related texts: informative texts, scheduled texts and rumor texts. We have conducted a set of comprehensive experiments and the results show that web documents are a valuable source of future data that can be particularly useful in identifying

URI

http://repositorio.inesctec.pt/handle/123456789/2938
http://dx.doi.org/10.1007/978-3-642-24769-9_42

Collections

LIAAD - Indexed Articles in Conferences

Full item page