An Exploratory Study on the Impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents

Date
2011
Authors
Alípio Jorge
Ricardo Campos
Gaël Dias
Abstract
In the last few years, a huge amount of written temporal information has become widely available on the Internet with the advent of forums, blogs and social networks. This has given rise to a challenging new problem called future retrieval, which consists of extracting future temporal information that is known in advance from web sources in order to answer queries that combine text of a future temporal nature. This paper aims to confirm whether web snippets can be used to form an intelligent web that can detect future expected events when their dates are already known. A further objective is to identify the nature of future texts and to understand how these temporal features affect the classification and clustering of the different types of future-related texts: informative texts, scheduled texts and rumor texts. We conducted a comprehensive set of experiments, and the results show that web documents are a valuable source of future data that can be particularly useful in identifying