An Exploratory Study on the impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents
An Exploratory Study on the impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents
No Thumbnail Available
Date
2011
Authors
Alípio Jorge
Ricardo Campos
Gaël Dias
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
In the last few years, a huge amount of temporal written information
has become widely available on the Internet with the advent of forums, blogs
and social networks. This gave rise to a new challenging problem called future
retrieval, which consists of extracting
future temporal information, that is
known in advance, from web sources in order to answer queries that combine
text of a future temporal nature. This paper aims to confirm whether web
snippets can be used to form an intelligent web that can detect future expected
events when their dates are already known. Moreover, the objective is to
identify the nature of future texts and understand how these temporal features
affect the classification and clustering of the different types of future-related
texts: informative texts, scheduled texts and rumor texts. We have conducted a
set of comprehensive experiments and the results show that web documents are
a valuable source of future data that can
be particularly useful in identifying