GTE-Rank: A time-aware search engine to answer time-sensitive queries

dc.contributor.author Ricardo Campos en
dc.contributor.author Dias,G en
dc.contributor.author Alípio Jorge en
dc.contributor.author Nunes,C en
dc.date.accessioned 2017-12-19T18:37:57Z
dc.date.available 2017-12-19T18:37:57Z
dc.date.issued 2016 en
dc.description.abstract In the web environment, most of the queries issued by users are implicit by nature. Inferring the different temporal intents of this type of query enhances the overall temporal part of the web search results. Previous works tackling this problem usually focused on news queries, where the retrieval of the most recent results related to the query are usually sufficient to meet the user's information needs. However, few works have studied the importance of time in queries such as "Philip Seymour Hoffman" where the results may require no recency at all. In this work, we focus on this type of queries named "time-sensitive queries" where the results are preferably from a diversified time span, not necessarily the most recent one. Unlike related work, we follow a content-based approach to identify the most important time periods of the query and integrate time into a re-ranking model to boost the retrieval of documents whose contents match the query time period. For that purpose, we define a linear combination of topical and temporal scores, which reflects the relevance of any web document both in the topical and temporal dimensions, thus contributing to improve the effectiveness of the ranked results across different types of queries. Our approach relies on a novel temporal similarity measure that is capable of determining the most important dates for a query, while filtering out the non-relevant ones. Through extensive experimental evaluation over web corpora, we show that our model offers promising results compared to baseline approaches. As a result of our investigation, we publicly provide a set of web services and a web search interface so that the system can be graphically explored by the research community. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/4267
dc.identifier.uri http://dx.doi.org/10.1016/j.ipm.2015.07.006 en
dc.language eng en
dc.relation 4981 en
dc.relation 5782 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title GTE-Rank: A time-aware search engine to answer time-sensitive queries en
dc.type article en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-00K-6NS.pdf
Size:
2.65 MB
Format:
Adobe Portable Document Format
Description: