GTE: A Distributional Second-Order Co-Occurrence Approach to Improve the Identification of Top Relevant Dates in Web Snippets

dc.contributor.author Gael Dias en
dc.contributor.author Ricardo Campos en
dc.contributor.author Célia Nunes en
dc.contributor.author Alípio Jorge en
dc.date.accessioned 2017-11-16T13:45:03Z
dc.date.available 2017-11-16T13:45:03Z
dc.date.issued 2012 en
dc.description.abstract In this paper, we present an approach to identify top relevant dates in Web snippets with respect to a given user implicit temporal query. Our approach is two-fold. First, we propose a generic temporal similarity measure called GTE, which evaluates the temporal similarity between a query and a date. Second, we propose a classification model to accurately relate relevant dates to their corresponding query terms and withdraw irrelevant ones. We suggest two different solutions: a threshold-based classification strategy and a supervised classifier based on a combination of multiple similarity measures. We evaluate both strategies over a set of real-world text queries and compare the performance of our Web snippet approach with a query log based approach over the same set of queries. Experiments show that determining the most relevant dates of any given implicit temporal query can be improved with GTE combined with the second order similarity measure InfoSimba, the Dice coefficient and t en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/2495
dc.identifier.uri http://dx.doi.org/10.1145/2396761.2398567 en
dc.language eng en
dc.relation 5782 en
dc.relation 4981 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title GTE: A Distributional Second-Order Co-Occurrence Approach to Improve the Identification of Top Relevant Dates in Web Snippets en
dc.type conferenceObject en
dc.type Publication en
Files