The impact of time in link-based Web ranking

dc.contributor.author Sérgio Nunes en
dc.contributor.author Cristina Ribeiro en
dc.contributor.author Gabriel David en
dc.date.accessioned 2017-11-16T21:47:51Z
dc.date.available 2017-11-16T21:47:51Z
dc.date.issued 2013 en
dc.description.abstract Introduction. The strong dynamic nature of the Web is a well-known reality. Nonetheless, research on Web dynamics is still a minor part of mainstream Web research. This is largely the case in Web link analysis. In this paper we investigate and measure the impact of time in link-based ranking algorithms on a particular subset of the Web, specifically blogs. Method. Using a large collection of blog posts that span more than three years, we compare a traditional link-based ranking algorithm with a time-biased alternative, providing some insights into the evolution of link data over time. We designed two experiments to evaluate the use of temporal features in authority estimation algorithms. In the first experiment we compare time-independent and time-sensitive ranking algorithms with a reference rank based on the total number of visits to each blog. In the second, we use feedback from communication media domain experts to contrast different rankings of Portuguese news Websites. Results. The distribution of citations to a Web document over time contains valuable information. Based on several examples we show that time-independent algorithms are unable to capture the correct popularity of sites with high citation activity. Using a reference rank based on the number of visits to a site, we show that a time-biased approach has a better performance. Conclusions. Although both time-independent and time-aware approaches are based on the same raw data, the experiments indicate that they can be treated as complementary signals for relevance assessment by information retrieval systems. We show that temporal information present in blogs can be used to derive stable time-dependent features, which can be successfully used in the context of Web document ranking. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/3023
dc.language eng en
dc.relation 5448 en
dc.relation 215 en
dc.relation 212 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title The impact of time in link-based Web ranking en
dc.type article en
dc.type Publication en
Files