Sampling massive streaming call graphs

dc.contributor.author Shazia Tabassum en
dc.contributor.author João Gama en
dc.date.accessioned 2018-01-03T10:36:26Z
dc.date.available 2018-01-03T10:36:26Z
dc.date.issued 2016 en
dc.description.abstract The problem of analyzing massive graph streams in real time grows with the size of the streams. Sampling techniques have been used to analyze these streams in real time. However, several questions are difficult to answer: Which structures are well preserved by the sampling techniques over the evolution of streams? Which sampling techniques yield proper estimates for directed and weighted graphs? Which techniques have the lowest time complexity? In this work, we answer these questions by comparing and analyzing the evolutionary samples of such graph streams. We evaluate sequential sampling techniques by comparing the structural metrics of their samples. We also present a biased version of reservoir sampling, which shows better comparative results in our scenario. We carried out rigorous experiments over a massive stream of 300 million calls made by 11 million anonymous subscribers over 31 days. We evaluated node-based and edge-based sampling methods. We compared the samples generated by sequential algorithms such as the space-saving algorithm for finding top-K items, reservoir sampling, and a biased version of reservoir sampling. Our overall results and observations show that edge-based samples perform well in our scenario. We also compared the degree distributions and biases of the evolutionary samples. © 2016 ACM. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/5331
dc.identifier.uri http://dx.doi.org/10.1145/2851613.2851654 en
dc.language eng en
dc.relation 6461 en
dc.relation 5120 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Sampling massive streaming call graphs en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Name: P-00K-H7X.pdf
Size: 704.66 KB
Format: Adobe Portable Document Format