Parallel Subgraph Counting for Multicore Architectures
Date
  2014
Authors
  David Oliveira Aparício
  Pedro Manuel Ribeiro
  Fernando Silva
Abstract
  Computing the frequency of small subgraphs in a large network is a computationally hard task. It is nonetheless an important graph mining primitive with several applications, and here we present a novel multicore parallel algorithm for it. At the core of our methodology lies a state-of-the-art data structure, the g-trie, which represents a collection of subgraphs and allows for a very efficient sequential search. Our implementation uses Pthreads and can run on any multicore personal computer. We employ a diagonal work sharing strategy to divide work among threads dynamically and effectively during execution. We assess the performance of our Pthreads implementation on a set of representative networks from various domains and with diverse topological features. For most networks, we obtain a speedup of over 50 on 64 cores and an almost linear speedup up to 32 cores, showcasing the flexibility and scalability of our algorithm. This paves the way for applying such counting algorithms to larger subgraph and network sizes without requiring access to a cluster.
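
To make the counting task concrete, here is a minimal sketch of a 3-node subgraph census in which worker threads pull vertices from a shared queue. It is not the g-trie search or the diagonal work sharing strategy described in the abstract, and it uses Python threads rather than Pthreads. The function name `count_3_subgraphs`, the `num_threads` parameter, and the queue-based work division are assumptions made for illustration only. Because of CPython's global interpreter lock, the sketch shows how work can be divided dynamically among threads, not an actual speedup.

```python
import threading
from itertools import combinations
from queue import Empty, Queue


def count_3_subgraphs(edges, num_threads=4):
    """Count induced connected 3-node subgraphs (paths and triangles)
    of an undirected graph given as a list of (u, v) edges.

    Illustrative only: a naive census with a shared work queue, not the
    g-trie algorithm from the paper.
    """
    # Build an adjacency-set representation of the graph.
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # Each vertex is one unit of work; idle threads take the next one.
    work = Queue()
    for v in adj:
        work.put(v)

    totals = {"path": 0, "triangle": 0}
    lock = threading.Lock()

    def worker():
        local = {"path": 0, "triangle": 0}
        while True:
            try:
                centre = work.get_nowait()
            except Empty:
                break
            # Every pair of neighbours of `centre` closes either a
            # triangle (if the pair is adjacent) or an induced path.
            for a, b in combinations(adj[centre], 2):
                if b in adj[a]:
                    local["triangle"] += 1  # seen once per triangle vertex
                else:
                    local["path"] += 1  # seen only at the middle vertex
        # Merge thread-local counts under a lock.
        with lock:
            for kind in totals:
                totals[kind] += local[kind]

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # Each triangle was counted from all three of its vertices.
    totals["triangle"] //= 3
    return totals
```

For a triangle with one pendant vertex, `count_3_subgraphs([(0, 1), (1, 2), (0, 2), (2, 3)])` returns `{"path": 2, "triangle": 1}`. Keeping per-thread counters and merging them once at the end avoids contention on the shared totals, which is a common choice in threaded counting code regardless of the work sharing scheme.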