G-Tries: a data structure for storing and finding subgraphs

Pedro Manuel Ribeiro; Fernando Silva


dc.contributor.author	Pedro Manuel Ribeiro	en
dc.contributor.author	Fernando Silva	en
dc.date.accessioned	2017-11-20T14:29:18Z
dc.date.available	2017-11-20T14:29:18Z
dc.date.issued	2014	en
dc.description.abstract	The ability to find and count subgraphs of a given network is an important, non-trivial task with multidisciplinary applicability. Discovering network motifs or computing graphlet signatures are two examples of methodologies that at their core rely precisely on the subgraph counting problem. Here we present the g-trie, a data structure specifically designed for discovering subgraph frequencies. We produce a tree that encapsulates the structure of the entire graph set, taking advantage of common topologies in the same way a prefix tree takes advantage of common prefixes. This avoids redundancy in the representation of the graphs, thus allowing for both memory and computation time savings. We introduce a specialized canonical labeling designed to highlight common substructures and annotate the g-trie with a set of conditional rules that break symmetries, avoiding repetitions in the computation. We introduce a novel algorithm that takes as input a set of small graphs and is able to efficiently find and count them as induced subgraphs of a larger network. We perform an extensive empirical evaluation of our algorithms, focusing on efficiency and scalability on a set of diversified complex networks. Results show that g-tries are able to clearly outperform previously existing algorithms by at least one order of magnitude.	en
dc.identifier.uri	http://repositorio.inesctec.pt/handle/123456789/3709
dc.identifier.uri	http://dx.doi.org/10.1007/s10618-013-0303-4	en
dc.language	eng	en
dc.relation	5316	en
dc.relation	5124	en
dc.rights	info:eu-repo/semantics/openAccess	en
dc.title	G-Tries: a data structure for storing and finding subgraphs	en
dc.type	article	en
dc.type	Publication	en
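
The prefix-tree analogy in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `GTrieNode`, `insert` and `count_nodes` are hypothetical, the graphs are assumed to be undirected and given as 0/1 adjacency matrices, and the vertex order is taken as supplied. The canonical labeling and the symmetry-breaking conditions described in the abstract are deliberately left out. Each trie level adds one vertex, keyed by its connections to the vertices already placed, so graphs that agree on their first vertices share a path from the root.

```python
class GTrieNode:
    """One vertex of a stored graph, keyed by its links to earlier vertices."""

    def __init__(self):
        self.children = {}      # maps a connection tuple to a child node
        self.is_graph = False   # True if a stored graph ends at this node


def insert(root, adj):
    """Insert an undirected graph given as a 0/1 adjacency matrix.

    Vertex k is described by its connections to vertices 0..k-1, so two
    graphs whose first vertices are connected alike share trie nodes.
    Assumption: the caller supplies the vertex order (no canonical labeling).
    """
    node = root
    for k, row in enumerate(adj):
        key = tuple(row[:k])    # connections to earlier vertices only
        node = node.children.setdefault(key, GTrieNode())
    node.is_graph = True


def count_nodes(node):
    """Count the trie nodes below the given node (the node itself excluded)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


# Two 3-vertex graphs: the path 0-1-2 and a triangle.
path = [[0, 1, 0],
        [1, 0, 1],
        [0, 1, 0]]
triangle = [[0, 1, 1],
            [1, 0, 1],
            [1, 1, 0]]

root = GTrieNode()
insert(root, path)
insert(root, triangle)
print(count_nodes(root))  # the two graphs share their first two vertices
```

Stored separately, the two graphs would need six vertex entries; in this sketch they occupy four trie nodes because the first two vertices (a single edge) are common to both.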

Collections

CRACS - Indexed Articles in Journals
