FastStep: Scalable Boolean Matrix Decomposition

dc.contributor.author Miguel Ramos Araújo en
dc.contributor.author Pedro Manuel Ribeiro en
dc.contributor.author Faloutsos,C en
dc.date.accessioned 2018-01-18T14:56:43Z
dc.date.available 2018-01-18T14:56:43Z
dc.date.issued 2016 en
dc.description.abstract Matrix Decomposition methods are applied to a wide range of tasks, such as data denoising, dimensionality reduction, co-clustering and community detection. However, in the presence of boolean inputs, common methods either do not scale or do not provide a boolean reconstruction, which results in high reconstruction error and low interpretability of the decomposition. We propose a novel step decomposition of boolean matrices in non-negative factors with boolean reconstruction. By formulating the problem using threshold operators and through suitable relaxation of this problem, we provide a scalable algorithm that can be applied to boolean matrices with millions of non-zero entries. We show that our method achieves significantly lower reconstruction error when compared to standard state of the art algorithms. We also show that the decomposition keeps its interpretability by analyzing communities in a flights dataset (where the matrix is interpreted as a graph in which nodes are airports) and in a movie-ratings dataset with 10 million non-zeros. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/6955
dc.identifier.uri http://dx.doi.org/10.1007/978-3-319-31753-3_37 en
dc.language eng en
dc.relation 6311 en
dc.relation 5316 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title FastStep: Scalable Boolean Matrix Decomposition en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-00K-BG7.pdf
Size:
1018.53 KB
Format:
Adobe Portable Document Format
Description: