Active Manifold Learning with Twitter Big Data

dc.contributor.author Silva,C en
dc.contributor.author Mário João Antunes en
dc.contributor.author Costa,J en
dc.contributor.author Ribeiro,B en
dc.date.accessioned 2018-01-02T15:39:40Z
dc.date.available 2018-01-02T15:39:40Z
dc.date.issued 2015 en
dc.description.abstract The volume of data produced by Internet applications has increased substantially. Big data is a fast-growing field that deals with this deluge of data by using storage techniques, dedicated infrastructures, and development frameworks for the parallelization of defined tasks and the consequent reduction. These solutions, however, fall short in online and highly data-demanding scenarios, since users expect swift feedback. Reduction techniques are used effectively in online big data applications to improve classification. Reduction in big data usually follows one of two main approaches: (i) reducing the dimensionality by pruning or reformulating the feature set; (ii) reducing the sample size by choosing the most relevant examples. Both approaches bring benefits, not only in the time needed to build a model but potentially also in performance, usually by reducing overfitting and improving generalization. In this paper we investigate reduction techniques that tackle both the dimensionality and the size of big data. We propose a framework that combines a manifold learning approach to reduce dimensionality with an SVM-based active learning strategy to reduce the size of the labeled sample. Results on Twitter data show the potential of the proposed active manifold learning approach. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/5245
dc.identifier.uri http://dx.doi.org/10.1016/j.procs.2015.07.296 en
dc.language eng en
dc.relation 5138 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Active Manifold Learning with Twitter Big Data en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Name: P-00G-G00.pdf
Size: 314.61 KB
Format: Adobe Portable Document Format