Classifying heart sounds using SAX motifs, random forests and text mining techniques

dc.contributor.author Elsa Ferreira Gomes en
dc.contributor.author Alípio Jorge en
dc.contributor.author Paulo Jorge Azevedo en
dc.date.accessioned 2017-11-20T10:47:15Z
dc.date.available 2017-11-20T10:47:15Z
dc.date.issued 2014 en
dc.description.abstract In this paper we describe an approach to classifying heart sounds (classes Normal, Murmur and Extra-systole) that is based on the discretization of sound signals using the SAX (Symbolic Aggregate Approximation) representation. The ability of automatically classifying heart sounds or at least support human decision in this task is socially relevant to spread the reach of medical care using simple mobile devices or digital stethoscopes. In our approach, sounds are first pre-processed using signal processing techniques (decimate, low-pass filter, normalize, Shannon envelope). Then the pre-processed symbols are transformed into sequences of discrete SAX symbols. These sequences are subject to a process of motif discovery. Frequent sequences of symbols (motifs) are adopted as features. Each sound is then characterized by the frequent motifs that occur in it and their respective frequency. This is similar to the term frequency (TF) model used in text mining. In this paper we compare the TF model with the application of the TFIDF (Term frequency - Inverse Document Frequency) and the use of bi-grams (frequent size two sequences of motifs). Results show the ability of the motifs based TF approach to separate classes and the relative value of the TFIDF and the bi-grams variants. The separation of the Extra-systole class is overly difficult and much better results are obtained for separating the Murmur class. Empirical validation is conducted using real data collected in noisy environments. We have also assessed the cost-reduction potential of the proposed methods by considering a fixed cost model and using a cost sensitive meta algorithm. Copyright 2014 ACM. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/3611
dc.identifier.uri http://dx.doi.org/10.1145/2628194.2628240 en
dc.language eng en
dc.relation 6898 en
dc.relation 5606 en
dc.relation 4981 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Classifying heart sounds using SAX motifs, random forests and text mining techniques en
dc.type conferenceObject en
dc.type Publication en
Files