Multi-interval Discretization of Continuous Attributes for Label Ranking

dc.contributor.author Cláudio Rebelo Sá en
dc.contributor.author Carlos Manuel Soares en
dc.contributor.author Knobbe,A en
dc.contributor.author Paulo Jorge Azevedo en
dc.contributor.author Alípio Jorge en
dc.date.accessioned 2017-12-12T15:59:38Z
dc.date.available 2017-12-12T15:59:38Z
dc.date.issued 2013 en
dc.description.abstract Label Ranking (LR) problems, such as predicting rankings of financial analysts, are becoming increasingly important in data mining. While there has been a significant amount of work on the development of learning algorithms for LR in recent years, pre-processing methods for LR are still very scarce. However, some methods, like Naive Bayes for LR and APRIORI-LR, cannot deal with real-valued data directly. As a make-shift solution, one could consider conventional discretization methods used in classification, by simply treating each unique ranking as a separate class. In this paper, we show that such an approach has several disadvantages. As an alternative, we propose an adaptation of an existing method, MDLP, specifically for LR problems. We illustrate the advantages of the new method using synthetic data. Additionally, we present results obtained on several benchmark datasets. The results clearly indicate that the discretization is performing as expected and in some cases improves the results of the learning algorithms. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/3926
dc.identifier.uri http://dx.doi.org/10.1007/978-3-642-40897-7_11 en
dc.language eng en
dc.relation 5001 en
dc.relation 5606 en
dc.relation 5527 en
dc.relation 4981 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Multi-interval Discretization of Continuous Attributes for Label Ranking en
dc.type conferenceObject en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-008-HKZ.pdf
Size:
313.89 KB
Format:
Adobe Portable Document Format
Description: