Tuning a Semantic Relatedness Algorithm using a Multiscale Approach

dc.contributor.author José Paulo Leal en
dc.contributor.author Teresa Almeida Costa en
dc.date.accessioned 2017-12-19T19:33:46Z
dc.date.available 2017-12-19T19:33:46Z
dc.date.issued 2015 en
dc.description.abstract The research presented in this paper builds on previous work that lead to the definition of a family of semantic relatedness algorithms. These algorithms depend on a semantic graph and on a set of weights assigned to each type of arcs in the graph. The current objective of this research is to automatically tune the weights for a given graph in order to increase the proximity quality. The quality of a semantic relatedness method is usually measured against a benchmark data set. The results produced by a method are compared with those on the benchmark using a nonparametric measure of statistical dependence, such as the Spearman's rank correlation coefficient. The presented methodology works the other way round and uses this correlation coefficient to tune the proximity weights. The tuning process is controlled by a genetic algorithm using the Spearman's rank correlation coefficient as fitness function. This algorithm has its own set of parameters which also need to be tuned. Bootstrapping is a statistical method for generating samples that is used in this methodology to enable a large number of repetitions of a genetic algorithm, exploring the results of alternative parameter settings. This approach raises several technical challenges due to its computational complexity. This paper provides details on techniques used to speedup the process. The proposed approach was validated with the Word Net 2.1 and the Word Sim-353 data set. Several ranges of parameter values were tested and the obtained results are better than the state of the art methods for computing semantic relatedness using the Word Net 2.1, with the advantage of not requiring any domain knowledge of the semantic graph. en
dc.identifier.uri http://repositorio.inesctec.pt/handle/123456789/4321
dc.identifier.uri http://dx.doi.org/10.2298/csis140905020l en
dc.language eng en
dc.relation 5125 en
dc.relation 5935 en
dc.rights info:eu-repo/semantics/openAccess en
dc.title Tuning a Semantic Relatedness Algorithm using a Multiscale Approach en
dc.type article en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
P-00G-EE8.pdf
Size:
381.05 KB
Format:
Adobe Portable Document Format
Description: