Using domain-specific term frequencies to identify and classify health queries

Thumbnail Image
Date
2013
Authors
Carla Lopes
Dias,D
Cristina Ribeiro
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
In this paper we propose a multilingual method to identify health-related queries and classify them into health categories. Our method uses a consumer health vocabulary and the Unified Medical Language System semantic structure to compute the association degree of a query to medical concepts and categories. This method can be applied in different languages with translated versions of the health vocabulary. To evaluate its efficacy and applicability in two languages we used two manually classified sets of queries, each on a different language. Results are better for the English sample where a distance of 0.38 to the ROC optimal point (0,1) was obtained. This shows some influence of the translation in the method's performance. © 2013 Springer-Verlag.
Description
Keywords
Citation