Please use this identifier to cite or link to this item: http://repositorio.inesctec.pt/handle/123456789/6905
Title: Identification and classification of health queries: Co-occurrences vs. domain- specific terminologies
Authors: Carla Lopes
Cristina Ribeiro
Issue Date: 2014
Abstract: Identifying the user's intent behind a query is a key challenge in Information Retrieval. This information may be used to contextualize the search and provide better search results to the user. The automatic identification of queries targeting a search for health information allows the implementation of retrieval strategies specifically focused on the health domain. In this paper, two kinds of automatic methods to identify and classify health queries based on domain-specific terminology are proposed. Besides evaluating these methods, we compare them with a method that is based on co-occurrence statistics of query terms with the word "health". Although the best overall result was achieved with a variant of the co-occurrence method, the method based on domain-specific frequencies that generates a continuous output outperformed most of the other methods. Moreover, this method also allows the association of queries to the semantic tree of the Unified Medical Language System and thereafter their classification into appropriate subcategories. Copyright © 2014, IGI Global.
URI: http://repositorio.inesctec.pt/handle/123456789/6905
http://dx.doi.org/10.4018/ijhisi.2014070104
metadata.dc.type: article
Publication
Appears in Collections:CSIG - Articles in International Journals

Files in This Item:
File Description SizeFormat 
P-00A-2H6.pdf353.89 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.