Identification and classification of health queries: Co-occurrences vs. domain- specific terminologies

Thumbnail Image
Date
2014
Authors
Carla Lopes
Cristina Ribeiro
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Identifying the user's intent behind a query is a key challenge in Information Retrieval. This information may be used to contextualize the search and provide better search results to the user. The automatic identification of queries targeting a search for health information allows the implementation of retrieval strategies specifically focused on the health domain. In this paper, two kinds of automatic methods to identify and classify health queries based on domain-specific terminology are proposed. Besides evaluating these methods, we compare them with a method that is based on co-occurrence statistics of query terms with the word "health". Although the best overall result was achieved with a variant of the co-occurrence method, the method based on domain-specific frequencies that generates a continuous output outperformed most of the other methods. Moreover, this method also allows the association of queries to the semantic tree of the Unified Medical Language System and thereafter their classification into appropriate subcategories. Copyright © 2014, IGI Global.
Description
Keywords
Citation