Please use this identifier to cite or link to this item: http://repositorio.inesctec.pt/handle/123456789/6913
Title: Predicting the comprehension of health web documents using characteristics of documents and users
Authors: Oroszlanyova,M
Carla Lopes
Sérgio Nunes
Cristina Ribeiro
Issue Date: 2016
Abstract: The Web is frequently used as a way to access health information. In the health domain, the terminology can be very specific, frequently assuming a medico-scientific character. This can be a barrier to users who may be unable to understand the retrieved documents. Therefore, it would be useful to automatically assess how well a certain document will be understood by a certain user. In the present work, we analyse whether it is possible to predict the comprehension of documents using document features together with user features, and how well this can be achieved. We use an existing dataset, composed by health documents on the Web and their assessment in terms of comprehension by users, to build two multivariate prediction models for comprehension. Our best model showed very good results, with 96.51% accuracy. Our findings suggest features that can be considered by search engines to estimate comprehension. We found that user characteristics related to web and health search habits, such as the success of the users with Web search and the frequency of the users' health search, are some of the most influential user variables. The promising results obtained with this dataset with manual comprehension assessment will lead us to explore the automatic assessment of document and user characteristics. (C) 2016 The Authors. Published by Elsevier B.V.
URI: http://repositorio.inesctec.pt/handle/123456789/6913
http://dx.doi.org/10.1016/j.procs.2016.09.120
metadata.dc.type: conferenceObject
Publication
Appears in Collections:CSIG - Articles in International Conferences

Files in This Item:
File Description SizeFormat 
P-00M-5ZW.pdf216.23 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.