Title :
Using Tag-Neighbors for Query Expansion in Medical Information Retrieval
Author :
Durao, Frederico ; Bayyapu, Karunakar ; Xu, Guandong ; Dolog, Peter ; Lage, Ricardo
Author_Institution :
Dept. of Comput. Sci., Aalborg Univ., Aalborg, Denmark
Abstract :
In medical document retrieval, users' queries are often under-specified and lead to undesired search results: results that lack the information sought, match domain knowledge inadequately, or come from unreliable sources. To overcome the limitations of under-specified queries, we use tags to enhance information retrieval by expanding users' original queries with context-relevant information. We compute a set of significant tag-neighbor candidates based on neighbor frequency and weight, and use the most frequent and most heavily weighted neighbors to expand an input query whose terms match tags. The proposed approach is evaluated on the MedWorm medical article collection using standard evaluation methods from the Text REtrieval Conference (TREC). Query expansion raises Mean Average Precision (MAP) from a baseline of 0.353 to 0.491 (+39%). In-depth analysis shows how the strategy benefits retrieval at different ranks of the result list.
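The idea of expanding a query with tag neighbors can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define how neighbor frequency and weight are computed, so the sketch assumes that two tags are neighbors when they are assigned to the same document and that a neighbor's score is its raw co-occurrence count. The function names, the top_k parameter, and the sample tags are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import permutations


def build_neighbor_counts(tagged_docs):
    """Count how often each ordered pair of tags shares a document.

    Assumption: co-occurrence on the same document defines "neighbor".
    """
    neighbors = defaultdict(Counter)
    for tags in tagged_docs:
        # set() avoids counting a tag repeated on one document twice
        for a, b in permutations(set(tags), 2):
            neighbors[a][b] += 1
    return neighbors


def expand_query(query, neighbors, top_k=2):
    """Append the top_k highest-scoring neighbors of query terms that match tags.

    Assumption: the score is the summed co-occurrence count; the paper's
    actual weighting scheme is not given in the abstract.
    """
    terms = query.lower().split()
    scores = Counter()
    for term in terms:
        # Terms that match no tag contribute no neighbors
        for neighbor, count in neighbors.get(term, {}).items():
            if neighbor not in terms:
                scores[neighbor] += count
    # Sort by descending score, then alphabetically for a stable result
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return terms + [tag for tag, _ in ranked[:top_k]]


# Hypothetical tag assignments, one list per document
docs = [
    ["diabetes", "insulin", "diet"],
    ["diabetes", "insulin"],
    ["diabetes", "glucose"],
    ["asthma", "inhaler"],
]
neighbors = build_neighbor_counts(docs)
print(expand_query("diabetes treatment", neighbors, top_k=2))
# ['diabetes', 'treatment', 'insulin', 'diet']
```

Only the term "diabetes" matches a tag here, so the expansion draws solely on its neighbors; "treatment" passes through unchanged. A weighted variant would replace the raw count with whatever weight the retrieval system assigns to each tag.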
Keywords :
identification technology; medical information systems; query processing; text analysis; MedWorm medical article collection; TREC; context-relevant information; domain knowledge; information retrieval capability enhancement; mean average precision; medical document retrieval; medical information retrieval; query expansion; standard evaluation methods; tag-neighbors; terms matching tags; text retrieval conference; under-specified queries; Biomedical imaging; Databases; Search engines; Semantics; Tagging; Unified modeling language;
Conference_Title :
Information Science and Applications (ICISA), 2011 International Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-1-4244-9222-0
Electronic_ISBN :
978-1-4244-9223-7
DOI :
10.1109/ICISA.2011.5772324