DocumentCode :
2891536
Title :
A Term Association Approach for Genomics Information Retrieval
Author :
Hu, Qinmin ; Huang, Jimmy Xiangji ; Hu, Xiaohua
Author_Institution :
Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
532
Lastpage :
537
Abstract :
Modeling and mining term association is important for information retrieval, which allows an information retrieval system to retrieve relevant documents/passages more precisely. In this paper, we propose a new approach for discovering term associations among the keywords from a query. First, factor analysis is applied to discover some hidden common factors as the "eliteness" variables that can be used to estimate the importance of term associations. Second, a factor analysis based model and a corresponding algorithm are proposed. Then, we report experimental results that confirm the effectiveness and superiority of the proposed term association approach. Our approach achieves excellent results on the TREC 2007 and 2006 data sets, which provides a promising avenue for constructing high performance information retrieval systems.
Keywords :
biology computing; data mining; document handling; genomics; information retrieval; document-passage retrieval; eliteness variables; factor analysis based model; genomics information retrieval system; hidden common factor discovery; keywords; relevant documents retrieval; relevant passages retrieval; term association mining approach; term association modelling; Algorithm design and analysis; Analytical models; Bioinformatics; Genomics; Load modeling; Loading; Mathematical model; Biomedical IR; Factor Analysis; Term Association;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
Type :
conf
DOI :
10.1109/BIBM.2011.107
Filename :
6120497
Link To Document :
بازگشت