Title :
Using Term Relation in Context Sensitive Information Retrieval
Author_Institution :
Sch. of Software & Microelectron., Peking Univ., Beijing, China
Abstract :
Term relations analysis has been used to improve performance in information retrieval. However, it is difficult to choose the appropriate related terms. Co-occurrence analysis and WordNet have been used to obtain mutual information between terms in re-ranking retrieval results and performing query expansion, but it didn´t improve the performance as expected. It is difficult to avoid involving noise information and inappropriate related terms with ambiguous sense in the process of finding related terms and computing mutual information. To solve this problem, we propose to add context information in a document when choosing related terms by clustering method, and use Mahalanobis distance instead of Euclidean distance in re-ranking query result with term mutual information. The approach presented in this paper can improve the precision and relevance in enterprise information retrieval significantly to satisfy user´s needs.
Keywords :
pattern clustering; query processing; relevance feedback; Mahalanobis distance; WordNet; clustering method; co-occurrence analysis; context sensitive information retrieval; enterprise information retrieval; query expansion; relevance feedback; retrieval results reranking; term relations analysis; Clustering methods; Euclidean distance; Feedback; Fuzzy systems; Information analysis; Information retrieval; Internet; Java; Mutual information; Performance analysis; Term relation; context; covariance; information retrieval; precision; query expansion; rank; relevance;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.661