Title :
A New Method for Cross-Language Information Retrieval by Summing Weights of Graphs
Author :
Yuan, Song An ; Yu, Song Nian
Author_Institution :
Shanghai Univ., Shanghai
Abstract :
Disambiguation is the aim of most translation techniques used in cross-language information retrieval. In this paper, we present a new method for query translation that needs only a bilingual dictionary and a monolingual corpus. Unlike the traditional statistical approach, our method uses co-occurrences between pairs of terms as its statistical measure. By adding up all the weights of a k-complete subgraph, we can compare different combinations of target terms. The output of our method takes the form of a probability distribution, which is then converted into a query in the target language. The method is easy to implement, and experimental results show that it performs well.
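
The idea of scoring a combination of target terms by summing the edge weights of a complete subgraph can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that each source query term has a list of candidate translations from the dictionary, that edge weights are symmetric co-occurrence counts from the monolingual corpus, and that the summed weights are normalized into a probability distribution by dividing by their total. The function name `rank_translations`, the data layout, and the example counts are all hypothetical.

```python
from itertools import combinations, product


def rank_translations(candidates, cooccurrence):
    """Score every combination of target terms (illustrative sketch only).

    candidates:   one list of candidate target terms per source query term
                  (assumed to come from a bilingual dictionary).
    cooccurrence: dict mapping frozenset({a, b}) to a co-occurrence weight
                  (assumed to be counted in a monolingual corpus).
    Returns a dict mapping each combination (a tuple) to a probability.
    """
    scores = {}
    for combo in product(*candidates):
        # One term is chosen per source term, so the k chosen terms are the
        # vertices of a complete subgraph; its score is the sum of all
        # pairwise edge weights. Missing pairs count as weight 0.
        scores[combo] = sum(
            cooccurrence.get(frozenset(pair), 0.0)
            for pair in combinations(combo, 2)
        )

    total = sum(scores.values())
    if total == 0:
        # No co-occurrence evidence at all: fall back to a uniform distribution.
        return {combo: 1.0 / len(scores) for combo in scores}
    # Assumed normalization: divide each score by the total.
    return {combo: weight / total for combo, weight in scores.items()}


# Made-up example: two source terms, each with two candidate translations.
candidates = [["bank", "shore"], ["money", "currency"]]
cooccurrence = {
    frozenset({"bank", "money"}): 6.0,
    frozenset({"bank", "currency"}): 3.0,
    frozenset({"shore", "money"}): 1.0,
}
distribution = rank_translations(candidates, cooccurrence)
best = max(distribution, key=distribution.get)
```

With these made-up counts, the combination `("bank", "money")` receives the largest share of the probability mass. How the distribution is converted into the final target-language query is not specified in the abstract and is left out of the sketch.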
Keywords :
dictionaries; graph theory; information retrieval; natural language processing; statistical distributions; text analysis; bilingual dictionary; cooccurrences; cross-language information retrieval; disambiguation; k-complete subgraph; monolingual corpus; probability distribution; query translation; statistical measure; Dictionaries; Distributed computing; Frequency estimation; Frequency shift keying; Fuzzy systems; Information retrieval; Natural languages; Probability distribution; Search engines;
Conference_Title :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.84