Title :
Using Statistical Machine Translation Model to Improve Domain-Specific Metasearch Engines
Author_Institution :
Xiamen Univ., Xiamen
fDate :
May 30 2007-June 1 2007
Abstract :
In order to improve the recall of the domain-specific information retrieval, an efficient query expansion mechanism is proposed for the metasearch engines. This mechanism uses the statistical machine translation model to compute the relevance between general query words and domain-relevant query words and dispatches the expanded queries to component search engines. The key ingredient of translation model is the expectation maximization (EM) algorithm. The experimental results show that the proposed expansion mechanism is a desirable and efficient method to improve the domain-relevance of the pages returned by a metasearch engine.
Keywords :
computational linguistics; expectation-maximisation algorithm; language translation; query formulation; search engines; EM algorithm; Internet; domain-relevant query words; domain-specific information retrieval; domain-specific metasearch engines; expectation maximization algorithm; query expansion mechanism; statistical machine translation model; Automatic control; Automation; Information resources; Information retrieval; Internet; Metasearch; Natural languages; Research and development; Search engines; Web pages; EM algorithm; metasearch engine; statistical machine translation;
Conference_Titel :
Control and Automation, 2007. ICCA 2007. IEEE International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-1-4244-0818-4
Electronic_ISBN :
978-1-4244-0818-4
DOI :
10.1109/ICCA.2007.4376679