DocumentCode :
3699219
Title :
A parallel cross-language retrieval system for patent documents
Author :
Xin Shen;Heyan Huang;Lingzhi Li;Yonggang Huang
Author_Institution :
Beijing Engineering Research Center of High Volume Language Information Processing &
fYear :
2015
Firstpage :
672
Lastpage :
676
Abstract :
In order to help people obtain useful information from patent documents in different languages. This paper proposes a cross-language retrieval system to search Chinese and English patent documents simultaneously. This system consists of query translation module, document retrieval module and user interaction module. Query translation module is used to translate query based on bilingual dictionaries. Document retrieval module consists of monolingual retrieval system using standard vector space model. In order to retrieve in highly parallel, we use the MapReduce model to calculate the similarity. User interaction module provides users with interactive mechanism used to improve the retrieval accuracy in the system. It contains two parts: the second translation and relevance feedback. The experimental results show that our system has good performance.
Keywords :
"Patents","Dictionaries","Context","Data processing","Accuracy","Machine learning algorithms"
Publisher :
ieee
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
ISSN :
2327-0586
Print_ISBN :
978-1-4799-8352-0
Electronic_ISBN :
2327-0594
Type :
conf
DOI :
10.1109/ICSESS.2015.7339147
Filename :
7339147
Link To Document :
بازگشت