DocumentCode :
1608640
Title :
A Cross Language Text Categorization Algorithm from the Perspective of Information Retrieval
Author :
Liu, Yue ; Tian, Ming ; Zhou, Weitao ; Dai, Lin
Author_Institution :
Beijing Inst. of Technol., Beijing, China
fYear :
2012
Firstpage :
254
Lastpage :
257
Abstract :
In this paper, we propose a novel method that performs Cross Language Text Categorization (CLTC) from the perspective of Information Retrieval. We present an input document in target language in the form of a query in source language. Then we retrieve the training documents in source language and find K most relevant results. At last, we use the class labels of the K results to predict the class of the input document. The only external resource required by our method is a bilingual dictionary. Experimental results show that our method gives promising performance, which is better than translation-based method.
Keywords :
dictionaries; natural language processing; query processing; text analysis; CLTC; bilingual dictionary; class labels; cross language text categorization algorithm; information retrieval perspective; input document; query; source language; target language; training document retrieval; translation-based method; Industrial control; Cross Language Text Categorization; Information Retrieval; Text Categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Control and Electronics Engineering (ICICEE), 2012 International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4673-1450-3
Type :
conf
DOI :
10.1109/ICICEE.2012.74
Filename :
6322363
Link To Document :
بازگشت