DocumentCode
1608640
Title
A Cross Language Text Categorization Algorithm from the Perspective of Information Retrieval
Author
Liu, Yue ; Tian, Ming ; Zhou, Weitao ; Dai, Lin
Author_Institution
Beijing Inst. of Technol., Beijing, China
fYear
2012
Firstpage
254
Lastpage
257
Abstract
In this paper, we propose a novel method that performs Cross Language Text Categorization (CLTC) from the perspective of Information Retrieval. We present an input document in target language in the form of a query in source language. Then we retrieve the training documents in source language and find K most relevant results. At last, we use the class labels of the K results to predict the class of the input document. The only external resource required by our method is a bilingual dictionary. Experimental results show that our method gives promising performance, which is better than translation-based method.
Keywords
dictionaries; natural language processing; query processing; text analysis; CLTC; bilingual dictionary; class labels; cross language text categorization algorithm; information retrieval perspective; input document; query; source language; target language; training document retrieval; translation-based method; Industrial control; Cross Language Text Categorization; Information Retrieval; Text Categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial Control and Electronics Engineering (ICICEE), 2012 International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-4673-1450-3
Type
conf
DOI
10.1109/ICICEE.2012.74
Filename
6322363
Link To Document