• DocumentCode
    1608640
  • Title

    A Cross Language Text Categorization Algorithm from the Perspective of Information Retrieval

  • Author

    Liu, Yue ; Tian, Ming ; Zhou, Weitao ; Dai, Lin

  • Author_Institution
    Beijing Inst. of Technol., Beijing, China
  • fYear
    2012
  • Firstpage
    254
  • Lastpage
    257
  • Abstract
    In this paper, we propose a novel method that performs Cross Language Text Categorization (CLTC) from the perspective of Information Retrieval. We present an input document in target language in the form of a query in source language. Then we retrieve the training documents in source language and find K most relevant results. At last, we use the class labels of the K results to predict the class of the input document. The only external resource required by our method is a bilingual dictionary. Experimental results show that our method gives promising performance, which is better than translation-based method.
  • Keywords
    dictionaries; natural language processing; query processing; text analysis; CLTC; bilingual dictionary; class labels; cross language text categorization algorithm; information retrieval perspective; input document; query; source language; target language; training document retrieval; translation-based method; Industrial control; Cross Language Text Categorization; Information Retrieval; Text Categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Control and Electronics Engineering (ICICEE), 2012 International Conference on
  • Conference_Location
    Xi´an
  • Print_ISBN
    978-1-4673-1450-3
  • Type

    conf

  • DOI
    10.1109/ICICEE.2012.74
  • Filename
    6322363