• DocumentCode
    3317431
  • Title
    Research on ethnic minority language text tools and corpora of China
  • Author
    Guo, Yanhui ; Wang, Xiaojie ; Wang, Cong ; Zhong, Yixin
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    277
  • Lastpage
    280
  • Abstract
    The increasing interest in the use of large-scale textual resources for NLP research has led to the rapid proliferation of both textual data and text-handling tools. Most of these are dedicated to widely used languages (such as English, French, or Chinese). However, considerable effort is required to develop the corresponding resources for ethnic minority languages. In this paper, we analyze the requirements of ethnic minority language corpora, discuss issues surrounding the development of corpus-handling tools for ethnic minority languages, and present the initial results of this work.
  • Keywords
    computational linguistics; natural languages; text analysis; corpus-handling tool; ethnic minority language text tool; natural language processing; text-handling tool; textual resource; Acceleration; Computer languages; Educational technology; Encoding; Large-scale systems; Mechanical factors; Natural language processing; Natural languages; Software tools; Writing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type
    conf
  • DOI
    10.1109/NLPKE.2005.1598748
  • Filename
    1598748