• DocumentCode
    2260498
  • Title

    Research on knowledge elements in exponential language model

  • Author

    Jiang, Huixing ; Wang, Xiaojie

  • Author_Institution
    Center for Intell. Sci. & Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2009
  • fDate
    24-27 Sept. 2009
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper presents an exponential language model (ELM) for modeling and managing knowledge elements. The model has been developed based on minimum sample risk (MSR) algorithm, which is a discriminative training method. ELM uses features to capture global, domain, or sentential language phenomena that is composed of name entities, part of speech strings, personal usage words, positions of words, sentence mood, sentence tense etc. We study kinds of knowledge elements´ performances on the task of Chinese Pinyin to Chinese character (PTC) conversion in Internet language (Chinese mobile short messages and Chinese QQ1 chat records). By combining different kind of knowledge elements to ELM, the model performs different, but all ELMs with more knowledge elements outperform the ELM only using probability knowledge calculated by baseline n-gram models which use Kneser-Ney smoothing technology.
  • Keywords
    Internet; knowledge engineering; natural language processing; Chinese Pinyin; Chinese character conversion; Chinese mobile short messages; Internet language; Kneser-Ney smoothing technology; baseline n-gram models; discriminative training method; exponential language model; knowledge elements; minimum sample risk algorithm; name entities; personal usage words; positions of words; probability knowledge; sentence mood; sentence tense; sentential language phenomena; speech strings; Equations; Error analysis; Knowledge management; Management training; Maximum likelihood estimation; Natural languages; Niobium; Speech; Strontium; Technology management; Exponential language models; MSR-ELM; knowledge elements; minimum sample risk;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
  • Conference_Location
    Dalian
  • Print_ISBN
    978-1-4244-4538-7
  • Electronic_ISBN
    978-1-4244-4540-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2009.5313799
  • Filename
    5313799