• DocumentCode
    2029518
  • Title

    Disambiguation for polyphones of Chinese based on two-pass unified approach

  • Author

    Huang, Feng-Long ; Lin, Jun-Hong ; Lin, Xin-Wei

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli, Taiwan
  • fYear
    2010
  • fDate
    16-18 Dec. 2010
  • Firstpage
    603
  • Lastpage
    607
  • Abstract
    The paper addresses the issue of Chinese polyphones and the disambiguity approach. Three methods, dictionary matching, language models and voting scheme, are used to retrieve the token´s information and then disambiguate the prediction of polyphones. The best precision for these methods achieves 92.72%. Furthermore we proposed the two-pass unified approach to improve the performance with various empirical thresholds. Our approach is superior to the well-known MS Word 2007, and the final precision rate reaches 94.3%. It proves that the proposed approach can resolve effectively text-to-phoneme conversion in Chinese TTS system.
  • Keywords
    Internet; learning (artificial intelligence); natural language processing; speech processing; speech synthesis; Chinese TTS system; Chinese polyphones; MS Word 2007; dictionary matching; disambiguation; language models; text-to-phoneme conversion; text-to-speech; token´s information; two-pass unified approach; voting scheme; Computational modeling; Dictionaries; Electronic learning; Markov processes; Probability; Semantics; Testing; Language Model; Text-to-Speech(TTS); Two-Pass Unified Approach; Word Sense Disambiguation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Symposium (ICS), 2010 International
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-7639-8
  • Type

    conf

  • DOI
    10.1109/COMPSYM.2010.5685440
  • Filename
    5685440