• DocumentCode
    134243
  • Title

    Error-driven pronunciation dictionary construction for Mandarin speech recognition

  • Author

    Yi Liu ; Xiangang Li ; Xihong Wu

  • Author_Institution
    Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    88
  • Lastpage
    92
  • Abstract
    Aiming at constructing the pronunciation dictionary for Mandarin speech recognition, an automatic error-driven and incremental approach is proposed based on the acoustic confusion network. This method considers both of the acoustic and language information, constructs a dictionary through words selection and composition to optimal the performance of ASR directly. During the process, removing and splitting operations are applied to control the scale of dictionary and avoid to stuck into local optimum. Additionally, it takes advantage of simulated annealing algorithm to obtain the global optimal dictionary. Experiments on Mandarin speech recognition show that the system with the dictionary constructed by the proposed approach gains 1.01% absolute reduction in character error rate compared to the baseline with the same dictionary scale. Besides, the proposed approach can achieve the same performance as best baseline but reduce the size of dictionary from 30000 to 20000.
  • Keywords
    dictionaries; natural language processing; simulated annealing; speech recognition; ASR; Mandarin speech recognition; acoustic confusion network; character error rate reduction; error-driven pronunciation dictionary construction; global optimal dictionary; incremental approach; simulated annealing algorithm; word selection; Acoustics; Compounds; Dictionaries; Error analysis; Hidden Markov models; Lattices; Speech recognition; Mandarin speech recognition; pronunciation dictionary construction; words composition and splitting; words selection and removement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936635
  • Filename
    6936635