DocumentCode
134243
Title
Error-driven pronunciation dictionary construction for Mandarin speech recognition
Author
Yi Liu ; Xiangang Li ; Xihong Wu
Author_Institution
Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
88
Lastpage
92
Abstract
Aiming at constructing the pronunciation dictionary for Mandarin speech recognition, an automatic error-driven and incremental approach is proposed based on the acoustic confusion network. This method considers both of the acoustic and language information, constructs a dictionary through words selection and composition to optimal the performance of ASR directly. During the process, removing and splitting operations are applied to control the scale of dictionary and avoid to stuck into local optimum. Additionally, it takes advantage of simulated annealing algorithm to obtain the global optimal dictionary. Experiments on Mandarin speech recognition show that the system with the dictionary constructed by the proposed approach gains 1.01% absolute reduction in character error rate compared to the baseline with the same dictionary scale. Besides, the proposed approach can achieve the same performance as best baseline but reduce the size of dictionary from 30000 to 20000.
Keywords
dictionaries; natural language processing; simulated annealing; speech recognition; ASR; Mandarin speech recognition; acoustic confusion network; character error rate reduction; error-driven pronunciation dictionary construction; global optimal dictionary; incremental approach; simulated annealing algorithm; word selection; Acoustics; Compounds; Dictionaries; Error analysis; Hidden Markov models; Lattices; Speech recognition; Mandarin speech recognition; pronunciation dictionary construction; words composition and splitting; words selection and removement;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936635
Filename
6936635
Link To Document