DocumentCode :
1467898
Title :
Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
Author :
Shen, J.-L.
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
Volume :
145
Issue :
5
fYear :
1998
fDate :
10/1/1998 12:00:00 AM
Firstpage :
309
Lastpage :
315
Abstract :
The author presents a study of large-vocabulary continuous Mandarin speech recognition based on a segmental probability model (SPM) approach. The SPM was found to be very suitable for recognition of isolated Mandarin syllables especially considering the monosyllabic structure of the Chinese language. To extend the application of the model to continuous Mandarin speech recognition, a concatenated syllable matching (CSM) algorithm in place of the conventional Viterbi search algorithm is first introduced. Also, to utilise the available training material efficiently, a training procedure is proposed to re-estimate the SPM parameters using the maximum a posteriori (MAP) algorithm. A few special techniques integrating acoustic and linguistic knowledge are developed further to improve the performance step by step. Preliminary experimental results show that the final achievable rate is as high as 91.62%, which indicates a 18.48% error rate reduction and more than three times faster than the well studied subsyllable-based CHMM
Keywords :
error statistics; natural languages; probability; speech recognition; Chinese language; SPM parameters re-estimation; acoustic knowledge; concatenated syllable matching algorithm; continuous Mandarin speech recognition; error rate reduction; experimental results; isolated Mandarin syllables; large vocabulary; linguistic knowledge; maximum a posteriori algorithm; monosyllabic structure; performance; recognition rate; segmental probability model; subsyllable-based CHMM; training material; training procedure;
fLanguage :
English
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
Publisher :
iet
ISSN :
1350-245X
Type :
jour
DOI :
10.1049/ip-vis:19982316
Filename :
741943
Link To Document :
بازگشت