DocumentCode
921745
Title
Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary
Author
Lee, Lin-shan ; Tseng, Chiu-Yu ; Gu, Hung-Yan ; Liu, Fu-hua ; Chang, Chen-hao ; Lin, Yueh-hong ; Lee, Yumin ; Tu, Shih-Lung ; Hsieh, Shew-Heng ; Chen, Chian-hung
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
1
Issue
2
fYear
1993
fDate
4/1/1993 12:00:00 AM
Firstpage
158
Lastpage
179
Abstract
The first successfully implemented real-time Mandarin dictation machine, which recognizes Mandarin speech with very large vocabulary and almost unlimited texts for the input of Chinese characters into computers, is described. The machine is speaker-dependent, and the input speech is in the form of sequences of isolated syllables. The machine can be decomposed into two subsystems. The first subsystem recognizes the syllables using hidden Markov models. Because every syllable can represent many different homonym characters and form different multisyllabic words with syllables on its right or left, the second subsystem is needed to identify the exact characters from the syllables and correct the errors in syllable recognition. The real-time implementation is on an IBM PC/AT, connected to three sets of specially designed hardware boards on which seven TMS 320C25 chips operate in parallel. The preliminary test results indicate that it takes only about 0.45 s to dictate a syllable (or character) with an accuracy on the order of 90%
Keywords
dictation; hidden Markov models; real-time systems; speech recognition equipment; Chinese language; Golden Mandarin (I); HMM; IBM PC/AT; Mandarin speech dictation machine; TMS 320C25 chips; hidden Markov models; homonym characters; multisyllabic words; real-time implementation; sequences of isolated syllables; speaker-dependent; speech recognition; syllable recognition; very large vocabulary; voice input; Character recognition; Computer science; Error correction; Hardware; Hidden Markov models; Lattices; Natural languages; Speech recognition; Text recognition; Vocabulary;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.222876
Filename
222876
Link To Document