DocumentCode :
330084
Title :
A recognition algorithm without the ending-point detection of Chinese based on the DTW and HMM unified model
Author :
Jie, Zhang ; Yan, Zhang ; Zhitong, Huang
Author_Institution :
Dept. of Autom., Nanjing Univ., China
Volume :
5
fYear :
1998
fDate :
11-14 Oct 1998
Firstpage :
4279
Abstract :
Describes a characteristic of Chinese speech, according to the duration time of Chinese consonant, and a recognition algorithm without end-point detection is proposed. Three situations in this algorithm have been pointed out, but the same result has been obtained from them. Compared with the traditional method, in this algorithm, it is not necessary to decide the end-point of speech signals. From the beginning, feature vectors, which consist of 15-order cepstrum coefficients and the average energy of each frame, are extracted in frames (length of each frame is 20 millisecond, the overlapping between two frames is 50%). By introducing the self-loop of the silent segment of the discrete time warping (DTW) and HMM unified model (DHUM), this algorithm is successfully implemented. In recognition of 99 similar words of Chinese, a first candidate recognition rate of 94.95% is obtained. To study the robustness of the algorithm, auditory representation of speech signals is also employed to obtained feature vectors. From the comparison of different feature-extraction methods, the conclusion is obtained that if an auditory feature is accepted for feature vectors, the robustness of the algorithm will be better; and because of the superiority of auditory representation to describe the characteristics of the silent segment of speech, the auditory feature-vector is more suitable for this algorithm
Keywords :
feature extraction; hidden Markov models; speech recognition; 15-order cepstrum coefficients; Chinese consonant; Chinese speech; auditory representation; discrete time warping; feature vectors; feature-extraction methods; speech signals; unified model; Automatic speech recognition; Automation; Cepstrum; Character recognition; Feature extraction; Hidden Markov models; Robustness; Signal detection; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
Conference_Location :
San Diego, CA
ISSN :
1062-922X
Print_ISBN :
0-7803-4778-1
Type :
conf
DOI :
10.1109/ICSMC.1998.727518
Filename :
727518
Link To Document :
بازگشت