• DocumentCode
    330084
  • Title

    A recognition algorithm without the ending-point detection of Chinese based on the DTW and HMM unified model

  • Author

    Jie, Zhang ; Yan, Zhang ; Zhitong, Huang

  • Author_Institution
    Dept. of Autom., Nanjing Univ., China
  • Volume
    5
  • fYear
    1998
  • fDate
    11-14 Oct 1998
  • Firstpage
    4279
  • Abstract
    Describes a characteristic of Chinese speech, according to the duration time of Chinese consonant, and a recognition algorithm without end-point detection is proposed. Three situations in this algorithm have been pointed out, but the same result has been obtained from them. Compared with the traditional method, in this algorithm, it is not necessary to decide the end-point of speech signals. From the beginning, feature vectors, which consist of 15-order cepstrum coefficients and the average energy of each frame, are extracted in frames (length of each frame is 20 millisecond, the overlapping between two frames is 50%). By introducing the self-loop of the silent segment of the discrete time warping (DTW) and HMM unified model (DHUM), this algorithm is successfully implemented. In recognition of 99 similar words of Chinese, a first candidate recognition rate of 94.95% is obtained. To study the robustness of the algorithm, auditory representation of speech signals is also employed to obtained feature vectors. From the comparison of different feature-extraction methods, the conclusion is obtained that if an auditory feature is accepted for feature vectors, the robustness of the algorithm will be better; and because of the superiority of auditory representation to describe the characteristics of the silent segment of speech, the auditory feature-vector is more suitable for this algorithm
  • Keywords
    feature extraction; hidden Markov models; speech recognition; 15-order cepstrum coefficients; Chinese consonant; Chinese speech; auditory representation; discrete time warping; feature vectors; feature-extraction methods; speech signals; unified model; Automatic speech recognition; Automation; Cepstrum; Character recognition; Feature extraction; Hidden Markov models; Robustness; Signal detection; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
  • Conference_Location
    San Diego, CA
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-4778-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.1998.727518
  • Filename
    727518