• DocumentCode
    302069
  • Title

    Feature parameter curve method for high performance NN-based speech recognition

  • Author

    Zhu, Shun ; Chen, Dao Wen ; Huang, Tai Yi

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    1
  • Abstract
    This paper proposes a novel high-performance NN-based isolated word speech recognition method named feature parameter curve method (FPCM), which contains three sub-methods, i.e., the feature frame parameter method, the broken line fitting method and the representing-point sequence (RPS) method. Using feature parameter curve to represent the contour of LPC-based cepstral coefficient sequence, our new methods solve the time alignment problem, greatly lighten the burden on neural network and thus reducing the complexity of the neural network. Compared with other NN-based methods, our methods have many strong points such as much faster training and recognition, simpler structure of neural nets, and is more robust. A recognition system of all Mandarin syllables has been established based on this method and the comparative experiments confirmed the good characteristics stated above. The idea of reconstructing original speech signals is also first shown in speech recognition
  • Keywords
    cepstral analysis; feature extraction; learning (artificial intelligence); linear predictive coding; natural languages; neural nets; parameter estimation; speech coding; speech recognition; LPC-based cepstral coefficient sequence; Mandarin syllables recognition; broken line fitting method; experiments; feature parameter curve method; isolated word speech recognition; neural network; representing-point sequence method; speech signal reconstruction; time alignment problem solution; training; Automation; Cepstral analysis; Character recognition; Curve fitting; Databases; Isolation technology; Neural networks; Pattern recognition; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.540275
  • Filename
    540275