• DocumentCode
    328229
  • Title

    Estimation of multi-templates for speech recognition by using spectrum reforming and self-organized clustering

  • Author

    Imamura, Daisuke ; Hiroshige, Makoto ; Nakagaki, Atsushi ; Miyanaga, Yoshikazu ; Tochinai, Koji

  • Author_Institution
    Dept. of Electron. Eng., Hokkaido Univ., Sapporo, Japan
  • Volume
    1
  • fYear
    1993
  • fDate
    25-29 Oct. 1993
  • Firstpage
    247
  • Abstract
    This report proposes a new recognition method of continuous Japanese speech. In order to overcome some difficulties in a large vocabulary recognition system, which requires large calculation cost and memory area, the proposed system only recognizes the single phonemes which are automatically selected from continuous speech. The automatic selection can be realized using a new spectrum reforming technique, i.e., a method which smooths the estimated parameters using a nonlinear filter and a parameter quantization method based on a priori rules. This system can also easily search the optimal template for speech phonemes using the self-organized clustering method. This technique is suitable for optimum estimation of multi-templates for continuous speech recognition.
  • Keywords
    neural nets; nonlinear filters; probability; quantisation (signal); speech recognition; continuous Japanese speech; multi-templates estimation; neural networks; nonlinear filter; parameter quantization; probability; self-organized clustering; spectrum reforming; speech phonemes; speech recognition; Automatic speech recognition; Clustering methods; Costs; Filters; Information processing; Multimedia systems; Parameter estimation; Quantization; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1993. IJCNN '93-Nagoya. Proceedings of 1993 International Joint Conference on
  • Print_ISBN
    0-7803-1421-2
  • Type

    conf

  • DOI
    10.1109/IJCNN.1993.713903
  • Filename
    713903