• DocumentCode
    2374810
  • Title

    Speaker classification based on combined neural network and fuzzy decision

  • Author

    Bin, Zhu ; Yisheng, Zhu

  • Author_Institution
    Dept. of Electron. Eng., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    1994
  • fDate
    1994
  • Abstract
    Speech recognition has played a key role in man-machine communication. The authors present a new speaker classifier based on combined neural network and integrate the fuzzy decision and knowledge supervision. Here, they first deal with context-dependent speaker classification. The input speech is sampled at 10 kHz and a 12-order cepstrum is obtained in real-time by a TMS320C25 board. A 2-stage combined neural network is proposed. The first stage is the neural prediction network in order to include the time correlation of the speech signal and to compress and normalize the parameters. The authors use the back-and-forward prediction network. After convergence, the speaker information is embedded in the weights of the predictor. The authors modify the conventional BP algorithm to the fuzzy BP algorithm for training, which has obtained better results. Considering its outstanding ability of characteristic extraction and auto-clustering, the main network is the Self-organizing Feature Map (SOFM). The weights of the predictor become the input of the SOFM which naturally needs no normalization of its input parameters. Using the algorithm proposed by T. Kohonen, the map was rapidly trained and afterwards it is marked for each speaker who says the same word. Then, the characteristics of each speaker are reflected in the map
  • Keywords
    speaker recognition; 10 kHz; 12-order cepstrum; 2-stage combined neural network; T. Kohonen algorithm; TMS320C25 board; auto-clustering; back-and-forward prediction network; characteristic extraction; context-dependent speaker classification; predictor weights; speaker information; speech signal time correlation; Cepstrum; Communication system control; Control systems; Fuzzy neural networks; Instruments; Neural networks; Partial response channels; Robot control; Speaker recognition; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 1994. Engineering Advances: New Opportunities for Biomedical Engineers. Proceedings of the 16th Annual International Conference of the IEEE
  • Conference_Location
    Baltimore, MD
  • Print_ISBN
    0-7803-2050-6
  • Type

    conf

  • DOI
    10.1109/IEMBS.1994.415354
  • Filename
    415354