DocumentCode :
2374810
Title :
Speaker classification based on combined neural network and fuzzy decision
Author :
Bin, Zhu ; Yisheng, Zhu
Author_Institution :
Dept. of Electron. Eng., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
1994
fDate :
1994
Abstract :
Speech recognition has played a key role in man-machine communication. The authors present a new speaker classifier based on combined neural network and integrate the fuzzy decision and knowledge supervision. Here, they first deal with context-dependent speaker classification. The input speech is sampled at 10 kHz and a 12-order cepstrum is obtained in real-time by a TMS320C25 board. A 2-stage combined neural network is proposed. The first stage is the neural prediction network in order to include the time correlation of the speech signal and to compress and normalize the parameters. The authors use the back-and-forward prediction network. After convergence, the speaker information is embedded in the weights of the predictor. The authors modify the conventional BP algorithm to the fuzzy BP algorithm for training, which has obtained better results. Considering its outstanding ability of characteristic extraction and auto-clustering, the main network is the Self-organizing Feature Map (SOFM). The weights of the predictor become the input of the SOFM which naturally needs no normalization of its input parameters. Using the algorithm proposed by T. Kohonen, the map was rapidly trained and afterwards it is marked for each speaker who says the same word. Then, the characteristics of each speaker are reflected in the map
Keywords :
speaker recognition; 10 kHz; 12-order cepstrum; 2-stage combined neural network; T. Kohonen algorithm; TMS320C25 board; auto-clustering; back-and-forward prediction network; characteristic extraction; context-dependent speaker classification; predictor weights; speaker information; speech signal time correlation; Cepstrum; Communication system control; Control systems; Fuzzy neural networks; Instruments; Neural networks; Partial response channels; Robot control; Speaker recognition; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 1994. Engineering Advances: New Opportunities for Biomedical Engineers. Proceedings of the 16th Annual International Conference of the IEEE
Conference_Location :
Baltimore, MD
Print_ISBN :
0-7803-2050-6
Type :
conf
DOI :
10.1109/IEMBS.1994.415354
Filename :
415354
Link To Document :
بازگشت