Title :
A neural speaker model for speaker clustering
Author :
Nakamura, Satoshi ; Akabane, Toshio
Author_Institution :
Sharp Corp., Nara, Japan
Abstract :
A speaker model using a neural network is proposed for reference speaker clustering on speaker independent speech recognition. Speaker individuality is embedded in not only a static short time spectrum and a pitch frequency, but also a dynamic spectral pattern and pitch pattern. In conventional modeling, speaker individuality is based on the former static features. The authors try to capture the latter dynamic features, of speaker by a neural speaker model. Two methods, neural prediction modeling by multilayer perceptron and learning matrix vector-quantization, are considered for the speaker modeling. Using the measures of speaker modeling, speaker clustering of the reference patterns based on mutual information is carried out for speaker independent speech recognition
Keywords :
data compression; learning systems; neural nets; speech recognition; dynamic features; dynamic spectral pattern; learning matrix vector-quantization; multilayer perceptron; mutual information; neural network; neural prediction modeling; neural speaker model; pitch frequency; pitch pattern; reference patterns; reference speaker; speaker independent speech recognition; speaker individuality; static short time spectrum; Databases; Distortion measurement; Frequency; Information technology; Multilayer perceptrons; Mutual information; Neural networks; Predictive models; Speech recognition; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150472