DocumentCode :
1938933
Title :
A neural speaker model for speaker clustering
Author :
Nakamura, Satoshi ; Akabane, Toshio
Author_Institution :
Sharp Corp., Nara, Japan
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
853
Abstract :
A speaker model using a neural network is proposed for reference speaker clustering on speaker independent speech recognition. Speaker individuality is embedded in not only a static short time spectrum and a pitch frequency, but also a dynamic spectral pattern and pitch pattern. In conventional modeling, speaker individuality is based on the former static features. The authors try to capture the latter dynamic features, of speaker by a neural speaker model. Two methods, neural prediction modeling by multilayer perceptron and learning matrix vector-quantization, are considered for the speaker modeling. Using the measures of speaker modeling, speaker clustering of the reference patterns based on mutual information is carried out for speaker independent speech recognition
Keywords :
data compression; learning systems; neural nets; speech recognition; dynamic features; dynamic spectral pattern; learning matrix vector-quantization; multilayer perceptron; mutual information; neural network; neural prediction modeling; neural speaker model; pitch frequency; pitch pattern; reference patterns; reference speaker; speaker independent speech recognition; speaker individuality; static short time spectrum; Databases; Distortion measurement; Frequency; Information technology; Multilayer perceptrons; Mutual information; Neural networks; Predictive models; Speech recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150472
Filename :
150472
Link To Document :
بازگشت