DocumentCode :
1909788
Title :
Auditory model representation and comparison for speaker recognition
Author :
Colombi, John M. ; Anderson, Timothy R. ; Rogers, Steven K. ; Ruck, Dennis W. ; Warhola, Gregory T.
Author_Institution :
AFIT, Wright-Patterson AFB, OH, USA
fYear :
1993
fDate :
1993
Firstpage :
1914
Abstract :
The TIMIT and KING databases are used to compare proven spectral processing techinques to an auditory neural representation for speaker identification. The feature sets compared are linear prediction coding (LPC) cepstral coefficients and auditory nerve firing rates using the Payton model (1988). Two clustering algorithms, one statistically based and the other a neural approach, are used to generate speaker-specific codebook vectors. These algorithms are the Linde-Buzo-Gray algorithm and a Kohonen self-organizing feature map. The resulting vector-quantized distortion-based classification indicates the auditory model performs statistically equal to the LPC cepstral representation in clean environments and outperforms the LPC cepstral in noisy environments and in test data recorded over multiple sessions (greater intra-speaker distortions)
Keywords :
linear predictive coding; self-organising feature maps; spectral analysis; speech coding; speech recognition; KING databases; Kohonen self-organizing feature map; Linde-Buzo-Gray algorithm; TIMIT database; auditory nerve firing rates; auditory neural representation; clustering; linear prediction coding cepstral coefficients; speaker identification; speaker-specific codebook vectors; spectral processing techinques; vector-quantized distortion-based classification; Biomembranes; Cepstral analysis; Clustering algorithms; Databases; Frequency; Hidden Markov models; Linear predictive coding; Predictive models; Speaker recognition; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1993., IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0999-5
Type :
conf
DOI :
10.1109/ICNN.1993.298849
Filename :
298849
Link To Document :
بازگشت