DocumentCode :
701521
Title :
Speaker recognition with artificial neural networks and MEL-frequency cepstral coefficients correlations
Author :
Soria, Roberto A.B. ; Cabral, Euvaldo F., Jr.
Author_Institution :
University of São Paulo - DEE/EPUSP, Laboratory of Communication and Signals - LCS, CAIXA POSTAL 8174, São Paulo, SP, 01065-970, Brazil
fYear :
1996
fDate :
10-13 Sept. 1996
Firstpage :
1
Lastpage :
4
Abstract :
The problem addressed in this paper is related to the fact that classical statistical approach for speaker recognition yields satisfactory results but at the expense of long length training and test utterances. An attempt to reduce the length of speaker samples is of great importance in the field of speaker recognition since the statistical approach, due to its limitations, is usually precluded from use in real-time applications. A novel method of text-independent speaker recognition which uses only the correlations among MFCCs, computed over selected speech segments of very-short length (approximately 120ms) is proposed. Three different neural networks — the Multi-Layer Perceptron (MLP), the Steinbuch´s Learnmatrix (SLM) and the Self-Organizing Feature Finder (SOFF) — are evaluated in a speaker recognition task. The ability of dimensionality reduction of the SOFF paradigm is also discussed.
Keywords :
Correlation; Detectors; Feature extraction; Speaker recognition; Speech; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
European Signal Processing Conference, 1996. EUSIPCO 1996. 8th
Conference_Location :
Trieste, Italy
Print_ISBN :
978-888-6179-83-6
Type :
conf
Filename :
7083248
Link To Document :
بازگشت