DocumentCode
701521
Title
Speaker recognition with artificial neural networks and MEL-frequency cepstral coefficients correlations
Author
Soria, Roberto A.B. ; Cabral, Euvaldo F., Jr.
Author_Institution
University of São Paulo - DEE/EPUSP, Laboratory of Communication and Signals - LCS, CAIXA POSTAL 8174, São Paulo, SP, 01065-970, Brazil
fYear
1996
fDate
10-13 Sept. 1996
Firstpage
1
Lastpage
4
Abstract
The problem addressed in this paper is related to the fact that classical statistical approach for speaker recognition yields satisfactory results but at the expense of long length training and test utterances. An attempt to reduce the length of speaker samples is of great importance in the field of speaker recognition since the statistical approach, due to its limitations, is usually precluded from use in real-time applications. A novel method of text-independent speaker recognition which uses only the correlations among MFCCs, computed over selected speech segments of very-short length (approximately 120ms) is proposed. Three different neural networks — the Multi-Layer Perceptron (MLP), the Steinbuch´s Learnmatrix (SLM) and the Self-Organizing Feature Finder (SOFF) — are evaluated in a speaker recognition task. The ability of dimensionality reduction of the SOFF paradigm is also discussed.
Keywords
Correlation; Detectors; Feature extraction; Speaker recognition; Speech; Speech recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
European Signal Processing Conference, 1996. EUSIPCO 1996. 8th
Conference_Location
Trieste, Italy
Print_ISBN
978-888-6179-83-6
Type
conf
Filename
7083248
Link To Document