• DocumentCode
    701521
  • Title

    Speaker recognition with artificial neural networks and MEL-frequency cepstral coefficients correlations

  • Author

    Soria, Roberto A.B. ; Cabral, Euvaldo F., Jr.

  • Author_Institution
    University of São Paulo - DEE/EPUSP, Laboratory of Communication and Signals - LCS, CAIXA POSTAL 8174, São Paulo, SP, 01065-970, Brazil
  • fYear
    1996
  • fDate
    10-13 Sept. 1996
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The problem addressed in this paper is related to the fact that classical statistical approach for speaker recognition yields satisfactory results but at the expense of long length training and test utterances. An attempt to reduce the length of speaker samples is of great importance in the field of speaker recognition since the statistical approach, due to its limitations, is usually precluded from use in real-time applications. A novel method of text-independent speaker recognition which uses only the correlations among MFCCs, computed over selected speech segments of very-short length (approximately 120ms) is proposed. Three different neural networks — the Multi-Layer Perceptron (MLP), the Steinbuch´s Learnmatrix (SLM) and the Self-Organizing Feature Finder (SOFF) — are evaluated in a speaker recognition task. The ability of dimensionality reduction of the SOFF paradigm is also discussed.
  • Keywords
    Correlation; Detectors; Feature extraction; Speaker recognition; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    European Signal Processing Conference, 1996. EUSIPCO 1996. 8th
  • Conference_Location
    Trieste, Italy
  • Print_ISBN
    978-888-6179-83-6
  • Type

    conf

  • Filename
    7083248