Mel-frequency cepstral coefficients as features for automatic speaker recognition

Author

Ivan D. Jokic;Stevan D. Jokic;Vlado D. Delic;Zoran H. Perie

Author_Institution

Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia

fYear

2015

Firstpage

419

Lastpage

424

Abstract

Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.

Keywords

"Speech","Covariance matrices","Speech recognition","Speaker recognition","Databases","Shape","Cepstral analysis"

Publisher

ieee

Conference_Titel

Telecommunications Forum Telfor (TELFOR), 2015 23rd

Type

conf

DOI

10.1109/TELFOR.2015.7377497

Filename

7377497

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3727014