DocumentCode
3727014
Title
Mel-frequency cepstral coefficients as features for automatic speaker recognition
Author
Ivan D. Jokic;Stevan D. Jokic;Vlado D. Delic;Zoran H. Perie
Author_Institution
Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
fYear
2015
Firstpage
419
Lastpage
424
Abstract
Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.
Keywords
"Speech","Covariance matrices","Speech recognition","Speaker recognition","Databases","Shape","Cepstral analysis"
Publisher
ieee
Conference_Titel
Telecommunications Forum Telfor (TELFOR), 2015 23rd
Type
conf
DOI
10.1109/TELFOR.2015.7377497
Filename
7377497
Link To Document