DocumentCode :
3727014
Title :
Mel-frequency cepstral coefficients as features for automatic speaker recognition
Author :
Ivan D. Jokic;Stevan D. Jokic;Vlado D. Delic;Zoran H. Perie
Author_Institution :
Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
fYear :
2015
Firstpage :
419
Lastpage :
424
Abstract :
Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.
Keywords :
"Speech","Covariance matrices","Speech recognition","Speaker recognition","Databases","Shape","Cepstral analysis"
Publisher :
ieee
Conference_Titel :
Telecommunications Forum Telfor (TELFOR), 2015 23rd
Type :
conf
DOI :
10.1109/TELFOR.2015.7377497
Filename :
7377497
Link To Document :
بازگشت