Title :
A technique to overcome the problem of small size database for automatic speaker recognition
Author :
AlSulaiman, Mansour ; Mahmood, Awais ; Ghulam, Muhammad ; Bencherif, Mohamed A. ; Alotaibi, Yousef A.
Author_Institution :
Comput. Eng. Dept., King Saud Univ., Riyadh, Saudi Arabia
Abstract :
Modeling a system by statistical methods needs large amount of data to train the system. In real life such data are sometimes not available or hard to collect. Modeling the system with small size database will produce a system with poor performance. In this paper we propose a method for increasing the size of the database. The method works by generating new samples from the original samples, using combinations of the following methods: speech lengthening, noise adding, and word reversal. To make a proof of concept, we used a severe test condition, in which the original database consists of one sample per speaker, for a speaker recognition system. We tested the system using original samples. The best results were 90% and 90.41% recognition rates for two subsets of the database for 25 and 50 speakers respectively.
Keywords :
database management systems; speaker recognition; speech enhancement; statistical analysis; automatic speaker recognition; small size database; statistical methods; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Speaker recognition; Speech; Speech recognition;
Conference_Titel :
Digital Information Management (ICDIM), 2010 Fifth International Conference on
Conference_Location :
Thunder Bay, ON
Print_ISBN :
978-1-4244-7572-8
DOI :
10.1109/ICDIM.2010.5664673