DocumentCode :
1782400
Title :
Experimental framework for mel-scaled LP based Bangla speech recognition
Author :
Muslima, Umme ; Islam, M. Babul
Author_Institution :
Dept. of Electron. & Commun. Eng., Univ. of Inf. Technol. & Sci., Dhaka, Bangladesh
fYear :
2014
fDate :
8-10 March 2014
Firstpage :
56
Lastpage :
59
Abstract :
This paper deals with the recognition process of Bangla speech. The used database consists of two sets of data - one is for training containing 3824 utterances of Bangla digit sequences of 25 male and 25 female speakers and the other one is test dataset containing 1985 utterances of 26 male and 26 female speakers. The test set is subdivided into four groups such as clean1, clean2, clean3 and clean4. Mel-LPC based front-end has been used to design the front-end, since it incorporate auditory-like frequency resolution. The Mel-LPC is a time-domain feature and computationally efficient. The Mel-LPC based cepstral coefficients are obtained directly from the input speech by using generalized autocorrelation function. In this estimation process bilinear transformation is not required, and frequency warping is obtained by using a first-order all-pass filter instead of unit delay. A detail experimental framework both for front-end and HMM based back-end have been presented in this paper. The final recognition experiments show the satisfactory performance of the developed system. The recognition accuracy are found to be 98.11%, 98.05%, 97.94%, and 97.63%, for test sets clean1, clean2, clean3 and clean4, respectively.
Keywords :
filtering theory; speaker recognition; speech coding; speech recognition; Bangla digit sequences; Mel-LPC based cepstral coefficients; Mel-LPC based front-end; Mel-scaled LP based Bangla speech recognition; auditory-like frequency resolution; bilinear transformation; first-order all-pass filter; generalized autocorrelation function; linear prediction coding; speaker utterance; Accuracy; Computational modeling; Databases; Hidden Markov models; Information technology; Speech; Speech recognition; Bangla speech recognition; Bilinear transformation; HMM; Mel-LPC;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology (ICCIT), 2013 16th International Conference on
Conference_Location :
Khulna
Type :
conf
DOI :
10.1109/ICCITechn.2014.6997304
Filename :
6997304
Link To Document :
بازگشت