Title :
Bangla ASR design by suppressing gender factor with gender-independent and gender-based HMM classifiers
Author :
Hassan, Foyzul ; Kotwal, Mohammed Rokibul Alam ; Huda, Mohammad Nurul
Author_Institution :
Dept. of Comput. Sci. & Eng., United Int. Univ., Dhaka, Bangladesh
Abstract :
Hidden factor such as gender characteristic plays an important role on the performance of Bangla (widely used as Bengali) automatic speech recognition (ASR). If there is a suppression process that represses the decrease of differences in acoustic-likelihood among categories resulted from gender factors, a robust ASR system can be realized. In our previous paper, we proposed a technique of gender effects suppression that composed of two hidden Markov model (HMM)-based classifiers that focused on a gender factor. In the proposed study, we have designed a new ASR for Bangla by suppressing the gender effects, which embeds three HMM-based classifiers for corresponding male, female and geneder-independent (GI) characteristics. In an experiment on Bangla speech database prepared by us, the proposed system that incorporates GI-classifier has achieved a significant improvement of word correct rate, word accuracy and sentence correct rate in comparison with our previous method that did not incorporate GI-classifier.
Keywords :
gender issues; hidden Markov models; signal classification; speech recognition; Bangla ASR design; Bangla speech database; acoustic-likelihood; automatic speech recognition; gender characteristic; gender effects suppression; gender factor; gender-based HMM classifiers; gender-independent; hidden Markov model-based classifiers; sentence correct rate; word accuracy; word correct rate; Accuracy; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; acoustic model; automatic speech recognition; gender effects suppression; hidden Markov model;
Conference_Titel :
Information and Communication Technologies (WICT), 2011 World Congress on
Conference_Location :
Mumbai
Print_ISBN :
978-1-4673-0127-5
DOI :
10.1109/WICT.2011.6141432