Title :
Incorporation of dynamic parameters in hybrid feature-based Bangla phoneme recognition using multilayer Neural Networks
Author :
Kotwal, Mohammed Rokibul Alam ; Hassan, Foyzul ; Ahmed, Faisal ; Daud, Shakib Ibn ; Alam, Md Shafiul ; Huda, Mohammad Nurul
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
Abstract :
This paper presents a neural network-based Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of three stages. In the first stage, a multilayer neural network (MLN) converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities. The second stage computes dynamic parameters, velocity (Δ) and acceleration (ΔΔ), from the phoneme probabilities using three-point linear regression (LR). Finally, the phoneme probabilities, the dynamic parameters Δ and ΔΔ, and the input MFCCs are combined as hybrid features and fed into a hidden Markov model (HMM) based classifier to obtain more accurate phoneme strings. Experiments on a Bangla speech corpus prepared by the authors show that the proposed method provides higher phoneme recognition performance than the existing method. Moreover, it requires fewer mixture components in the HMMs.
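The second and third stages described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact regression formula, so this assumes the usual least-squares slope over frames t-1, t, t+1, which reduces to (x[t+1] - x[t-1]) / 2. The function names `delta_three_point` and `hybrid_features`, the edge-frame padding, and the (frames, dimensions) array layout are all hypothetical choices made for illustration, not details taken from the paper.

```python
import numpy as np


def delta_three_point(x):
    """Slope of the least-squares line through frames t-1, t, t+1.

    x is a (T, D) array with one row per frame. For three equally spaced
    points the regression slope reduces to (x[t+1] - x[t-1]) / 2.
    Assumption: the first and last frames are repeated at the boundaries.
    """
    x = np.asarray(x, dtype=float)
    padded = np.pad(x, ((1, 1), (0, 0)), mode="edge")
    return (padded[2:] - padded[:-2]) / 2.0


def hybrid_features(mfcc, phoneme_probs):
    """Concatenate phoneme probabilities, their delta and delta-delta, and MFCCs.

    Assumption: both inputs are (T, D) arrays aligned frame by frame.
    """
    velocity = delta_three_point(phoneme_probs)   # delta
    acceleration = delta_three_point(velocity)    # delta-delta
    return np.hstack([phoneme_probs, velocity, acceleration, mfcc])
```

In this sketch, `hybrid_features` would produce the per-frame vectors that an HMM-based classifier consumes; the MLN that generates `phoneme_probs` and the HMM itself are outside its scope.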
Keywords :
cepstral analysis; hidden Markov models; multilayer perceptrons; natural language processing; probability; regression analysis; signal classification; speech recognition; Bangla speech corpus; HMM based classifier; acceleration parameters; acoustic feature conversion; automatic speech recognition; dynamic parameters; hidden Markov model; hybrid feature-based Bangla phoneme recognition; mel frequency cepstral coefficients; multilayer neural networks; phoneme probabilities; phoneme strings; three point linear regression; Context; Automatic Speech Recognition; Dynamic Parameters; Hidden Markov Model; Mel Frequency Cepstral Coefficients; Multilayer Neural Network;
Conference_Title :
Computer and Information Technology (ICCIT), 2011 14th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-61284-907-2
DOI :
10.1109/ICCITechn.2011.6164883