DocumentCode :
2329037
Title :
Bangla phoneme recognition using hybrid features
Author :
Kotwal, Mohammed Rokibul Alam ; Hossain, Md Shahadat ; Hassan, Foyzul ; Muhammad, Ghulam ; Huda, Moahammad Nurul ; Rahman, Chowdhury Mofizur
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
fYear :
2010
fDate :
18-20 Dec. 2010
Firstpage :
718
Lastpage :
721
Abstract :
This paper presents a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of three stages: i) a multilayer neural network (MLN), which converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities, ii) the phoneme probabilities obtained from the first stage and corresponding Δ and ΔΔ are inserted into another MLN to improve the phoneme probabilities by reducing the context effect and (iii) the phoneme probabilities of current frame and corresponding MFCCs are fed into a hidden Markov model (HMM) based classifier to obtain more accurate phoneme strings. From the experiments on Bangla speech corpus prepared by us, it is observed that the proposed method provides higher phoneme recognition performance than the existing method. Moreover, it requires a fewer mixture components in the HMMs.
Keywords :
hidden Markov models; neural nets; speech recognition; Bangla phoneme recognition; automatic speech recognition; hidden Markov model based classifier; mel frequency cepstral coefficients; multilayer neural network; phoneme probabilities; acoustic features; automatic speech recognition; hidden Markov models; multilayer neural network; phoneme probabilities;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering (ICECE), 2010 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-6277-3
Type :
conf
DOI :
10.1109/ICELCE.2010.5700793
Filename :
5700793
Link To Document :
بازگشت