DocumentCode :
2657350
Title :
An Inhibition/Enhancement network for noise robust ASR
Author :
Huda, Mohammad Nurul ; Hossain, Md Shahadat ; Hassan, Foyzul ; Hasan, Mohammad Mahedi ; Lisa, Nusrat Jahan ; Muhammad, Ghulam
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
fYear :
2010
fDate :
23-25 Dec. 2010
Firstpage :
446
Lastpage :
451
Abstract :
This paper describes an evaluation of Inhibition/Enhancement (In/En) network for noise robust automatic speech recognition (ASR). In articulatory feature based speech recognition using neural network, the In/En network is needed to discriminate whether the articulatory features (AFs) dynamic patterns of trajectories are convex or concave. The network is used to achieve categorical AFs movement by enhancing AFs peak patterns (convex patterns) and inhibiting AFs dip patterns (concave patterns). We have analyzed the effectiveness of the In/En algorithm by incorporating it into a system which consists of three stages: a) Multilayer Neural Networks (MLNs), b) an In/En Network and c) the Gram-Schmidt (GS) algorithm for orthogonalization. From the experiments using Japanese Newspaper Article Sentences (JNAS) database in clean and noisy acoustic environments, it is observed that the In/En network plays a significant role on the improvement of phoneme recognition performance. Moreover, the In/En network reduces the number of mixture components needed in Hidden Markov Models (HMMs).
Keywords :
hidden Markov models; multilayer perceptrons; speech recognition; AF dip patterns; AF peak patterns; Gram-Schmidt orthogonalization algorithm; Japanese newspaper article sentences; articulatory feature based speech recognition; enhancement network; hidden Markov models; inhibition network; multilayer neural networks; noise robust automatic speech recognition; phoneme recognition performance; Computers; Conferences; Information technology; Articulatory Features; Distinctive Phonetic Features; Hidden Markov Model; Inhibition/Enhancement Network; Local Features; Multilayer Neural Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology (ICCIT), 2010 13th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-8496-6
Type :
conf
DOI :
10.1109/ICCITECHN.2010.5723899
Filename :
5723899
Link To Document :
بازگشت