DocumentCode :
1972912
Title :
Effect of Articulatory Trajectories on Phoneme Recognition Performance
Author :
Huda, Mohammad Nurul ; Muhammad, Ghulam ; Hasan, Mohammad Mahedi ; Rahman, Sharif Mohammad Musfiqur ; Hassan, Foyzul ; Kotwal, Mohammed Rokibul Alam
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
fYear :
2010
fDate :
22-23 June 2010
Firstpage :
62
Lastpage :
65
Abstract :
This paper presents a method for describing the effect of articulatory trajectories on phoneme recognition. The proposed method comprises three stages. The first stage embeds three multilayer neural networks (MLNs): MLN (LF-DPF) that maps acoustic features or local features (LFs) onto articulatory features or distinctive phonetic features (DPFs), MLN (cntxt) that reduces misclassifications at phoneme boundaries, and MLN (Dyn) that controls dynamics of DPF features. The second stage incorporates an inhibition/enhancement (In/En) network by varying the trajectories of articulators to achieve categorical DPF movement by enhancing DPF peak patterns and inhibiting DPF dip patterns. The third stage decor relates continuous DPF vectors using the Gram-Schmidt algorithm before feeding into a hidden Markov model (HMM)-based classifier. In the experiments on Japanese Newspaper Article Sentences (JNAS) database, the proposed feature extractor shows the performance of phoneme recognition with the variation of different trajectories.
Keywords :
hidden Markov models; neural nets; pattern classification; speech; speech processing; speech recognition; DPF dip patterns; DPF peak patterns; DPF vectors; Gram-Schmidt algorithm; Japanese newspaper article sentences database; acoustic features; articulatory features; articulatory trajectories; distinctive phonetic features; enhancement network; feature extractor; hidden Markov model-based classifier; inhibition network; local features; multilayer neural networks; phoneme recognition performance; Artificial neural networks; Delay effects; Electronic mail; Feature extraction; Hidden Markov models; Speech; Trajectory; Gram-Schmidt orthogonalization; articulatory features; hidden markov model; multilayer neural networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Cognitive Informatics (ICICCI), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-6640-5
Electronic_ISBN :
978-1-4244-6641-2
Type :
conf
DOI :
10.1109/ICICCI.2010.37
Filename :
5566037
Link To Document :
بازگشت