Title :
Effect of articulatory Δ and ΔΔ parameters on multilayer neural network based speech recognition
Author :
Banik, Manoj ; Kotwal, Mohammed Rokibul Alam ; Hassan, Foyzul ; Islam, Gazi Md Moshfiqul ; Rahman, Sharif Mohammad Musfiqur ; Hasan, Mohammad Mahedi ; Muhammad, Ghulam ; Mohammad, Nurul Huda
Author_Institution :
Dept. of CSE, Ahsanullah Univ. of Sci. & Technol., Dhaka, Bangladesh
Abstract :
This paper describes an effect of articulatory dynamic parameters (Δ and ΔΔ) on neural network based automatic speech recognition(ASR). Articulatory features (AFs) or distinctive phonetic features (DPFs)-based system shows its superiority in performances over acoustic features- based in ASR. These performances can be further improved by incorporating articulatory dynamic parameters into it. In this paper, we have proposed such a phoneme recognition system that comprises three stages: (i) DPFs extraction using a multilayer neural network (MLN) from acoustic features, (ii) incorporation of dynamic parameters into another MLN for reducing DPF context, and (iii) addition of an Inhibition/Enhancement (In/En) network for categorizing the DPF movement more accurately and Gram-Schmidt (GS) orthogonalization procedure for decorrelating the inhibited/enhanced data vector before connecting with hidden Markov model (HMMs)-based classifier. From the experiments on Japanese Newspaper Article Sentences (JNAS), it is observed that the proposed method provides a higher phoneme correct rate over the method that does not incorporate dynamic articulatory parameters. Moreover, it reduces mixture components in HMM for obtaining a higher recognition performance.
Keywords :
hidden Markov models; neural nets; speech recognition; ΔΔ parameter; DPF context; Gram-Schmidt orthogonalization procedure; acoustic feature; articulatory Δ parameter; articulatory dynamic parameter; automatic speech recognition; distinctive phonetic features; hidden Markov model based classifier; multilayer neural network; phoneme recognition system; phonetic feature; Acoustics; Artificial neural networks; Context; Feature extraction; Hidden Markov models; Speech; Speech recognition; Distinctive Phonetic Features; Dynamic Parameters; Hidden Markov Models; Local Features; Multi-Layer Neural Network;
Conference_Titel :
Circuits and Systems (APCCAS), 2010 IEEE Asia Pacific Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7454-7
DOI :
10.1109/APCCAS.2010.5775027