Title :
Model-based articulatory phonetic features for improved speech recognition
Author :
Huang, Guangpu ; Er, Meng Joo
Author_Institution :
Comput. Vision Lab., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
We describe a neural-based articulatory phonetic inversion model to improve the recognition of acoustically varying vowels and syllable-initial plosives. The model uses a set of continuous-valued articulatory phonetic features (APFs) to explore the interactions between the motor control of the articulators and acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APF mapping.
Keywords :
learning (artificial intelligence); neural nets; speech recognition; APF mapping; TIMIT sentences; acoustically varying vowels; asynchronous feature changes; heuristic learning algorithm; human speech production; model-based articulatory phonetic features; motor control; neural based articulatory phonetic inversion model; phoneme boundaries; syllable initial plosives; Hidden Markov models; Muscles; Production; Speech; Synthesizers; Tongue
Conference_Title :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252748