DocumentCode :
2776819
Title :
Model-based articulatory phonetic features for improved speech recognition
Author :
Huang, Guangpu ; Er, Meng Joo
Author_Institution :
Comput. Vision Lab., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2012
fDate :
10-15 June 2012
Firstpage :
1
Lastpage :
8
Abstract :
We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.
Keywords :
learning (artificial intelligence); neural nets; speech recognition; APF mapping; TIMIT sentences; acoustically varying vowels; asynchronous feature changes; heuristic learning algorithm; human speech production; model-based articulatory phonetic features; motor control; neural based articulatory phonetic inversion model; phoneme boundaries; speech recognition; syllable initial plosives; Hidden Markov models; Muscles; Production; Speech; Synthesizers; Tongue;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
ISSN :
2161-4393
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
Type :
conf
DOI :
10.1109/IJCNN.2012.6252748
Filename :
6252748
Link To Document :
بازگشت