مرکز منطقه ای اطلاع رساني علوم و فناوري - Model-based articulatory phonetic features for improved speech recognition

DocumentCode :

2776819

Title :

Model-based articulatory phonetic features for improved speech recognition

Author :

Huang, Guangpu ; Er, Meng Joo

Author_Institution :

Comput. Vision Lab., Nanyang Technol. Univ., Singapore, Singapore

fYear :

2012

fDate :

10-15 June 2012

Firstpage :

Lastpage :

Abstract :

We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.

Keywords :

learning (artificial intelligence); neural nets; speech recognition; APF mapping; TIMIT sentences; acoustically varying vowels; asynchronous feature changes; heuristic learning algorithm; human speech production; model-based articulatory phonetic features; motor control; neural based articulatory phonetic inversion model; phoneme boundaries; speech recognition; syllable initial plosives; Hidden Markov models; Muscles; Production; Speech; Synthesizers; Tongue;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2012 International Joint Conference on

Conference_Location :

Brisbane, QLD

ISSN :

2161-4393

Print_ISBN :

978-1-4673-1488-6

Electronic_ISBN :

2161-4393

Type :

conf

DOI :

10.1109/IJCNN.2012.6252748

Filename :

6252748

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2776819