مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminatively estimated discrete, parametric and smoothed-discrete duration models for speech recognition

DocumentCode :

2179226

Title :

Discriminatively estimated discrete, parametric and smoothed-discrete duration models for speech recognition

Author :

Lehr, Maider ; Shafran, Izhak

Author_Institution :

Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

5340

Lastpage :

5343

Abstract :

Duration of phonemic segments provide important cues for distinguishing words in languages such as Arabic. Recently, we proposed a discriminatively estimated joint acoustic, duration and language model for large vocabulary speech recognition. In that work, we found simple discrete models to be effective for modeling duration, albeit they were neither smoothed nor parsimonious. These limitations are ad dressed here with two alternative models parametric and smoothed-discrete models. Unlike previous work on para metric duration model, we estimate their parameters discriminatively and derive an analytical expression for estimating the parameters of a log-normal distribution using a recent approach. On a large vocabulary Arabic task, we empirically evaluated different segmental units and durations models. Our results show bigrams of clustered states modeled with smoothed-discrete duration models are relatively more accurate and efficient than other models considered.

Keywords :

speech recognition; discriminatively estimated discrete; large vocabulary speech recognition; log-normal distribution; parameter estimation; phonemic segments; smoothed-discrete duration models; Acoustics; Data models; Hidden Markov models; Parametric statistics; Speech recognition; Training data; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947564

Filename :

5947564

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2179226