مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

542175

Title :

Direct models for phoneme recognition

Author :

Lilchododev, Anton ; Gao, Yuqing

Author_Institution :

School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

This paper presents the theoretical framework of a new statistical model for phoneme recognition. In contrast with traditional HMMs, the posterior probability of a state sequence given an observation sequence is computed directly with the new model. The development of this paper is based on Maximum Entropy Markov Models (MEMMs[5]), appearing as a result of the application of Maximum Entropy principle to sequential processes. The main contributions of our work include modifying the MEMM to large-scale speech recognition problem and introduction of another direct model (NOM), which overcomes the shortcome of the MEMM of poor representation of contextual information. Direct comparison of direct model phoneme recognizers with HMM-based recognizers demonstrates the superiority of the new models, particularly on smaller training sets.

Keywords :

Accuracy; Biological system modeling; Data models; Entropy; Heuristic algorithms; Hidden Markov models; Mathematical model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743661

Filename :

5743661

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542175