مرکز منطقه ای اطلاع رساني علوم و فناوري - A segmental speech model with applications to word spotting

DocumentCode :

2021912

Title :

A segmental speech model with applications to word spotting

Author :

Gish, Herbert ; Ng, Kenney

Author_Institution :

BBN Systems & Technologies, Cambridge, MA, USA

Volume :

fYear :

1993

fDate :

27-30 April 1993

Firstpage :

447

Abstract :

The authors present a segmental speech model that explicitly models the dynamics in a variable-duration speech segment by using a time-varying trajectory model of the speech features in the segment. Each speech segment is represented by a set of statistics which includes a time-varying trajectory, a residual error covariance around the trajectory, and the number of frames in the segment. These statistics replace the frames in the segment and become the data that are modeled by either HMMs (hidden Markov models) or mixture models. This segment model is used to develop a secondary processing algorithm that rescores putative events hypothesized by a primary HMM word spotter to try to improve performance by discriminating true keywords from false alarms. This algorithm is evaluated on a keyword spotting task using the Road Rally Database, and performance is shown to improve significantly over that of the primary word spotter. The segmental model is also used on a TIMIT vowel classification task to evaluate its modeling capability.<>

Keywords :

error statistics; hidden Markov models; speech recognition; time-varying systems; HMM word spotter; Road Rally Database; TIMIT vowel classification; hidden Markov models; keywords; performance; residual error covariance; secondary processing algorithm; segmental speech model; time-varying trajectory model; variable-duration speech segment; word spotting;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on

Conference_Location :

Minneapolis, MN, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.1993.319337

Filename :

319337

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2021912