Regional Information Center for Science and Technology (RICEST) - A comparative study of continuous speech recognition using neural networks and hidden Markov models

DocumentCode :

1913863

Title :

A comparative study of continuous speech recognition using neural networks and hidden Markov models

Author :

Renals, Steve ; McKelvie, David ; McInnes, Fergus

Author_Institution :

Centre for Speech Technology Research, University of Edinburgh, UK

fYear :

1991

fDate :

14-17 Apr 1991

Firstpage :

369

Abstract :

The recognition performances of two front ends are compared for two continuous speech recognition tasks. First, a neural network model (NNM) front end was used, with frame labeling performed by a radial basis function network and segmentation by a Viterbi algorithm. The second front end was a discrete hidden Markov model (HMM), featuring explicit state duration probability distributions. Two experiments were performed. The first used a speaker-dependent database, with a lexicon of 571 words. Using a low-perplexity grammar, the NNM front end produced a word accuracy of 94% and a sentence accuracy of 86%. This was slightly inferior to the HMM front end, which produced word accuracies of 96% and sentence accuracies of 88%. Without a grammar, word accuracies of 58% (NNM) and 49% (HMM) were recorded. The second set of experiments used the MIT portion of the TIMIT database (415 speakers and 2072 sentences in total). Results were poor for both front ends, with the NNM producing marginally better results.

Keywords :

Markov processes; neural nets; speech recognition; MIT portion; TIMIT database; Viterbi segmentation algorithm; continuous speech recognition; discrete hidden Markov model; explicit state duration probability distributions; frame labeling; lexical access; low-perplexity grammar; neural network model; radial basis function network; sentence accuracy; speaker-dependent database; word accuracy; Bridges; Feedforward systems; Hidden Markov models; Labeling; Neural networks; Physics; Spatial databases; Speech recognition; Testing; Viterbi algorithm;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location :

Toronto, Ont.

ISSN :

1520-6149

Print_ISBN :

0-7803-0003-3

Type :

conf

DOI :

10.1109/ICASSP.1991.150353

Filename :

150353

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1913863