A speaker-independent discrete utterance recognition system, combining deterministic and probabilistic strategies

Author

Vysotsky, George J.

Author_Institution

Voice Processing Corporation, Cambridge, MA

Volume

32

Issue

3

fYear

1984

fDate

6/1/1984 12:00:00 AM

Firstpage

489

Lastpage

499

Abstract

A speaker-independent isolated word speech recognition system is discussed in which an unknown utterance is described by a set of speech feature measurements and then compared with a reference set of the same measurements obtained during a training procedure with a population of speakers. To reduce significantly the number of word confusions, a segmentation procedure is used. As a result, the whole vocabulary is divided into a number of subgroups characterized by a certain phonetic structure which is represented by the sequence of four types of segments. Each of these segments reflects quite reliably certain speaker independent events in the speech signal. Thus, the subgroup may consist of the reference sets for several confusable words, on the other hand, each word is represented by its reference sets in a number of the subgroups. The system was evaluated Using a 20-word vocabulary (including ten digits). A mean recognition accuracy of about 95 percent was obtained. The tradeoff between quite precise segmentation, which leads to a very bulky system if it is speaker-independent, and rough segmentation, which in some cases does not reduce the number of confusable words too much, is also discussed.

Keywords

Acoustic signal processing; Binary search trees; Binary trees; Feature extraction; Helium; Pattern matching; Speech processing; Speech recognition; Vocabulary;

fLanguage

English

Journal_Title

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher

ieee

ISSN

0096-3518

Type

jour

DOI

10.1109/TASSP.1984.1164351

Filename

1164351