DocumentCode :
1103530
Title :
A speaker-independent discrete utterance recognition system, combining deterministic and probabilistic strategies
Author :
Vysotsky, George J.
Author_Institution :
Voice Processing Corporation, Cambridge, MA
Volume :
32
Issue :
3
fYear :
1984
fDate :
6/1/1984 12:00:00 AM
Firstpage :
489
Lastpage :
499
Abstract :
A speaker-independent isolated word speech recognition system is discussed in which an unknown utterance is described by a set of speech feature measurements and then compared with a reference set of the same measurements obtained during a training procedure with a population of speakers. To reduce significantly the number of word confusions, a segmentation procedure is used. As a result, the whole vocabulary is divided into a number of subgroups characterized by a certain phonetic structure which is represented by the sequence of four types of segments. Each of these segments reflects quite reliably certain speaker independent events in the speech signal. Thus, the subgroup may consist of the reference sets for several confusable words, on the other hand, each word is represented by its reference sets in a number of the subgroups. The system was evaluated Using a 20-word vocabulary (including ten digits). A mean recognition accuracy of about 95 percent was obtained. The tradeoff between quite precise segmentation, which leads to a very bulky system if it is speaker-independent, and rough segmentation, which in some cases does not reduce the number of confusable words too much, is also discussed.
Keywords :
Acoustic signal processing; Binary search trees; Binary trees; Feature extraction; Helium; Pattern matching; Speech processing; Speech recognition; Vocabulary;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1984.1164351
Filename :
1164351
Link To Document :
بازگشت