Supervised selection of prototypes for classification [speech recognition]

Author

Das, Subrata

Author_Institution

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

fYear

1990

fDate

3-6 Apr 1990

Firstpage

697

Abstract

Given sufficient samples of data tagged with their class identities, three techniques for constructing supervised prototypes to represent these classes are examined. The first method consists of averaging the tokens of each class separately to obtain the prototypes. In the second approach, several tokens, picked uniformly from each class, are designated as prototypes. The third technique involves a systematic search procedure to select effective prototypes and discard obsolete ones. Approximately two hours of continuous speech data from each of two speakers were used for experimentation. Each centisecond frame of speech was labeled with one of 200 phonetic subunit names utilizing hidden Markov model training and Viterbi alignment procedures. Prototypes were determined from the first part of the data, whereas the last part served to measure the classification performance. Average accuracies ranged from 24.2% with 200 prototypes in the first, to 31.5% with 32000 prototypes in the second, to 38.5% with 2258 prototypes in the third method

Keywords

Markov processes; learning systems; speech recognition; Viterbi alignment; hidden Markov model; learning systems; speech recognition; supervised prototypes; systematic search; Databases; Displays; Frequency; Hidden Markov models; Prototypes; Spectrogram; Speech; Speech recognition; Testing; Training data; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on

Conference_Location

Albuquerque, NM

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1990.115858

Filename

115858