Title :
Supervised selection of prototypes for classification [speech recognition]
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Given sufficient samples of data tagged with their class identities, three techniques for constructing supervised prototypes to represent these classes are examined. The first method consists of averaging the tokens of each class separately to obtain the prototypes. In the second approach, several tokens, picked uniformly from each class, are designated as prototypes. The third technique involves a systematic search procedure to select effective prototypes and discard obsolete ones. Approximately two hours of continuous speech data from each of two speakers were used for experimentation. Each centisecond frame of speech was labeled with one of 200 phonetic subunit names utilizing hidden Markov model training and Viterbi alignment procedures. Prototypes were determined from the first part of the data, whereas the last part served to measure the classification performance. Average accuracies ranged from 24.2% with 200 prototypes in the first, to 31.5% with 32000 prototypes in the second, to 38.5% with 2258 prototypes in the third method
Keywords :
Markov processes; learning systems; speech recognition; Viterbi alignment; hidden Markov model; learning systems; speech recognition; supervised prototypes; systematic search; Databases; Displays; Frequency; Hidden Markov models; Prototypes; Spectrogram; Speech; Speech recognition; Testing; Training data; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115858