DocumentCode :
417277
Title :
Extended cluster information vector quantization (ECI-VQ) for robust classification
Author :
Arrowood, Jon A. ; Clements, Mark A.
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
This paper presents a novel extension to vector quantization referred to as extended cluster information (ECI). In this method the decoder retains more general statistics about the VQ clusters found during codebook training than the single prototypical point of conventional VQ systems. Typically this information is unnecessary, however if the items being compressed are feature space vectors used as input to a statistical pattern classification system, the extra probabilistic information can be used during the classification as in Bayes predictive classification (BPC) to improve recognition results. To demonstrate ECI-VQ, a simple experiment is described where the Aurora2 distributed speech recognition front end is altered to provide more aggressive mel frequency cepstral coefficient (MFCC) compression. As the bit-rate drops, the corresponding recognition performance suffers. It is then shown that using ECI-VQ as the input to an uncertain observation (UO) speech recognizer, a number of errors due to compression can be corrected with no extra cost in bit-rate.
Keywords :
cepstral analysis; error correction; feature extraction; pattern classification; pattern clustering; speech recognition; statistical analysis; table lookup; vector quantisation; Aurora2 distributed speech recognition front end; ECI-VQ; aggressive MFCC compression; codebook training; error correction; extended cluster information; feature space vectors; mel frequency cepstral coefficient; recognition performance; robust classification; statistical pattern classification system; uncertain observation speech recognizer; vector quantization; Decoding; Error correction; Mel frequency cepstral coefficient; Pattern classification; Pattern recognition; Prototypes; Robustness; Speech recognition; Statistics; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326129
Filename :
1326129
Link To Document :
بازگشت