Title :
Distributed speech recognition with codec parameters
Author :
Raj, Bhiksha ; Migdal, Joshua ; Singh, Rita
Author_Institution :
Mitsubishi Electr. Res. Labs., Cambridge, MA, USA
Abstract :
Communication devices which perform distributed speech recognition (DSR) tasks currently transmit standardized coded parameters of speech signals. Recognition features are extracted from signals reconstructed using these on a remote server. Since reconstruction losses degrade recognition performance, proposals are being considered to standardize DSR-codecs which derive recognition features, to be transmitted and used directly for recognition. However, such a codec must be embedded on the transmitting device, along with its current standard codec. Performing recognition using codec bitstreams avoids these complications: no additional feature-extraction mechanism is required on the device, and there are no reconstruction losses on the server. We propose an LDA-based method for extracting optimal feature sets from codec bitstreams and demonstrate that features so derived result in improved recognition performance for the LPC, GSM and CELP codecs. For GSM and CELP, we show that the performance is comparable to that with uncoded speech and standard DSR-codec features.
Keywords :
cellular radio; distributed processing; feature extraction; linear predictive coding; speech codecs; speech coding; speech recognition; CELP; GSM; LPC; codec parameters; coded parameters; distributed speech recognition; feature extraction; linear discriminant analysis; signal reconstruction; Code standards; Codecs; Degradation; Feature extraction; GSM; Linear predictive coding; Performance loss; Propagation losses; Proposals; Speech recognition;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034604