DocumentCode
2770253
Title
Interpolative variable frame rate transmission of speech features for distributed speech recognition
Author
Deng, Huiqun ; Shaughnessy, Douglas O. ; Dahan, Jean ; Ganong, William F.
Author_Institution
Univ. of Quebec, Montreal
fYear
2007
fDate
9-13 Dec. 2007
Firstpage
591
Lastpage
595
Abstract
In distributed speech recognition, vector quantization is used to reduce the number of bits for coding speech features at the user end in order to save energy for transmitting speech feature streams to remote recognizers and reduce data traffic congestion. We notice that the overall bit rate of the transmitted feature streams could be further reduced by not sending redundant frames that can be interpolated at the remote server from received frames. Interpolation introduces errors and may degrade speech recognition. This paper investigates the methods of selecting frames for transmission and the effect of interpolation on recognition. Experiments on a large vocabulary recognizer show that with spline interpolation, the overall frame rate for transmission can be reduced by about 50% with a relative increase in word error rate less than 5.2% for clean and noisy speech.
Keywords
interpolation; speech coding; speech recognition; splines (mathematics); vector quantisation; distributed speech recognition; interpolative variable frame rate transmission; large vocabulary recognizer; speech coding; spline; vector quantization; Bit rate; Degradation; Error analysis; Interpolation; Noise reduction; Speech coding; Speech recognition; Spline; Vector quantization; Vocabulary; Data compression; interpolation; speech coding; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location
Kyoto
Print_ISBN
978-1-4244-1746-9
Electronic_ISBN
978-1-4244-1746-9
Type
conf
DOI
10.1109/ASRU.2007.4430179
Filename
4430179
Link To Document