DocumentCode
492514
Title
Voicing Class Dependent Huffman Coding of Compressed Front-End Feature Vector for Distributed Speech Recognition
Author
Kim, Deok Su ; Kim, Hong Kook
Author_Institution
Dept. of Inf. & Commun., Gwangju Inst. of Sci. & Technol., Gwangju
Volume
3
fYear
2008
fDate
13-15 Dec. 2008
Firstpage
51
Lastpage
54
Abstract
In this paper, we propose an entropy coding method to further compress quantized mel-frequency cepstral coefficients (MFCCs) extracted for distributed speech recognition (DSR). In the ETSI extended DSR standard, MFCCs are compressed with additional parameters such as pitch and voicing class. It is observed that the distribution of MFCCs varies according to the voicing class, thereby enabling the design of different Huffman trees for MFCCs according to voicing class. Based on this observation, we could further reduce the bit-rates of compressed MFCCs compared to the Huffman coding method that does not consider voicing class. Subsequent experiments show that the bit-rate of the proposed method is 34.18 bits per frame, which is 1.84 bits/frame lower than that of the Huffman coding method without voicing.
Keywords
Huffman codes; cepstral analysis; data compression; entropy codes; feature extraction; speech coding; speech recognition; trees (mathematics); Huffman tree; compressed front-end feature vector; distributed speech recognition; entropy coding; quantized mel-frequency cepstral coefficient; voicing class dependent Huffman coding; Cepstral analysis; Data mining; Discrete cosine transforms; Entropy coding; Feature extraction; Huffman coding; Mel frequency cepstral coefficient; Quantization; Speech recognition; Telecommunication standards; DSR; Huffman coding; MFCC;
fLanguage
English
Publisher
ieee
Conference_Titel
Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
Conference_Location
Sanya
Print_ISBN
978-1-4244-3430-5
Electronic_ISBN
978-0-7695-3546-3
Type
conf
DOI
10.1109/FGCNS.2008.44
Filename
4813546
Link To Document