Title :
Waveform speech coding using multiscale recurrent patterns
Author :
Pinage, Frederico S. ; Feio, Lara C R L ; da Silva, Eduardo A B ; Netto, Sergio L.
Author_Institution :
PEE-COPPE, Fed. Univ. of Rio de Janeiro, Rio de Janeiro, Brazil
Date :
May 30 2010-June 2 2010
Abstract :
This paper revisits the waveform paradigm for coding speech signals, using a multiscale recurrent-pattern matching approach. The so-called MMP (Multidimensional Multiscale Parser) algorithm uses a dictionary that is constantly updated with expansions, contractions, and concatenations of previously encoded segments. This gives the MMP a learning ability that is particularly suited to coding voiced and silent segments of speech. Additional features (nonuniform and auxiliary displacement dictionaries) are considered in order to adjust the MMP learning mechanism to the speech coding problem. The current MMP algorithm achieves a fair-to-good objective score when operating at 8 kbps, as indicated by several speech-coding experiments. This indicates that it may be worthwhile to investigate further the use of the multiscale recurrent pattern matching paradigm for speech coding.
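The dictionary update described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `rescale` and `update_dictionary`, the choice of linear interpolation for expansion and contraction, and the set of scales are all assumptions made for illustration only.

```python
def rescale(segment, new_len):
    """Stretch or shrink `segment` to `new_len` samples.

    Linear interpolation is an assumption; the abstract only says that
    segments are expanded and contracted, not how.
    """
    n = len(segment)
    if n == 1:
        return [float(segment[0])] * new_len
    if new_len == 1:
        return [sum(segment) / n]
    out = []
    for i in range(new_len):
        pos = i * (n - 1) / (new_len - 1)  # position in the source segment
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        out.append((1 - frac) * segment[lo] + frac * segment[hi])
    return out


def update_dictionary(dictionary, left, right, scales=(2, 4, 8)):
    """Concatenate two previously encoded segments and add the result,
    rescaled to every scale, to the per-scale dictionary.

    `dictionary` maps a segment length to a list of tuples (hypothetical layout).
    """
    joined = list(left) + list(right)
    for length in scales:
        candidate = tuple(rescale(joined, length))
        if candidate not in dictionary[length]:  # skip duplicates
            dictionary[length].append(candidate)
    return dictionary
```

Calling `update_dictionary({2: [], 4: [], 8: []}, [0.0, 1.0], [2.0, 3.0])` adds one candidate pattern at each of the three scales, so a pattern learned at one segment length becomes available for matching at the others.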
Keywords :
pattern matching; speech coding; MMP algorithm; multidimensional multiscale parser; multiscale recurrent patterns; speech signals; waveform speech coding; Codecs; Dictionaries; Encoding; Humans; Image segmentation; Multidimensional systems; Pattern matching; Signal processing; Speech analysis; Speech coding;
Conference_Title :
Circuits and Systems (ISCAS), Proceedings of 2010 IEEE International Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-5308-5
Electronic_ISBN :
978-1-4244-5309-2
DOI :
10.1109/ISCAS.2010.5537982