Title :
Speech coding a new approach
Author_Institution :
CDAC, Kolkata, India
Abstract :
Text-to-speech synthesis, based on ESNOLA, uses signal dictionary having raw sound signals representing parts of phonemes. State-phase analysis for detection of voiced region along with detection of pitch also may be used for extraction of the most appropriate signal elements automatically from continuous speech in real time. The signal elements at the voiced zone are perceptual-pitch-periods. These signal are coded by simply inserting one information byte at the beginning of each element. The decoding is done using the information bit. The intervening signals are regenerated by linear estimation from the two perceptual-pitch-periods. This coding induces a ten-fold information reduction without significant loss of naturalness.
Keywords :
decoding; speech coding; speech synthesis; linear estimation; perceptual-pitch-periods; phonemes; raw sound signals; signal dictionary; speech coding; text-to-speech synthesis; Computer vision; Delay; Detection algorithms; Scattering; Signal analysis; Signal synthesis; Speech analysis; Speech coding; Speech synthesis; Time domain analysis;
Conference_Titel :
TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
Print_ISBN :
0-7803-8162-9
DOI :
10.1109/TENCON.2003.1273165