DocumentCode
2635385
Title
Speech coding: a new approach
Author
Mandal, S.K.D.
Author_Institution
CDAC, Kolkata, India
Volume
4
fYear
2003
fDate
15-17 Oct. 2003
Firstpage
1483
Abstract
Text-to-speech synthesis based on ESNOLA uses a signal dictionary of raw sound signals representing parts of phonemes. State-phase analysis for detecting voiced regions, together with pitch detection, may also be used to extract the most appropriate signal elements automatically from continuous speech in real time. The signal elements in the voiced zone are perceptual-pitch-periods. These signals are coded by simply inserting one information byte at the beginning of each element. The decoding is done using the information bit. The intervening signals are regenerated by linear estimation from the two perceptual-pitch-periods. This coding yields a ten-fold information reduction without significant loss of naturalness.
Keywords
decoding; speech coding; speech synthesis; linear estimation; perceptual-pitch-periods; phonemes; raw sound signals; signal dictionary; text-to-speech synthesis; Computer vision; Delay; Detection algorithms; Scattering; Signal analysis; Signal synthesis; Speech analysis; Speech coding; Speech synthesis; Time domain analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
Print_ISBN
0-7803-8162-9
Type
conf
DOI
10.1109/TENCON.2003.1273165
Filename
1273165