Speech coding a new approach

Author

Mandal, S.K.D.

Author_Institution

CDAC, Kolkata, India

Volume

fYear

2003

fDate

15-17 Oct. 2003

Firstpage

1483

Abstract

Text-to-speech synthesis based on ESNOLA uses a signal dictionary of raw sound signals representing parts of phonemes. State-phase analysis for detecting voiced regions, together with pitch detection, may also be used to extract the most appropriate signal elements automatically from continuous speech in real time. In the voiced zone, the signal elements are perceptual-pitch-periods. These signals are coded by simply inserting one information byte at the beginning of each element, and decoding uses this inserted information. The intervening signals are regenerated by linear estimation from the two perceptual-pitch-periods. This coding yields a ten-fold information reduction without significant loss of naturalness.
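
The coding idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `encode` and `decode`, the meaning given to the information byte (the number of omitted periods that follow a kept element), the fixed `step` between kept periods, and the requirement that all pitch periods have equal length are assumptions made only for illustration. Linear estimation is modelled here as sample-wise linear interpolation between the two kept periods on either side of a gap.

```python
def encode(periods, step):
    """Keep every `step`-th pitch period and prefix it with one header value.

    Assumption: the header (0..255, i.e. one byte) stores how many omitted
    periods follow this element before the next kept one.
    """
    if not periods:
        raise ValueError("need at least one pitch period")
    if not 1 <= step <= 256:
        raise ValueError("gap must fit in one byte")
    if (len(periods) - 1) % step != 0:
        raise ValueError("last period must fall on a kept position")
    kept = periods[::step]
    elements = []
    for i, samples in enumerate(kept):
        # The final kept element has nothing after it, so its gap is 0.
        gap = step - 1 if i < len(kept) - 1 else 0
        elements.append((gap, list(samples)))
    return elements


def decode(elements):
    """Rebuild the period sequence, interpolating the omitted periods."""
    out = []
    for (gap, a), (_, b) in zip(elements, elements[1:]):
        out.append(list(a))
        for k in range(1, gap + 1):
            w = k / (gap + 1)  # position of the omitted period between a and b
            out.append([(1 - w) * x + w * y for x, y in zip(a, b)])
    if elements:
        out.append(list(elements[-1][1]))
    return out
```

Under these assumptions, `decode(encode(periods, step))` returns the same number of periods that went in, and it reproduces them exactly only when the signal really does change linearly from one kept period to the next. A larger `step` stores fewer periods; the ten-fold figure quoted in the abstract is the paper's own result and is not something this sketch demonstrates.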

Keywords

decoding; speech coding; speech synthesis; linear estimation; perceptual-pitch-periods; phonemes; raw sound signals; signal dictionary; text-to-speech synthesis; Computer vision; Delay; Detection algorithms; Scattering; Signal analysis; Signal synthesis; Speech analysis; Time domain analysis

fLanguage

English

Publisher

IEEE

Conference_Title

TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region

Print_ISBN

0-7803-8162-9

Type

conf

DOI

10.1109/TENCON.2003.1273165

Filename

1273165

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2635385