• DocumentCode
    2635385
  • Title

    Speech coding: a new approach

  • Author

    Mandal, S.K.D.

  • Author_Institution
    CDAC, Kolkata, India
  • Volume
    4
  • fYear
    2003
  • fDate
    15-17 Oct. 2003
  • Firstpage
    1483
  • Abstract
    Text-to-speech synthesis based on ESNOLA uses a signal dictionary of raw sound signals representing parts of phonemes. State-phase analysis for detecting voiced regions, together with pitch detection, may also be used to extract the most appropriate signal elements automatically from continuous speech in real time. The signal elements in the voiced zone are perceptual-pitch-periods. These signals are coded by simply inserting one information byte at the beginning of each element. The decoding is done using the information bit. The intervening signals are regenerated by linear estimation from the two perceptual-pitch-periods. This coding induces a ten-fold information reduction without significant loss of naturalness.
  • Keywords
    decoding; speech coding; speech synthesis; linear estimation; perceptual-pitch-periods; phonemes; raw sound signals; signal dictionary; text-to-speech synthesis; Computer vision; Delay; Detection algorithms; Scattering; Signal analysis; Signal synthesis; Speech analysis; Time domain analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
  • Print_ISBN
    0-7803-8162-9
  • Type

    conf

  • DOI
    10.1109/TENCON.2003.1273165
  • Filename
    1273165
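
The abstract states that the signals between two stored perceptual-pitch-periods are regenerated by linear estimation, but this record does not say how. The following is only a minimal sketch of one possible reading: sample-wise linear interpolation between two periods. The function name `interpolate_periods`, the equal-length requirement, and the evenly spaced weights are assumptions for illustration, not the paper's method. The content of the information byte is not described here, so it is not modelled.

```python
def interpolate_periods(first, last, n_between):
    """Regenerate n_between intermediate periods between two stored periods.

    Assumption (not from the paper): both periods have the same number of
    samples, and intermediate periods are evenly spaced linear blends.
    """
    if len(first) != len(last):
        raise ValueError("this sketch assumes equal-length periods")
    if n_between < 0:
        raise ValueError("n_between must be non-negative")

    regenerated = []
    for k in range(1, n_between + 1):
        # Weight moves from near 0 (close to `first`) to near 1 (close to `last`).
        w = k / (n_between + 1)
        regenerated.append([(1 - w) * a + w * b for a, b in zip(first, last)])
    return regenerated


if __name__ == "__main__":
    stored_a = [0.0, 2.0]  # hypothetical two-sample "period", for illustration only
    stored_b = [4.0, 6.0]
    print(interpolate_periods(stored_a, stored_b, 1))
```

Under this reading, only the two end periods need to be stored and the periods between them are rebuilt at decode time, which is the kind of mechanism that could yield the information reduction the abstract reports.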