• DocumentCode
    3652271
  • Title

    Efficient representation of short-time phase based on group delay

  • Author

    H. Banno; Jinlin Lu;S. Nakamura;K. Shikano;H. Kawahara

  • Author_Institution
    Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
  • Volume
    2
  • fYear
    1998
  • Firstpage
    861
  • Abstract
    An efficient representation of short-time phase characteristics of speech sounds is proposed, based on findings which suggest the perceptual importance of phase characteristics. Subjective tests indicated that the synthesized speech sounds by the proposed method are indistinguishable from the original speech sounds with a moderate data compression. The proposed representation uses lower-order coefficients of the inverse Fourier transform of the group delay of speech. It also alleviates the voiced/unvoiced decision, which is an indispensable part in conventional speech coding algorithms. These features make our method potentially very useful in many applications like speech morphing.
  • Keywords
    "Speech synthesis","Speech coding","Linear predictive coding","Speech analysis","Delay estimation","Bit rate","Delay effects","Information science","Acoustical engineering","Systems engineering and theory"
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675401
  • Filename
    675401