• DocumentCode
    417195
  • Title

    Automatically derived units for segment vocoders

  • Author

    Ramasubramanian, V. ; Sreenivas, T.V.

  • Author_Institution
    Dept. of Electr. Commun. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Segment vocoders play a special role in very low bitrate speech coding to achieve intelligible speech at bitrates of ∼300 bits/sec. We explore the definition and use of automatically derived units for segment quantization in segment vocoders. We consider three automatic segmentation techniques, namely, spectral transition measures (STM), maximum-likelihood (ML) segmentation (unconstrained) and duration-constrained ML segmentation, towards defining diphone-like and phone-like units. We show that the ML segmentations realize phone-like units which are significantly better than those obtained by STM in terms of match accuracy with TIMIT phone segmentation as well as actual vocoder performance measured in terms of segmental SNR. Moreover, the phone-like units of ML segmentations also outperform the diphone-like units obtained using STM in early vocoders. We also show that the segment vocoder can operate at very high intelligibility when used in a single-speaker mode.
  • Keywords
    quantisation (signal); speech coding; vocoders; TIMIT phone segmentation; automatically derived units; diphone-like units; duration-constrained ML segmentation; intelligible speech; maximum-likelihood segmentation; phone-like units; segment quantization; segment vocoders; single-speaker mode; spectral transition measures; speech coding; unconstrained ML segmentation; Bit rate; Hidden Markov models; Impedance matching; Maximum likelihood decoding; Quantization; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326025
  • Filename
    1326025