• DocumentCode
    3783704
  • Title

    Diphone-like units without phonemes - option for very low bit rate speech coding

  • Author

    P. Motlicek;G. Baudoin;J. Cernocky

  • Author_Institution
    Inst. of Radioelectron., Tech. Univ. Brno, Czech Republic
  • Volume
    2
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    463
  • Abstract
    The aim of our effort is to reach higher quality of the resulting speech coded by a very low bit rate (VLBR) segmental coder. The basic units are found automatically in a training database using temporal decomposition and vector quantization. They are modeled by HMM. Then two methods of re-segmentation are used in order to find new longer units. In the first approach borders are set to the centers of previous units. In the second, borders are fixed to the centers of middle HMM states of previous units. The number of frames in new units is conditioned to be bigger than a fixed constant. Hence, new units can consist of several previous segments. Decreasing transition noise of the resultant speech was obtained using these techniques.
  • Keywords
    "Bit rate","Speech coding","Databases","Speech synthesis","Decoding","Speech recognition","Automatic speech recognition","Hidden Markov models","Speech enhancement","Vocoders"
  • Publisher
    ieee
  • Conference_Titel
    EUROCON´2001, Trends in Communications, International Conference on.
  • Print_ISBN
    0-7803-6490-2
  • Type

    conf

  • DOI
    10.1109/EURCON.2001.938162
  • Filename
    938162