• DocumentCode
    1859652
  • Title

    Wavelet-based voiced/unvoiced classification algorithm

  • Author

    Jafer, E. ; Mahdi, A.E.

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Limerick Univ., Ireland
  • Volume
    2
  • fYear
    2003
  • fDate
    2-5 July 2003
  • Firstpage
    667
  • Abstract
    A new wavelet-based algorithm for classification of speech into voiced and unvoiced segments is presented. The algorithm is based on statistical analysis of the frequency distribution of the average energy in the wavelet domain, and on the short-time zero-crossing rate of the speech signal. First, the ratio of the average energy in the wavelet low-bands to that in the wavelet highest-band for each speech segment is computed using a 4-level dyadic wavelet transform, and compared to a predetermined threshold. This is followed by measuring the zero-crossing rate of the segment and comparing it to a threshold equal to the median of the zero-crossing rates. An experimentally verified criterion based on the above two comparison processes is then applied to obtain the voicing decision. The performance of the algorithm has been evaluated using a large speech database. The algorithm is shown to perform well in the cases of both clean and noise-degraded speech.
  • Keywords
    discrete wavelet transforms; speech processing; statistical analysis; 4-level dyadic wavelet transform; algorithm performance evaluation; average energy frequency distribution; average energy ratio; noise-degraded speech; predetermined threshold; short-time zero-crossing rate median; speech classification; speech database; speech processing; speech signal; statistical analysis; voiced/unvoiced classification algorithm; voicing decision; wavelet domain; wavelet highest-band; wavelet low-band; wavelet-based algorithm; Classification algorithms; Discrete wavelet transforms; Frequency; Multimedia databases; Multiresolution analysis; Speech analysis; Speech enhancement; Speech processing; Wavelet analysis; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Video/Image Processing and Multimedia Communications, 2003. 4th EURASIP Conference focused on
  • Print_ISBN
    953-184-054-7
  • Type

    conf

  • DOI
    10.1109/VIPMC.2003.1220540
  • Filename
    1220540