DocumentCode
1859652
Title
Wavelet-based voiced/unvoiced classification algorithm
Author
Jafer, E. ; Mahdi, A.E.
Author_Institution
Department of Electronic and Computer Engineering, University of Limerick, Ireland
Volume
2
fYear
2003
fDate
2-5 July 2003
Firstpage
667
Abstract
A new wavelet-based algorithm for classification of speech into voiced and unvoiced segments is presented. The algorithm is based on statistical analysis of the frequency distribution of the average energy in the wavelet domain, and on the short-time zero-crossing rate of the speech signal. First, the ratio of the average energy in the wavelet low-bands to that in the wavelet highest-band for each speech segment is computed using a 4-level dyadic wavelet transform, and compared to a predetermined threshold. This is followed by measuring the zero-crossing rate of the segment and comparing it to a threshold equal to the median of the zero-crossing rates. An experimentally verified criterion based on the above two comparison processes is then applied to obtain the voicing decision. The performance of the algorithm has been evaluated using a large speech database. The algorithm is shown to perform well in the cases of both clean and noise-degraded speech.
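The two-stage decision described in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not name the wavelet, the threshold value, the exact definition of the "low-bands", or the final combination rule. Here a Haar wavelet is assumed, the low bands are taken to be every band other than the finest detail band, `ratio_threshold=1.0` is a placeholder, and a frame is labelled voiced only when both tests agree. All function names are hypothetical.

```python
import math
import statistics


def haar_dwt_step(x):
    """One level of a Haar DWT (assumed wavelet): returns (approximation, detail)."""
    n = len(x) - (len(x) % 2)  # drop a trailing odd sample
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, n, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, n, 2)]
    return approx, detail


def dyadic_bands(x, levels=4):
    """Return [d1, ..., dL, aL]; d1 is the highest-frequency band."""
    bands = []
    a = list(x)
    for _ in range(levels):
        a, d = haar_dwt_step(a)
        bands.append(d)
    bands.append(a)
    return bands


def avg_energy(coeffs):
    """Mean squared value of a coefficient list (0.0 if empty)."""
    return sum(v * v for v in coeffs) / len(coeffs) if coeffs else 0.0


def energy_ratio(frame, levels=4):
    """Average energy in the low bands divided by that in the highest band.

    Assumption: 'low bands' means all bands except d1.
    """
    bands = dyadic_bands(frame, levels)
    high = avg_energy(bands[0])
    low_bands = bands[1:]
    low = sum(avg_energy(b) for b in low_bands) / len(low_bands)
    return low / (high + 1e-12)  # small constant avoids division by zero


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def classify(frames, ratio_threshold=1.0):
    """Label each frame 'voiced' or 'unvoiced'.

    The ZCR threshold is the median ZCR over the supplied frames, as in the
    abstract; ratio_threshold and the AND rule are placeholder assumptions.
    """
    zcrs = [zero_crossing_rate(f) for f in frames]
    zcr_threshold = statistics.median(zcrs)
    labels = []
    for frame, zcr in zip(frames, zcrs):
        voiced = energy_ratio(frame) > ratio_threshold and zcr <= zcr_threshold
        labels.append("voiced" if voiced else "unvoiced")
    return labels
```

In practice the ratio threshold would have to be tuned on labelled speech, and a wavelet other than Haar may suit speech better; both choices here are only stand-ins.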
Keywords
discrete wavelet transforms; speech processing; statistical analysis; 4-level dyadic wavelet transform; algorithm performance evaluation; average energy frequency distribution; average energy ratio; noise-degraded speech; predetermined threshold; short-time zero-crossing rate median; speech classification; speech database; speech signal; voiced/unvoiced classification algorithm; voicing decision; wavelet domain; wavelet highest-band; wavelet low-band; wavelet-based algorithm; Classification algorithms; Discrete wavelet transforms; Frequency; Multimedia databases; Multiresolution analysis; Speech analysis; Speech enhancement; Speech processing; Wavelet analysis; Wavelet transforms
fLanguage
English
Publisher
IEEE
Conference_Titel
4th EURASIP Conference focused on Video/Image Processing and Multimedia Communications, 2003
Print_ISBN
953-184-054-7
Type
conf
DOI
10.1109/VIPMC.2003.1220540
Filename
1220540