DocumentCode
353516
Title
Improving the robustness of wavelet transform for epoch detection
Author
Lam, Y.Y. ; Luk, R.W.P. ; Chung, F.L.
Author_Institution
Dept. of Comput., Hong Kong Polytech. Univ., China
Volume
3
fYear
2000
fDate
2000
Firstpage
1303
Abstract
This paper investigates (1) the robustness of epoch detection (i.e. identification of glottal closure) by the wavelet transform and (2) the methods to improve its robustness. We achieved a similar identification performance (2% error rate) to earlier investigation using the spline wavelet transform, under Gaussian noise degradation. However, the performance under other types of noise degradation, such as periodic noise (e.g. traffic lights) and short noise (e.g. keyboard noise), is not as robust as before. The scale matching technique could not secure good performance because the spline wavelet has poor recall performance. We explored the use of the Gaussian wavelet transform. Instead of scale matching, a single level is used and the recall of epochs associated with the nearest laryngograph differences by the Gaussian wavelet is about 30% more than by the spline wavelet, across different types of noise degradation. However, the spline wavelet has less false alarm (29% on average) in identification and the peaks correspond well to epoch positions (with less [standard] deviation). We evaluated detection schemes using both scalograms of Gaussian and spline wavelets and achieved improvement of recall (26%), with a relative position consistency of 1.4 ms
Keywords
Gaussian noise; speech processing; splines (mathematics); wavelet transforms; Gaussian noise degradation; Gaussian wavelet transform; detection schemes; epoch detection; glottal closure; identification performance; laryngograph differences; noise degradation; periodic noise; robustness; scale matching technique; short noise; spline wavelet; wavelet transform; Degradation; Error analysis; Frequency; Gaussian noise; Low-frequency noise; Noise robustness; Speech; Spline; Wavelet transforms; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.861817
Filename
861817
Link To Document