DocumentCode :
353516
Title :
Improving the robustness of wavelet transform for epoch detection
Author :
Lam, Y.Y. ; Luk, R.W.P. ; Chung, F.L.
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ., China
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1303
Abstract :
This paper investigates (1) the robustness of epoch detection (i.e. identification of glottal closure) by the wavelet transform and (2) the methods to improve its robustness. We achieved a similar identification performance (2% error rate) to earlier investigation using the spline wavelet transform, under Gaussian noise degradation. However, the performance under other types of noise degradation, such as periodic noise (e.g. traffic lights) and short noise (e.g. keyboard noise), is not as robust as before. The scale matching technique could not secure good performance because the spline wavelet has poor recall performance. We explored the use of the Gaussian wavelet transform. Instead of scale matching, a single level is used and the recall of epochs associated with the nearest laryngograph differences by the Gaussian wavelet is about 30% more than by the spline wavelet, across different types of noise degradation. However, the spline wavelet has less false alarm (29% on average) in identification and the peaks correspond well to epoch positions (with less [standard] deviation). We evaluated detection schemes using both scalograms of Gaussian and spline wavelets and achieved improvement of recall (26%), with a relative position consistency of 1.4 ms
Keywords :
Gaussian noise; speech processing; splines (mathematics); wavelet transforms; Gaussian noise degradation; Gaussian wavelet transform; detection schemes; epoch detection; glottal closure; identification performance; laryngograph differences; noise degradation; periodic noise; robustness; scale matching technique; short noise; spline wavelet; wavelet transform; Degradation; Error analysis; Frequency; Gaussian noise; Low-frequency noise; Noise robustness; Speech; Spline; Wavelet transforms; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.861817
Filename :
861817
Link To Document :
بازگشت