DocumentCode
1498935
Title
Application of wavelet transforms for C/V segmentation on Mandarin speech signals
Author
Chen, S.H. ; Wang, J.F.
Author_Institution
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume
148
Issue
2
fYear
2001
fDate
4/1/2001 12:00:00 AM
Firstpage
133
Lastpage
139
Abstract
It has been demonstrated that wavelet transforms can be developed to find the C/V segmentation point of a Mandarin speech signal. The basic idea is the utilisation of a specific function, the product function, for indicating the C/V segmentation point. Based on the wavelet transforms, the product function is generated from the appropriate approximation signal and detail signal of the input speech, and its energy profile contains the evidence for detecting the C/V segmentation point. It is shown that the C/V segmentation point can be obtained directly using of the product function and its energy profile. The main advantage of the proposed scheme is the capability of forward and directly searching for the C/V segmentation point, and there is no need to set any predetermined threshold. Thus, the pitch detector and backward-processing required in the conventional C/V segmentation algorithm are completely avoided. The analysis of the proposed algorithm on various types of Mandarin speech indicates considerable improvement over the conventional method. Experiments show that the overall accuracy rate of the proposed method reaches 95.4%
Keywords
natural languages; signal resolution; speech processing; wavelet transforms; C/V segmentation algorithm; Mandarin speech signals segmentation; accuracy rate; approximation signal; backward-processing; energy profile; input speech; multiresolution analysis; pitch detector; product function; wavelet transforms;
fLanguage
English
Journal_Title
Vision, Image and Signal Processing, IEE Proceedings -
Publisher
iet
ISSN
1350-245X
Type
jour
DOI
10.1049/ip-vis:20010151
Filename
926836
Link To Document