• DocumentCode
    1498935
  • Title

    Application of wavelet transforms for C/V segmentation on Mandarin speech signals

  • Author

    Chen, S.H. ; Wang, J.F.

  • Author_Institution
    Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    148
  • Issue
    2
  • fYear
    2001
  • fDate
    4/1/2001 12:00:00 AM
  • Firstpage
    133
  • Lastpage
    139
  • Abstract
    It has been demonstrated that wavelet transforms can be developed to find the C/V segmentation point of a Mandarin speech signal. The basic idea is the utilisation of a specific function, the product function, for indicating the C/V segmentation point. Based on the wavelet transforms, the product function is generated from the appropriate approximation signal and detail signal of the input speech, and its energy profile contains the evidence for detecting the C/V segmentation point. It is shown that the C/V segmentation point can be obtained directly using of the product function and its energy profile. The main advantage of the proposed scheme is the capability of forward and directly searching for the C/V segmentation point, and there is no need to set any predetermined threshold. Thus, the pitch detector and backward-processing required in the conventional C/V segmentation algorithm are completely avoided. The analysis of the proposed algorithm on various types of Mandarin speech indicates considerable improvement over the conventional method. Experiments show that the overall accuracy rate of the proposed method reaches 95.4%
  • Keywords
    natural languages; signal resolution; speech processing; wavelet transforms; C/V segmentation algorithm; Mandarin speech signals segmentation; accuracy rate; approximation signal; backward-processing; energy profile; input speech; multiresolution analysis; pitch detector; product function; wavelet transforms;
  • fLanguage
    English
  • Journal_Title
    Vision, Image and Signal Processing, IEE Proceedings -
  • Publisher
    iet
  • ISSN
    1350-245X
  • Type

    jour

  • DOI
    10.1049/ip-vis:20010151
  • Filename
    926836