• DocumentCode
    1617158
  • Title

    Harmonic-plus-noise decomposition and its application in voiced/unvoiced classification

  • Author

    Ahn, Raphael ; Holmes, W. Harvey

  • Author_Institution
    Sch. of Electr. Eng., New South Wales Univ., Kensington, NSW, Australia
  • Volume
    2
  • fYear
    1997
  • Firstpage
    587
  • Abstract
    In this paper, we present an improved algorithm for decomposing voiced speech into its harmonic and noise components. The improvements make the method more accurate and robust by employing harmonic extrapolation and noise extrapolation in alternating iterative steps and by including a new pitch detection algorithm. This new technique has been found to improve both the convergence and the accuracy of the separation of the harmonic and noise components. Because it separates the noise and harmonic components, the improved harmonic-plus-noise (H+N) decomposition method provides many useful ways to measure the strength of voicing. Two such measures, the harmonic-to-noise energy ratio and the sub-band harmonic-to-noise energy ratio, are investigated with respect to their ability to distinguish voiced from unvoiced segments of speech. Tests show that these measures perform more reliably and robustly than classical measures such as the zero-crossing rate, the LPC prediction gain, the first LP coefficient and the RMS energy.
  • Keywords
    acoustic noise; extrapolation; harmonic analysis; pattern classification; speech recognition; convergence; harmonic component; harmonic extrapolation; harmonic-plus-noise decomposition; harmonic-to-noise energy ratio; iterative steps; noise component; noise extrapolation; pitch detection algorithm; sub-band harmonic-to-noise energy ratio; unvoiced classification; unvoiced segments; voiced classification; voiced segments; voicing; Convergence; Detection algorithms; Energy measurement; Extrapolation; Gain measurement; Iterative algorithms; Iterative methods; Noise measurement; Noise robustness; Speech enhancement
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
  • Conference_Location
    Brisbane, Qld.
  • Print_ISBN
    0-7803-4365-4
  • Type

    conf

  • DOI
    10.1109/TENCON.1997.648274
  • Filename
    648274