DocumentCode
1617158
Title
Harmonic-plus-noise decomposition and its application in voiced/unvoiced classification
Author
Ahn, Raphael ; Holmes, W. Harvey
Author_Institution
Sch. of Electr. Eng., New South Wales Univ., Kensington, NSW, Australia
Volume
2
fYear
1997
Firstpage
587
Abstract
In this paper, we present an improved algorithm to decompose voiced speech into its harmonic and noise components. The improvements make the method more accurate and robust by employing a harmonic extrapolation and a noise extrapolation in alternating iterative steps and by including a new pitch detection algorithm. This new technique has been found to improve both the convergence and the accuracy of the separation of the harmonic and noise components. By separating the noise and harmonic components, this improved harmonic-plus-noise (H+N) decomposition method provides many useful ways to measure the strength of voicing. Two such measures are investigated with respect to their ability to discern voiced and unvoiced segments of speech: the harmonic-to-noise energy ratio and the sub-band harmonic-to-noise energy ratio. Tests show that these measures perform more reliably and more robustly than classical measures such as the zero-crossing rate, the LPC prediction gain, the first LP coefficient and the RMS energy.
Keywords
acoustic noise; extrapolation; harmonic analysis; pattern classification; speech recognition; convergence; harmonic component; harmonic extrapolation; harmonic-plus-noise decomposition; harmonic-to-noise energy ratio; iterative steps; noise component; noise extrapolation; pitch detection algorithm; sub-band harmonic-to-noise energy ratio; unvoiced classification; unvoiced segments; voiced classification; voiced segments; voicing; Convergence; Detection algorithms; Energy measurement; Extrapolation; Gain measurement; Iterative algorithms; Iterative methods; Noise measurement; Noise robustness; Speech enhancement
fLanguage
English
Publisher
IEEE
Conference_Titel
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location
Brisbane, Qld.
Print_ISBN
0-7803-4365-4
Type
conf
DOI
10.1109/TENCON.1997.648274
Filename
648274