DocumentCode :
1617158
Title :
Harmonic-plus-noise decomposition and its application in voiced/unvoiced classification
Author :
Ahn, Raphael ; Holmes, W. Harvey
Author_Institution :
Sch. of Electr. Eng., New South Wales Univ., Kensington, NSW, Australia
Volume :
2
fYear :
1997
Firstpage :
587
Abstract :
In this paper, we present an improved algorithm to decompose the harmonic and the noise components of voiced speech. The improvements make the method more accurate and robust by employing a harmonic extrapolation and a noise extrapolation in alternating iterative steps and by including a new pitch detection algorithm. This new technique has been found to improve both the convergence and accuracy of separation of the harmonic and the noise components. In separating the noise and the harmonic components, this improved harmonic-plus-noise (H+N) decomposition method provides many useful ways to measure the strength of voicing. Two such measures are investigated with respect to their ability to discern voiced and unvoiced segments of speech. They are the harmonic-to-noise energy ratio and the sub-band harmonic-to-noise energy ratio. Tests show that these measures perform more reliably and more robustly in comparison to classical measures such as the zero-crossing rate, the LPC prediction gain, the 1st LP coefficient and the RMS energy.
Keywords :
acoustic noise; extrapolation; harmonic analysis; pattern classification; speech recognition; convergence; harmonic component; harmonic extrapolation; harmonic-plus-noise decomposition; harmonic-to-noise energy ratio; iterative steps; noise component; noise extrapolation; pitch detection algorithm; sub-band harmonic-to-noise energy ratio; unvoiced classification; unvoiced segments; voiced classification; voiced segments; voicing; Convergence; Detection algorithms; Energy measurement; Extrapolation; Gain measurement; Iterative algorithms; Iterative methods; Noise measurement; Noise robustness; Speech enhancement;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-7803-4365-4
Type :
conf
DOI :
10.1109/TENCON.1997.648274
Filename :
648274