DocumentCode
2017705
Title
Speech enhancement as a functional approximation and generalization
Author
Lu, Xugang ; Unoki, Masashi ; Isotani, Ryosuke ; Kawai, Hisashi ; Nakamura, Satoshi
Author_Institution
Nat. Inst. of Inf. & Commun. Technol., Japan
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
18
Lastpage
22
Abstract
Noise reduction is used to reduce the noise effect on speech, and is important for many real speech applications. However, noise reduction inevitably causes speech distortion. The trade-off between noise reduction and speech distortion is always a key concern in designing noise reduction algorithms. In this study, we took a new look at this problem, and regarded the speech estimation as a functional approximation problem which was concerned with the approximation error and generalization ability (or complexity). In order to get a good generalization ability, a regularization framework was adopted which gave a constraint on the approximation function with certain smoothness. Moreover, the approximation function was selected in a reproducing kernel Hilbert space (RKHS). By this selection, a nonlinear mapping function could be incorporated in the approximation function with the application of the kernel trick. This approximation could explore the nonlinear and high-order statistical structure of speech which was different from traditional methods that only explore the linear and low-order statistical information of speech. By real simulations, we showed that (1) incorporating nonlinearity in the mapping function could bring better representational ability of the approximation function, hence a better trade-off between noise reduction and speech distortion than that of using a linear mapping function, and (2) a better speech enhancement performance than that of a compared classical speech enhancement method on the basis of segmental signal to noise ratio improvement and log spectral distance measurement.
Keywords
Hilbert spaces; approximation theory; speech enhancement; statistical analysis; RKHS; approximation error; functional approximation; functional approximation problem; functional generalization; noise reduction; nonlinear mapping function; reproducing kernel Hilbert space; speech distortion; speech effect; speech enhancement; speech estimation; statistical structure; Approximation methods; Kernel; Noise; Noise measurement; Noise reduction; Speech; Speech enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684882
Filename
5684882
Link To Document