Title :
Multisensory speech enhancement in noisy environments using bone-conducted and air-conducted microphones
Author :
Mingzi Li ; Cohen, Israel ; Mousazadeh, Saman
Author_Institution :
Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
Abstract :
In this paper, we propose a speech enhancement algorithm for estimating the clean speech using samples of air-conducted and bone-conducted speech signals. We introduce a model in a supervised learning framework by approximating a mapping from concatenation of noisy air-conducted and bone-conducted speech to clean speech in the short time Fourier transform domain. Two function extension schemes are utilized: geometric harmonics and Laplacian pyramid. Performances obtained from the two schemes are evaluated and compared in terms of spectrograms and log spectral distance measures.
Keywords :
Fourier transforms; Laplace transforms; geometry; learning (artificial intelligence); microphones; sensor fusion; speech enhancement; Laplacian pyramid; air-conducted microphone; air-conducted speech signal; bone-conducted microphone; bone-conducted speech signal; clean speech estimation; geometric harmonics; log spectral distance measurement; multisensory speech enhancement; noisy environment; short time Fourier transform domain; spectrograms term; supervised learning framework; two function extension scheme; Harmonic analysis; Kernel; Laplace equations; Microphones; Noise measurement; Speech; Speech enhancement; Laplacian pyramid; Multisensory; bone-conducted microphone; geometric harmonics;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
DOI :
10.1109/ChinaSIP.2014.6889190