DocumentCode
178233
Title
Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement
Author
Barker, Trevor ; Virtanen, Tuomas ; Delhomme, Olivier
Author_Institution
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
fYear
2014
fDate
4-9 May 2014
Firstpage
2129
Lastpage
2133
Abstract
We present an extension to an existing speech enhancement technique, whereby the incorporation of easily obtained Doppler-based ultrasound data, obtained from frequency shifts caused by a talker´s mouth movements, is shown to improve speech enhancement results. Noisy speech mixtures were enhanced using semi-supervised nonnegative matrix factorisation (NMF). Ultrasound data recorded alongside the speech is transformed into the spectral domain and used additionally to audio in the mixture to be separated. Speech components are learned from a training set, whilst noise components are estimated from the mixture signal. We show that the ultrasound data can improve source-to-distortion ratios for the enhanced speech, relative to both the non-ultrasound NMF case and an established Wiener filter-based speech enhancement method.
Keywords
Wiener filters; matrix algebra; speech enhancement; NMF; Wiener filter; frequency shifts; mixture signal; noise components; noisy speech mixtures; source-to-distortion ratios; speech components; speech enhancement technique; ultrasound coupled semisupervised nonnegative matrix factorisation; ultrasound data; Acoustics; Dictionaries; Noise; Speech; Speech enhancement; Ultrasonic imaging; Acoustic Doppler Sensor; Nonnegative Matrix Factorisation; Source Separation; Ultrasound;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853975
Filename
6853975
Link To Document