• DocumentCode
    178233
  • Title

    Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement

  • Author

    Barker, Trevor ; Virtanen, Tuomas ; Delhomme, Olivier

  • Author_Institution
    Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    2129
  • Lastpage
    2133
  • Abstract
    We present an extension to an existing speech enhancement technique, whereby the incorporation of easily obtained Doppler-based ultrasound data, obtained from frequency shifts caused by a talker´s mouth movements, is shown to improve speech enhancement results. Noisy speech mixtures were enhanced using semi-supervised nonnegative matrix factorisation (NMF). Ultrasound data recorded alongside the speech is transformed into the spectral domain and used additionally to audio in the mixture to be separated. Speech components are learned from a training set, whilst noise components are estimated from the mixture signal. We show that the ultrasound data can improve source-to-distortion ratios for the enhanced speech, relative to both the non-ultrasound NMF case and an established Wiener filter-based speech enhancement method.
  • Keywords
    Wiener filters; matrix algebra; speech enhancement; NMF; Wiener filter; frequency shifts; mixture signal; noise components; noisy speech mixtures; source-to-distortion ratios; speech components; speech enhancement technique; ultrasound coupled semisupervised nonnegative matrix factorisation; ultrasound data; Acoustics; Dictionaries; Noise; Speech; Speech enhancement; Ultrasonic imaging; Acoustic Doppler Sensor; Nonnegative Matrix Factorisation; Source Separation; Ultrasound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853975
  • Filename
    6853975