• DocumentCode
    1688101
  • Title

    Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra

  • Author

    Turan, M. A. Tugtekin; Erzin, E.

  • Author_Institution
    Multimedia, Vision & Graphics Lab., Koc Univ., Istanbul, Turkey
  • fYear
    2013
  • Firstpage
    7049
  • Lastpage
    7053
  • Abstract
    We investigate the spectral envelope mapping problem through joint analysis of throat- and acoustic-microphone recordings to enhance throat-microphone speech. We propose a new phone-dependent GMM-based spectral envelope mapping scheme that performs minimum mean square error (MMSE) estimation of the acoustic-microphone spectral envelope. The proposed mapping scheme is compared to the state-of-the-art GMM-based estimator using both objective and subjective evaluations. Objective evaluations use the log-spectral distortion (LSD) and wideband perceptual evaluation of speech quality (PESQ) metrics. Subjective evaluations use an A/B pair comparison listening test. Both objective and subjective evaluations show that the proposed phone-dependent mapping consistently improves performance over the state-of-the-art GMM estimator.
  • Keywords
    Gaussian processes; learning (artificial intelligence); least mean squares methods; speech enhancement; A-B pair comparison listening test; LSD; MMSE estimation; PESQ metrics; acoustic-microphone recordings; learning phone-dependent mappings; log-spectral distortion; minimum mean square error estimation; objective evaluations; phone-dependent GMM; spectral envelope mapping problem; speech quality metrics; speech spectra; subjective evaluations; throat microphone recordings; wideband perceptual evaluation; Acoustics; Microphones; Robustness; Speech; Speech recognition; spectral envelope estimation; throat-microphone
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639029
  • Filename
    6639029