• DocumentCode
    2177696
  • Title

    Speech bandwidth extension using Gaussian mixture model-based estimation of the highband mel spectrum

  • Author

    Pulakka, Hannu ; Remes, Ulpu ; Palomäki, Kalle ; Kurimo, Mikko ; Alku, Paavo

  • Author_Institution
    Dept. of Signal Process. & Acoust., Aalto Univ., Aalto, Finland
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5100
  • Lastpage
    5103
  • Abstract
    The quality and intelligibility of narrowband telephone speech can be enhanced by artificial bandwidth extension. This study combines Gaussian mixture model-based (GMM) mel spectrum extension with a filter bank implementation for generating the missing spectral content in the highband at 4-8 kHz. The narrowband mel spectrum is calculated from input speech and the GMM is used to estimate the mel spectrum in the highband. An excitation signal for the highband is generated as a combination of upsampled linear prediction residual and modulated noise. The excitation is divided into sub-bands that are weighted and summed to realize the estimated mel spectrum. The bandwidth-extended output is obtained as the sum of the artificial highband signal and narrowband speech. Listening tests indicate that this method is preferred over narrowband speech and over a previously presented artificial bandwidth extension method which is implemented in some mobile phone models.
  • Keywords
    Gaussian processes; speech processing; GMM; Gaussian mixture model-based estimation; artificial highband signal; frequency 4 kHz to 8 kHz; highband mel spectrum; mobile phone models; narrowband speech; speech bandwidth extension; Mathematical model; Narrowband; Niobium; Speech; Speech processing; Wideband; Gaussian mixture model; bandwidth extension; mel spectrum; speech enhancement; speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947504
  • Filename
    5947504