• DocumentCode
    2792026
  • Title
    Voice source estimation for artificial bandwidth extension of telephone speech
  • Author
    Thomas, Mark R P ; Gudnason, John ; Naylor, Patrick A. ; Geiser, Bernd ; Vary, Peter

  • Author_Institution
    Commun. & Signal Process. Group, Imperial Coll. London, London, UK
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4794
  • Lastpage
    4797
  • Abstract
    Artificial bandwidth extension (ABWE) of speech signals aims to estimate wideband speech (50 Hz - 7 kHz) from narrowband signals (300 Hz - 3.4 kHz). Applying the source-filter model of speech, many existing algorithms estimate vocal tract filter parameters independently of the source signal. However, many current methods for extending the narrowband voice source signal are limited to straightforward signal processing techniques which are only effective for high-band estimation. This paper presents a method for ABWE that employs novel data-driven modelling and an existing spectral mirroring technique to estimate the wideband source signal in both the high and low extension bands. A state-of-the-art Hidden Markov Model-based estimator evaluates the temporal and spectral envelopes in the missing frequency bands, with which the ABWE speech signal is synthesized. Informal listening tests comparing two existing source estimation techniques and two permutations of the proposed approach show an improvement in the perceived bandwidth of speech signals, in particular towards low frequencies. Subjective tests on the same data show a preference for the proposed techniques over the existing methods under test.
  • Keywords
    hidden Markov models; speech enhancement; telephony; artificial bandwidth extension; data driven modelling; hidden Markov model; source estimation techniques; spectral mirroring technique; speech signal; telephone speech; voice source estimation; voice source modelling; wideband source signal; Bandwidth; Filters; Frequency estimation; Narrowband; Signal processing algorithms; Speech synthesis; Testing; Wideband
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2010.5495149
  • Filename
    5495149