DocumentCode
2792026
Title
Voice source estimation for artificial bandwidth extension of telephone speech
Author
Thomas, Mark R P ; Gudnason, John ; Naylor, Patrick A. ; Geiser, Bernd ; Vary, Peter
Author_Institution
Commun. & Signal Process. Group, Imperial Coll. London, London, UK
fYear
2010
fDate
14-19 March 2010
Firstpage
4794
Lastpage
4797
Abstract
Artificial bandwidth extension (ABWE) of speech signals aims to estimate wideband speech (50 Hz - 7 kHz) from narrowband signals (300 Hz - 3.4 kHz). Applying the source-filter model of speech, many existing algorithms estimate vocal tract filter parameters independently of the source signal. However, many current methods for extending the narrowband voice source signal are limited to straightforward signal processing techniques which are only effective for high-band estimation. This paper presents a method for ABWE that employs novel data-driven modelling and an existing spectral mirroring technique to estimate the wideband source signal in both the high and low extension bands. A state-of-the-art Hidden Markov Model-based estimator evaluates the temporal and spectral envelopes in the missing frequency bands, with which the ABWE speech signal is synthesized. Informal listening tests comparing two existing source estimation techniques and two permutations of the proposed approach show an improvement in the perceived bandwidth of speech signals, in particular towards low frequencies. Subjective tests on the same data show a preference for the proposed techniques over the existing methods under test.
Keywords
hidden Markov models; speech enhancement; telephony; artificial bandwidth extension; data driven modelling; hidden Markov model; source estimation techniques; spectral mirroring technique; speech signal; telephone speech; voice source estimation; voice source modelling; wideband source signal; Bandwidth; Filters; Frequency estimation; Narrowband; Signal processing algorithms; Speech synthesis; Testing; Wideband
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495149
Filename
5495149