DocumentCode :
1515764
Title :
Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model
Author :
Pulakka, Hannu ; Remes, Ulpu ; Yrttiaho, Santeri ; Palomäki, Kalle ; Kurimo, Mikko ; Alku, Paavo
Author_Institution :
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
Volume :
20
Issue :
8
fYear :
2012
Firstpage :
2219
Lastpage :
2231
Abstract :
The quality of narrowband telephone speech is degraded by the limited audio bandwidth. This paper describes a method that extends the bandwidth of telephone speech to the frequency range 0-300 Hz. The method generates the lowest harmonics of voiced speech using sinusoidal synthesis. The energy in the extension band is estimated from spectral features using a Gaussian mixture model. The amplitudes and phases of the synthesized sinusoidal components are adjusted based on the amplitudes and phases of the narrowband input speech, which provides adaptivity to varying input bandwidth characteristics. The proposed method was evaluated with listening tests in combination with another bandwidth extension method for the frequency range 4-8 kHz. While the low-frequency bandwidth extension was not found to improve perceived quality, the method reduced dissimilarity with wideband speech.
Keywords :
harmonics; speech processing; telephone sets; Gaussian mixture model; bandwidth extension; frequency 4 kHz to 8 kHz; harmonics; limited audio bandwidth; low frequencies; narrowband input speech; narrowband telephone speech; sinusoidal synthesis; voiced speech; wideband speech; Educational institutions; Harmonic analysis; Narrowband; Speech; Speech processing; Wideband; Bandwidth extension; Gaussian mixture model (GMM); listening test; speech enhancement; speech processing;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2199110
Filename :
6198871
Link To Document :
بازگشت