Title :
Bandwidth extension of telephone speech using frame-based excitation and robust features
Author :
Uysal, Ismail ; Sathyendra, Harsha ; Harris, John G.
Author_Institution :
Comput. NeuroEngineering Lab., Univ. of Florida, Gainesville, FL, USA
Abstract :
The telephone standards in use since the 1950s limit the information bandwidth to 300-3400 Hz. However, the frequency content of normal conversational speech lies mainly between 0 and 8000 Hz. This constraint degrades not only the sound quality but also the intelligibility of the transmitted signal. Rather than modifying the existing telecommunication infrastructure, which would cost billions of dollars, many researchers have been studying more efficient methods to improve the quality of telephone speech. This paper develops an innovative solution to bandwidth extension based on the linear source-filter model, which splits speech into two parts: the excitation and the spectral envelope. Novel approaches are used to extend the frequency information for both parts. The algorithm particularly emphasizes low-frequency reconstruction without neglecting high frequencies. Furthermore, different feature sets for modeling the spectral envelope are employed for better performance under noisy conditions.
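The split into excitation and spectral envelope named in the abstract can be illustrated with ordinary linear prediction. The sketch below is only a minimal illustration under stated assumptions and is not the paper's algorithm: the abstract does not say how the envelope is estimated, so autocorrelation-method LPC is assumed here, and every function name, the model order, and the synthetic test frame are hypothetical choices made for the example. It covers the decomposition only and performs no bandwidth extension.

```python
import numpy as np

def lpc_coefficients(frame, order):
    """Autocorrelation-method LPC (assumed here, not taken from the paper).

    Returns a with a[0] = 1, so that A(z) = 1 + a[1] z^-1 + ... + a[p] z^-p.
    """
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.correlate(frame, frame, mode="full")[n - 1 : n + order]
    # Toeplitz normal equations: R a = -r[1:]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, -r[1 : order + 1])
    return np.concatenate(([1.0], a))

def inverse_filter(frame, a):
    """Excitation (residual): e[n] = sum_k a[k] x[n-k], zero initial state."""
    return np.convolve(frame, a)[: len(frame)]

def synthesis_filter(excitation, a):
    """All-pole filter 1/A(z): x[n] = e[n] - sum_{k>=1} a[k] x[n-k]."""
    x = np.zeros(len(excitation))
    for n in range(len(excitation)):
        acc = excitation[n]
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * x[n - k]
        x[n] = acc
    return x

def spectral_envelope(a, gain, n_points=256):
    """Magnitude of gain / A(e^{jw}) at n_points frequencies in [0, pi)."""
    w = np.linspace(0, np.pi, n_points, endpoint=False)
    k = np.arange(len(a))
    A = np.exp(-1j * np.outer(w, k)) @ a
    return gain / np.abs(A)

# Hypothetical test frame: white noise shaped by a known all-pole filter.
rng = np.random.default_rng(0)
a_true = np.array([1.0, -1.3, 0.6])
frame = synthesis_filter(rng.standard_normal(4000), a_true)

a_est = lpc_coefficients(frame, order=2)        # envelope parameters
excitation = inverse_filter(frame, a_est)       # excitation part
gain = np.sqrt(np.mean(excitation ** 2))        # residual RMS as gain
envelope = spectral_envelope(a_est, gain)       # envelope part
recon = synthesis_filter(excitation, a_est)     # recombine the two parts
```

Passing the excitation back through the all-pole filter recovers the original frame, which is the property that lets a source-filter approach modify the excitation and the envelope separately before recombining them.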
Keywords :
spectral analysis; speech enhancement; speech intelligibility; telephony; bandwidth 300 Hz to 3400 Hz; bandwidth extension; frame-based excitation; frequency information; frequency reconstruction; linear source filter model; robust feature; sound quality; spectral envelope; telephone communication; telephone speech; Feature extraction; Frequency modulation; Hidden Markov models; Maximum likelihood detection; Niobium; Nonlinear filters; Speech
Conference_Title :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1