DocumentCode :
697837
Title :
A statistical framework for artificial bandwidth extension exploiting speech waveform and phonetic transcription
Author :
Bauer, P. ; Fingscheidt, T.
Author_Institution :
Dept. of Signal Process., Tech. Univ. Braunschweig, Braunschweig, Germany
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
1839
Lastpage :
1843
Abstract :
In the past, artificial bandwidth extension (ABWE) has primarily been investigated to enhance transmitted narrowband speech signals at the receiving side. State-of-the-art schemes show improved quality versus narrowband speech; however, a clear gap to wideband speech is still reported. This is largely due to the insufficient ABWE performance on fricatives, particularly /s/. We asked ourselves to what extent the speech quality could be improved, if we knew the currently spoken phoneme. In this paper we present a framework using phonetic transcriptions as a-priori knowledge besides the speech waveform. Possible applications are high-quality offline ABWE of telephone, pilot, or historic speech recordings, memory efficient narrowband speech synthesis followed by ABWE, and extension of narrowband telephone databases to train wideband acoustic models for automatic speech recognition. For the classical conversational telephony application, an improved ABWE scheme is also proposed making use of transcription information only during training.
Keywords :
speech processing; speech recognition; statistical analysis; ABWE; artificial bandwidth extension; automatic speech recognition; narrowband speech signals; narrowband telephone databases; phonetic transcription; speech quality; speech waveform; statistical framework; wideband acoustic models; wideband speech; Abstracts; Legged locomotion; Robustness; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7
Type :
conf
Filename :
7077409
Link To Document :
بازگشت