DocumentCode :
3527900
Title :
Combining frontend-based memory with MFCC features for Bandwidth Extension of narrowband speech
Author :
Nour-Eldin, Amr H. ; Kabal, Peter
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4001
Lastpage :
4004
Abstract :
In this paper, we continue our previous work on improving Bandwidth Extension (BWE) of narrowband speech. We have shown that including memory into the parametrization frontend (through delta features) results in higher highband certainty irrespective of feature type, with MFCCs exhibiting higher correlation, in general, between both bands, reaching twice that using LSFs. By incorporating memory into the frontend of a conventional LP-based BWE system, we were able to translate the higher correlation due to memory into BWE performance improvement. Using high-resolution inverse DCT, we also achieved high quality speech reconstruction from MFCCs, thus enabling MFCC-based BWE with improved performance compared to conventional static LP-based BWE. We continue this work by incorporating the superior correlation properties of frontend memory into our MFCC-based BWE system. Log-Spectral Distortion as well as the more perceptually-correlated Itakura-based measures show that incorporating memory into our MFCC-based BWE system results in BWE performance superior to that of our dynamic LP-based BWE system.
Keywords :
discrete cosine transforms; distortion; signal reconstruction; speech processing; MFCC features; bandwidth extension; frontend-based memory; high quality speech reconstruction; high-resolution inverse DCT; log-spectral distortion; narrowband speech; perceptually-correlated Itakura-based measures; Acoustic noise; Bandwidth; Cepstral analysis; Frequency estimation; Mel frequency cepstral coefficient; Mutual information; Narrowband; Speech analysis; Wideband; Working environment noise; Bandwidth extension; high-resolution IDCT; highband certainty; memory inclusion; mutual information;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960505
Filename :
4960505
Link To Document :
بازگشت