DocumentCode :
3700123
Title :
Speech reconstruction from mel-frequency cepstral coefficients via ?1-norm minimization
Author :
Gang Min; Xiongwei Zhang; Jibin Yang; Xia Zou
Author_Institution :
Lab of Intelligent Information Processing, PLA University of Science and Technology, Qinhuai District, Nanjing, China
fYear :
2015
Firstpage :
1
Lastpage :
5
Abstract :
This paper presents a high quality speech reconstruction method from Mel-frequency cepstral coefficients (MFCC). Due to the sparse characteristic of the power spectrum of speech, the ℓ1-norm minimization method is used to tackle the under-determined nature of the speech reconstruction problem. The phase spectrum is recovered by the well-known LSE-ISTFTM algorithm. Experimental results demonstrate that the quality of the reconstructed speech is dramatically improved than the common ℓ2-norm minimization method, it sounds very close to the original speech when using the high-resolution MFCC, the PESQ score reaches 4.0.
Keywords :
"Speech","Mel frequency cepstral coefficient","Minimization methods","Speech processing","Reconstruction algorithms"
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing (MMSP), 2015 IEEE 17th International Workshop on
Type :
conf
DOI :
10.1109/MMSP.2015.7340799
Filename :
7340799
Link To Document :
بازگشت