DocumentCode :
2192002
Title :
Reconstruction of Normal Speech from Whispered Speech Based on RBF Neural Network
Author :
Tao, Zhi ; Gu, Ji-Hua ; Tan, Xue-Dan ; Xu, Yi-Shen ; Han, Tao ; Zhao, He-Ming
Author_Institution :
Dept. of Phys. Sci. & Tech., Soochow Univ., Suzhou, China
fYear :
2010
fDate :
2-4 April 2010
Firstpage :
374
Lastpage :
377
Abstract :
Restriction of normal speech from Chinese whispered speech based on radial basis function neural network (RBF NN) is proposed in this paper. Firstly, capture the nonlinear mapping of spectral envelope between whispered and normal speech by RBF NN; secondly, modify the spectral envelope of the whispered speech by adopting the trained neural network; finally, convert the whispered speech into normal speech by using the linear spectral pairs (LSP) synthesizer. Both subjective and objective assessments are conducted on the converted speech quality. Simulation results show that the score of the Mean Opinion Score (MOS) is 3.2; the distorted distance of bark spectrum is decreased. Both intelligibility and quality of the converted speech are satisfied.
Keywords :
radial basis function networks; signal reconstruction; speech processing; Chinese whispered speech; RBF neural network; bark spectrum; linear spectral pairs; mean opinion score; nonlinear mapping; normal speech reconstruction; radial basis function neural network; spectral envelope; speech quality; Electrons; Frequency; Information security; Information technology; Intelligent networks; Mobile communication; Neural networks; Radial basis function networks; Speech coding; Speech synthesis; radial basis function neural network; voice conversion; whispered speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on
Conference_Location :
Jinggangshan
Print_ISBN :
978-1-4244-6730-3
Electronic_ISBN :
978-1-4244-6743-3
Type :
conf
DOI :
10.1109/IITSI.2010.118
Filename :
5453596
Link To Document :
بازگشت