Title :
Reconstruction of Normal Speech from Whispered Speech Based on RBF Neural Network
Author :
Tao, Zhi ; Gu, Ji-Hua ; Tan, Xue-Dan ; Xu, Yi-Shen ; Han, Tao ; Zhao, He-Ming
Author_Institution :
Dept. of Phys. Sci. & Tech., Soochow Univ., Suzhou, China
Abstract :
Restriction of normal speech from Chinese whispered speech based on radial basis function neural network (RBF NN) is proposed in this paper. Firstly, capture the nonlinear mapping of spectral envelope between whispered and normal speech by RBF NN; secondly, modify the spectral envelope of the whispered speech by adopting the trained neural network; finally, convert the whispered speech into normal speech by using the linear spectral pairs (LSP) synthesizer. Both subjective and objective assessments are conducted on the converted speech quality. Simulation results show that the score of the Mean Opinion Score (MOS) is 3.2; the distorted distance of bark spectrum is decreased. Both intelligibility and quality of the converted speech are satisfied.
Keywords :
radial basis function networks; signal reconstruction; speech processing; Chinese whispered speech; RBF neural network; bark spectrum; linear spectral pairs; mean opinion score; nonlinear mapping; normal speech reconstruction; radial basis function neural network; spectral envelope; speech quality; Electrons; Frequency; Information security; Information technology; Intelligent networks; Mobile communication; Neural networks; Radial basis function networks; Speech coding; Speech synthesis; radial basis function neural network; voice conversion; whispered speech;
Conference_Titel :
Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on
Conference_Location :
Jinggangshan
Print_ISBN :
978-1-4244-6730-3
Electronic_ISBN :
978-1-4244-6743-3
DOI :
10.1109/IITSI.2010.118