DocumentCode :
2715420
Title :
Spectral Enhancement of Whispered Speech Based on Probability Mass Function
Author :
Sharifzadeh, Hamid Reza ; McLoughlin, Ian Vince ; Ahmadi, Farzaneh
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2010
fDate :
9-15 May 2010
Firstpage :
207
Lastpage :
211
Abstract :
Whispered speech can be effectively used for quiet and private communications over mobile phones and is also the communication means for ENT patients under a regime of voice rest. The reconstruction of natural sounding speech from such whispers can be useful for several types of application across different scientific fields ranging from communications to biomedical engineering. Despite the useful applications for a such technology, the reconstruction of natural speech from whispers has received relatively little research effort to date. This paper presents novel methods for spectral enhancement and formant smoothing with the aim of attaining more natural sounding speech within the reconstruction process. The proposed approach uses a probability mass-density function to identify a reliable formant trajectory through whispers and apply vocal modifications accordingly. Subjective evaluation experiments were performed, and are reported, to assess the performance of the techniques. A method for the near real-time conversion of whispers to normal phonated speech through a modified CELP codec has been discussed in our previously published work which, the proposed formant modification approach in this paper builds upon.
Keywords :
speech enhancement; ENT patients; biomedical engineering; formant smoothing; mobile phones; natural sounding speech; private communications; probability mass-density function; quiet communications; spectral enhancement; whispered speech; Frequency estimation; Mobile communication; Mobile handsets; Natural languages; Smoothing methods; Speech codecs; Speech coding; Speech enhancement; Speech processing; Working environment noise; CELP codec; formant trajectory; linear predictive coding; spectral enhancement; whispered speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Telecommunications (AICT), 2010 Sixth Advanced International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6748-8
Type :
conf
DOI :
10.1109/AICT.2010.47
Filename :
5489846
Link To Document :
بازگشت